Skip to main content

Showing 1–50 of 546 results for author: Lee, L

  1. arXiv:2407.13020  [pdf, other

    astro-ph.GA

    A hidden AGN powering bright [O III] nebulae in a protocluster core at $z=4.5$ revealed by JWST

    Authors: M. Solimano, J. González-López, M. Aravena, B. Alcalde Pampliega, R. J. Assef, M. Béthermin, M. Boquien, S. Bovino, C. M. Casey, P. Cassata, E. da Cunha, R. L. Davies, I. De Looze, X. Ding, T. Díaz-Santos, A. L. Faisst, A. Ferrara, D. B. Fisher, N. M. Förster-Schreiber, S. Fujimoto, M. Ginolfi, C. Gruppioni, L. Guaita, N. Hathi, R. Herrera-Camus , et al. (26 additional authors not shown)

    Abstract: We present new JWST/NIRSpec IFU observations of the J1000+0234 system at $z=4.54$, the dense core of a galaxy protocluster hosting a massive, dusty star forming galaxy (DSFG) with a low luminosity radio counterpart. The new data reveals two extended, high equivalent width (EW$_0 > 1000$ Å) nebulae at each side of the DSFG disk along its minor axis (namely O3-N and O3-S). On one hand, O3-N's spectr… ▽ More

    Submitted 17 July, 2024; originally announced July 2024.

    Comments: 5 pages, 4 figures plus 5 appendices (incl. 3 extra figures and one table). Submitted to A&A on July 17th 2024

  2. arXiv:2407.12450  [pdf, other

    physics.acc-ph hep-ex

    Interim report for the International Muon Collider Collaboration (IMCC)

    Authors: C. Accettura, S. Adrian, R. Agarwal, C. Ahdida, C. Aimé, A. Aksoy, G. L. Alberghi, S. Alden, N. Amapane, D. Amorim, P. Andreetto, F. Anulli, R. Appleby, A. Apresyan, P. Asadi, M. Attia Mahmoud, B. Auchmann, J. Back, A. Badea, K. J. Bae, E. J. Bahng, L. Balconi, F. Balli, L. Bandiera, C. Barbagallo , et al. (362 additional authors not shown)

    Abstract: The International Muon Collider Collaboration (IMCC) [1] was established in 2020 following the recommendations of the European Strategy for Particle Physics (ESPP) and the implementation of the European Strategy for Particle Physics-Accelerator R&D Roadmap by the Laboratory Directors Group [2], hereinafter referred to as the the European LDG roadmap. The Muon Collider Study (MuC) covers the accele… ▽ More

    Submitted 17 July, 2024; originally announced July 2024.

    Comments: This document summarises the International Muon Collider Collaboration (IMCC) progress and status of the Muon Collider R&D programme

  3. arXiv:2407.10082  [pdf, ps, other

    math.AG math.DG math.SG

    On the blow-up formula of the Chow weights for polarized toric manifolds

    Authors: King Leung Lee, Naoto Yotsutani

    Abstract: Let $X$ be a smooth projective toric variety and let $\widetilde{X}$ be the blow-up manifold of $X$ at finitely many distinct tours invariants points of $X$. In this paper, we give an explicit combinatorial formula of the Chow weight of $\widetilde{X}$ in terms of the base toric manifold $X$ and the symplectic cuts of the Delzant polytope. We then apply this blow-up formula to the projective plane… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: 23 pages, 3 figures. Comments welcome

    MSC Class: 51M20; 53C55; 14M25

  4. arXiv:2407.00925  [pdf, other

    cs.MM

    SIDQL: An Efficient Keyframe Extraction and Motion Reconstruction Framework in Motion Capture

    Authors: Xuling Zhang, Ziru Zhang, Yuyang Wang, Lik-hang Lee, Pan Hui

    Abstract: Metaverse, which integrates the virtual and physical worlds, has emerged as an innovative paradigm for changing people's lifestyles. Motion capture has become a reliable approach to achieve seamless synchronization of the movements between avatars and human beings, which plays an important role in diverse Metaverse applications. However, due to the continuous growth of data, current communication… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

  5. arXiv:2406.16003  [pdf

    physics.optics

    Unidirectional Chiral Emission via Twisted Bi-layer Metasurfaces

    Authors: Dmitrii Gromyko, Shu An, Sergey Gorelik, Jiahui Xu, Li Jun Lim, Henry Yit Loong Lee, Febiana Tjiptoharsono, Zhi-Kuang Tan, Cheng-Wei Qiu, Zhaogang Dong, Lin Wu

    Abstract: Controlling and channelling light emissions from unpolarized quantum dots into specific directions with chiral polarization remains a key challenge in modern photonics. Stacked metasurface designs offer a potential compact solution for chirality and directionality engineering. However, experimental observations of directional chiral radiation from resonant metasurfaces with quantum emitters remain… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: 16 pages, 4 figures

  6. arXiv:2406.15391  [pdf

    cs.CY cs.AI cs.ET

    Examining the Legal Status of Digital Assets as Property: A Comparative Analysis of Jurisdictional Approaches

    Authors: Luke Lee

    Abstract: This paper examines the complex legal landscape surrounding digital assets, analysing how they are defined and regulated as property across various jurisdictions. As digital assets such as cryptocurrencies and non-fungible tokens (NFTs) increasingly integrate with global economies, their intangible nature presents unique challenges to traditional property law concepts, necessitating a re-evaluatio… ▽ More

    Submitted 26 April, 2024; originally announced June 2024.

    Comments: 16 pages

  7. arXiv:2406.11886  [pdf, other

    cs.LG cs.AI cs.CE q-fin.CP

    Financial Assets Dependency Prediction Utilizing Spatiotemporal Patterns

    Authors: Haoren Zhu, Pengfei Zhao, Wilfred Siu Hung NG, Dik Lun Lee

    Abstract: Financial assets exhibit complex dependency structures, which are crucial for investors to create diversified portfolios to mitigate risk in volatile financial markets. To explore the financial asset dependencies dynamics, we propose a novel approach that models the dependencies of assets as an Asset Dependency Matrix (ADM) and treats the ADM sequences as image sequences. This allows us to leverag… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  8. arXiv:2406.04339  [pdf, other

    cs.CV

    RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation

    Authors: Jiaming Liu, Mengzhen Liu, Zhenyu Wang, Lily Lee, Kaichen Zhou, Pengju An, Senqiao Yang, Renrui Zhang, Yandong Guo, Shanghang Zhang

    Abstract: A fundamental objective in robot manipulation is to enable models to comprehend visual scenes and execute actions. Although existing robot Multimodal Large Language Models (MLLMs) can handle a range of basic tasks, they still face challenges in two areas: 1) inadequate reasoning ability to tackle complex tasks, and 2) high computational costs for MLLM fine-tuning and inference. The recently propos… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  9. arXiv:2406.02408  [pdf, other

    cond-mat.str-el

    Anomalous 4$f$ fine structure in TmSe$_{1-x}$Te$_x$ across the metal-insulator transition

    Authors: C. -H. Min, S. Müller, W. J. Choi, L. Dudy, V. Zabolotny, M. Heber, J. D. Denlinger, C. -J. Kang, M. Kalläne, N. Wind, M. Scholz, T. L. Lee, C. Schlueter, A. Gloskovskii, E. D. L. Rienks, V. Hinkov, H. Bentmann, Y. S. Kwon, F. Reinert, K. Rossnagel

    Abstract: Hybridization between localized 4$f$ and itinerant 5$d$6$s$ states in heavy fermion compounds is a well-studied phenomenon and commonly captured by the paradigmatic Anderson model. However, the investigation of additional electronic interactions, beyond the standard Anderson model, has been limited, despite their predicted important role in the exotic quasiparticle formation in mixed-valence syste… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 5 pages, 4 figures for the main text, 6 pages and 5 figures for the supplementary

  10. arXiv:2405.18047  [pdf, other

    cs.LG cs.AI cs.DC

    2BP: 2-Stage Backpropagation

    Authors: Christopher Rae, Joseph K. L. Lee, James Richings

    Abstract: As Deep Neural Networks (DNNs) grow in size and complexity, they often exceed the memory capacity of a single accelerator, necessitating the sharding of model parameters across multiple accelerators. Pipeline parallelism is a commonly used sharding strategy for training large DNNs. However, current implementations of pipeline parallelism are being unintentionally bottlenecked by the automatic diff… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  11. arXiv:2405.17418  [pdf, other

    cs.CV

    Self-Corrected Multimodal Large Language Model for End-to-End Robot Manipulation

    Authors: Jiaming Liu, Chenxuan Li, Guanqun Wang, Lily Lee, Kaichen Zhou, Sixiang Chen, Chuyan Xiong, Jiaxin Ge, Renrui Zhang, Shanghang Zhang

    Abstract: Robot manipulation policies have shown unsatisfactory action performance when confronted with novel task or object instances. Hence, the capability to automatically detect and self-correct failure action is essential for a practical robotic system. Recently, Multimodal Large Language Models (MLLMs) have shown promise in visual instruction following and demonstrated strong reasoning abilities in va… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  12. arXiv:2405.08586  [pdf, other

    cs.CV

    Cross-Domain Feature Augmentation for Domain Generalization

    Authors: Yingnan Liu, Yingtian Zou, Rui Qiao, Fusheng Liu, Mong Li Lee, Wynne Hsu

    Abstract: Domain generalization aims to develop models that are robust to distribution shifts. Existing methods focus on learning invariance across domains to enhance model robustness, and data augmentation has been widely used to learn invariant predictors, with most methods performing augmentation in the input space. However, augmentation in the input space has limited diversity whereas in the feature spa… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: Accepted to the 33rd International Joint Conference on Artificial Intelligence (IJCAI 2024); Code is available at https://github.com/NancyQuris/XDomainMix

  13. arXiv:2405.06883  [pdf, ps, other

    math.AG math.DG

    Chow stability of $λ$-stable toric varieties

    Authors: King leung Lee, Naoto Yotsutani

    Abstract: For a given polarized toric variety, we define the notion of $λ$-stability which is a natural generalization of uniform K-stability. At the neighbourhoods of the vertices of the corresponding moment polytope $Δ$, we consider appropriate triangulations and give a sufficient criteria for a $λ$-stable polarized toric variety $(X,L)$ to be asymptotically Chow polystable when the obstruction of asympto… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

    Comments: 36pages. Comments are welcome!

    MSC Class: 51M20; 53C55; 14M25

  14. arXiv:2405.00653  [pdf, other

    cond-mat.soft

    Particle scale anisotropy controls bulk properties in sheared granular materials

    Authors: Carmen L. Lee, Ephraim Bililign, Emilien Azéma, Karen E. Daniels

    Abstract: The bulk dynamics of dense granular materials arise through a combination of particle-scale and mesoscale effects. Theoretical and numerical studies have shown that collective effects are created by particle-scale anisotropic structures such as grain connectivity (fabric), force transmission, and frictional mobilization, all of which influence bulk properties like bulk friction and the stress tens… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: 5 pages, 3 figures

  15. arXiv:2404.14687  [pdf, other

    cs.MM cs.AI cs.CL cs.CV

    Pegasus-v1 Technical Report

    Authors: Raehyuk Jung, Hyojun Go, Jaehyuk Yi, Jiho Jang, Daniel Kim, Jay Suh, Aiden Lee, Cooper Han, Jae Lee, Jeff Kim, Jin-Young Kim, Junwan Kim, Kyle Park, Lucas Lee, Mars Ha, Minjoon Seo, Abraham Jo, Ed Park, Hassan Kianinejad, SJ Kim, Tony Moon, Wade Jeong, Andrei Popescu, Esther Kim, EK Yoon , et al. (19 additional authors not shown)

    Abstract: This technical report introduces Pegasus-1, a multimodal language model specialized in video content understanding and interaction through natural language. Pegasus-1 is designed to address the unique challenges posed by video data, such as interpreting spatiotemporal information, to offer nuanced video content comprehension across various lengths. This technical report overviews Pegasus-1's archi… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

  16. arXiv:2404.11898  [pdf

    cs.AI

    Enhancing Financial Inclusion and Regulatory Challenges: A Critical Analysis of Digital Banks and Alternative Lenders Through Digital Platforms, Machine Learning, and Large Language Models Integration

    Authors: Luke Lee

    Abstract: This paper explores the dual impact of digital banks and alternative lenders on financial inclusion and the regulatory challenges posed by their business models. It discusses the integration of digital platforms, machine learning (ML), and Large Language Models (LLMs) in enhancing financial services accessibility for underserved populations. Through a detailed analysis of operational frameworks an… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: 17 pages

  17. arXiv:2404.10536  [pdf, ps, other

    cs.DC

    Benchmarking Machine Learning Applications on Heterogeneous Architecture using Reframe

    Authors: Christopher Rae, Joseph K. L. Lee, James Richings, Michele Weiland

    Abstract: With the rapid increase in machine learning workloads performed on HPC systems, it is beneficial to regularly perform machine learning specific benchmarks to monitor performance and identify issues. Furthermore, as part of the Edinburgh International Data Facility, EPCC currently hosts a wide range of machine learning accelerators including Nvidia GPUs, the Graphcore Bow Pod64 and Cerebras CS-2, w… ▽ More

    Submitted 25 April, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: Author accepted version of paper in the PERMAVOST workshop at the 33rd International Symposium on High-Performance Parallel and Distributed Computing (HPDC 24)

  18. arXiv:2404.06466  [pdf, other

    cs.LG stat.ML

    Hyperparameter Selection in Continual Learning

    Authors: Thomas L. Lee, Sigrid Passano Hellan, Linus Ericsson, Elliot J. Crowley, Amos Storkey

    Abstract: In continual learning (CL) -- where a learner trains on a stream of data -- standard hyperparameter optimisation (HPO) cannot be applied, as a learner does not have access to all of the data at the same time. This has prompted the development of CL-specific HPO frameworks. The most popular way to tune hyperparameters in CL is to repeatedly train over the whole data stream with different hyperparam… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: Preprint, 9 pages

  19. arXiv:2404.03575  [pdf, other

    cs.CV

    DreamScene: 3D Gaussian-based Text-to-3D Scene Generation via Formation Pattern Sampling

    Authors: Haoran Li, Haolin Shi, Wenli Zhang, Wenjun Wu, Yong Liao, Lin Wang, Lik-hang Lee, Pengyuan Zhou

    Abstract: Text-to-3D scene generation holds immense potential for the gaming, film, and architecture sectors. Despite significant progress, existing methods struggle with maintaining high quality, consistency, and editing flexibility. In this paper, we propose DreamScene, a 3D Gaussian-based novel text-to-3D scene generation framework, to tackle the aforementioned three challenges mainly via two strategies.… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

  20. arXiv:2404.00874  [pdf, other

    cs.CV

    DiSR-NeRF: Diffusion-Guided View-Consistent Super-Resolution NeRF

    Authors: Jie Long Lee, Chen Li, Gim Hee Lee

    Abstract: We present DiSR-NeRF, a diffusion-guided framework for view-consistent super-resolution (SR) NeRF. Unlike prior works, we circumvent the requirement for high-resolution (HR) reference images by leveraging existing powerful 2D super-resolution models. Nonetheless, independent SR 2D images are often inconsistent across different views. We thus propose Iterative 3D Synchronization (I3DS) to mitigate… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

  21. arXiv:2403.14090  [pdf

    physics.optics physics.acc-ph physics.ins-det

    Dynamic motion trajectory control with nanoradian accuracy for multi-element X-ray optical systems via laser interferometry

    Authors: Sina M Koehlenbeck, Lance Lee, Mario D Balcazar, Ying Chen, Vincent Esposito, Jerry Hastings, Matthias C Hoffmann, Zhirong Huang, May-Ling Ng, Saxon Price, Takahiro Sato, Matthew Seaberg, Yanwen Sun, Adam White, Lin Zhang, Brian Lantz, Diling Zhu

    Abstract: The past decades have witnessed the development of new X-ray beam sources with brightness growing at a rate surpassing Moore's law. Current and upcoming diffraction limited and fully coherent X-ray beam sources, including multi-bend achromat based synchrotron sources and high repetition rate X-ray free electron lasers, puts increasingly stringent requirements on stability and accuracy of X-ray opt… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

  22. arXiv:2403.11384  [pdf, other

    cs.HC cs.RO

    Towards Massive Interaction with Generalist Robotics: A Systematic Review of XR-enabled Remote Human-Robot Interaction Systems

    Authors: Xian Wang, Luyao Shen, Lik-Hang Lee

    Abstract: The rising interest of generalist robots seek to create robots with versatility to handle multiple tasks in a variety of environments, and human will interact with such robots through immersive interfaces. In the context of human-robot interaction (HRI), this survey provides an exhaustive review of the applications of extended reality (XR) technologies in the field of remote HRI. We developed a sy… ▽ More

    Submitted 26 March, 2024; v1 submitted 17 March, 2024; originally announced March 2024.

  23. arXiv:2403.08295  [pdf, other

    cs.CL cs.AI

    Gemma: Open Models Based on Gemini Research and Technology

    Authors: Gemma Team, Thomas Mesnard, Cassidy Hardin, Robert Dadashi, Surya Bhupatiraju, Shreya Pathak, Laurent Sifre, Morgane Rivière, Mihir Sanjay Kale, Juliette Love, Pouya Tafti, Léonard Hussenot, Pier Giuseppe Sessa, Aakanksha Chowdhery, Adam Roberts, Aditya Barua, Alex Botev, Alex Castro-Ros, Ambrose Slone, Amélie Héliou, Andrea Tacchetti, Anna Bulanova, Antonia Paterson, Beth Tsai, Bobak Shahriari , et al. (83 additional authors not shown)

    Abstract: This work introduces Gemma, a family of lightweight, state-of-the art open models built from the research and technology used to create Gemini models. Gemma models demonstrate strong performance across academic benchmarks for language understanding, reasoning, and safety. We release two sizes of models (2 billion and 7 billion parameters), and provide both pretrained and fine-tuned checkpoints. Ge… ▽ More

    Submitted 16 April, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

  24. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  25. arXiv:2403.05131  [pdf, other

    cs.AI cs.CV

    Sora as an AGI World Model? A Complete Survey on Text-to-Video Generation

    Authors: Joseph Cho, Fachrina Dewi Puspitasari, Sheng Zheng, Jingyao Zheng, Lik-Hang Lee, Tae-Ho Kim, Choong Seon Hong, Chaoning Zhang

    Abstract: The evolution of video generation from text, starting with animating MNIST numbers to simulating the physical world with Sora, has progressed at a breakneck speed over the past seven years. While often seen as a superficial expansion of the predecessor text-to-image generation model, text-to-video generation models are developed upon carefully engineered constituents. Here, we systematically discu… ▽ More

    Submitted 7 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

    Comments: First complete survey on Text-to-Video Generation, 44 pages, 20 figures

  26. arXiv:2403.03379  [pdf, other

    astro-ph.GA

    The ALMA-CRISTAL survey: Extended [CII] emission in an interacting galaxy system at z ~ 5.5

    Authors: A. Posses, M. Aravena, J. González-López, N. M. Förster Schreiber, D. Liu, L. Lee, M. Solimano, T. Díaz-Santos, R. J. Assef, L. Barcos-Muñoz, S. Bovino, R. A. A. Bowler, G. Calistro Rivera, E. da Cunha, R. L. Davies, M. Killi, I. De Looze, A. Ferrara, D. B. Fisher, R. Herrera-Camus, R. Ikeda, T. Lambert, J. Li, D. Lutz, I. Mitsuhashi , et al. (9 additional authors not shown)

    Abstract: The ALMA [CII] Resolved Ism in STar-forming gALaxies (CRISTAL) survey is a Cycle 8 ALMA Large Programme that studies the cold gas component of high-redshift galaxies. Its sub-arcsecond resolution observations are key to disentangling physical mechanisms that shape galaxies during cosmic dawn. In this paper, we explore the morphology and kinematics of the cold gas, star-forming, and stellar compone… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: Submitted to A&A - comments are welcome! - 19 pages, 13 figures

  27. arXiv:2403.03170  [pdf, other

    cs.MM cs.AI cs.CL cs.CV cs.CY

    SNIFFER: Multimodal Large Language Model for Explainable Out-of-Context Misinformation Detection

    Authors: Peng Qi, Zehong Yan, Wynne Hsu, Mong Li Lee

    Abstract: Misinformation is a prevalent societal issue due to its potential high risks. Out-of-context (OOC) misinformation, where authentic images are repurposed with false text, is one of the easiest and most effective ways to mislead audiences. Current methods focus on assessing image-text consistency but lack convincing explanations for their judgments, which is essential for debunking misinformation. W… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: To appear in CVPR 2024

  28. arXiv:2402.13913  [pdf, other

    astro-ph.GA

    An Automated Chemical Exploration of NGC 6334I at 340 au Resolution

    Authors: Samer J. El-Abd, Crystal L. Brogan, Todd R. Hunter, Kin Long Kelvin Lee, Ryan A. Loomis, Brett A. McGuire

    Abstract: Much of the information gleaned from observations of star-forming regions comes from the analysis of their molecular emission spectra, particularly in the radio regime. The time-consuming nature of fitting synthetic spectra to observations interactively for such line-rich sources, however, often results in such analysis being limited to data extracted from a single-dish observation or a handful of… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: 40 pages, 71 figures, accepted for publication in The Astrophysical Journal

  29. arXiv:2402.06642  [pdf, other

    q-fin.ST cs.LG

    From GARCH to Neural Network for Volatility Forecast

    Authors: Pengfei Zhao, Haoren Zhu, Wilfred Siu Hung NG, Dik Lun Lee

    Abstract: Volatility, as a measure of uncertainty, plays a crucial role in numerous financial activities such as risk management. The Econometrics and Machine Learning communities have developed two distinct approaches for financial volatility forecasting: the stochastic approach and the neural network (NN) approach. Despite their individual strengths, these methodologies have conventionally evolved in sepa… ▽ More

    Submitted 29 January, 2024; originally announced February 2024.

    Comments: Accepted by AAAI'24

  30. arXiv:2402.03988  [pdf, other

    eess.AS cs.CL cs.SD

    REBORN: Reinforcement-Learned Boundary Segmentation with Iterative Training for Unsupervised ASR

    Authors: Liang-Hsuan Tseng, En-Pei Hu, Cheng-Han Chiang, Yuan Tseng, Hung-yi Lee, Lin-shan Lee, Shao-Hua Sun

    Abstract: Unsupervised automatic speech recognition (ASR) aims to learn the mapping between the speech signal and its corresponding textual transcription without the supervision of paired speech-text data. A word/phoneme in the speech signal is represented by a segment of speech signal with variable length and unknown boundary, and this segmental structure makes learning the mapping between speech and text… ▽ More

    Submitted 28 May, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

  31. Eigenmode Decomposition Method for Full-Wave Modeling of Microring Resonators

    Authors: Yuriy Akimov, Aswin Alexander Eapen, Shiyang Zhu, Doris K. T. Ng, Nanxi Li, Woon Leng Loh, Lennon Y. T. Lee, Alagappan Gandhi, Aravind P. Anthur

    Abstract: We develop a theoretical predictive model for an all-pass ring resonator that enables the most complete description of linear coupling regimes. The model is based on eigenmode decomposition of Maxwell's equations with full account of the confined and leaky modes, as opposed to the existing phenomenological methods restricted to the confined modes only. This model enables quantitative description o… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: 8 pages, 11 figures

    Journal ref: Physical Review A 109, 043514 (2024)

  32. APT-Pipe: A Prompt-Tuning Tool for Social Data Annotation using ChatGPT

    Authors: Yiming Zhu, Zhizhuo Yin, Gareth Tyson, Ehsan-Ul Haq, Lik-Hang Lee, Pan Hui

    Abstract: Recent research has highlighted the potential of LLM applications, like ChatGPT, for performing label annotation on social computing text. However, it is already well known that performance hinges on the quality of the input prompts. To address this, there has been a flurry of research into prompt tuning -- techniques and guidelines that attempt to improve the quality of prompts. Yet these largely… ▽ More

    Submitted 20 February, 2024; v1 submitted 24 January, 2024; originally announced February 2024.

    Comments: Accepted by WWW 2024; Camera-ready version

  33. arXiv:2401.13463  [pdf, other

    cs.CL cs.IR cs.SD eess.AS

    SpeechDPR: End-to-End Spoken Passage Retrieval for Open-Domain Spoken Question Answering

    Authors: Chyi-Jiunn Lin, Guan-Ting Lin, Yung-Sung Chuang, Wei-Lun Wu, Shang-Wen Li, Abdelrahman Mohamed, Hung-yi Lee, Lin-shan Lee

    Abstract: Spoken Question Answering (SQA) is essential for machines to reply to user's question by finding the answer span within a given spoken passage. SQA has been previously achieved without ASR to avoid recognition errors and Out-of-Vocabulary (OOV) problems. However, the real-world problem of Open-domain SQA (openSQA), in which the machine needs to first retrieve passages that possibly contain the ans… ▽ More

    Submitted 18 March, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

    Comments: Accepted at ICASSP 2024

  34. arXiv:2401.07331  [pdf, other

    cs.CE

    Rapid Estimation of Left Ventricular Contractility with a Physics-Informed Neural Network Inverse Modeling Approach

    Authors: Ehsan Naghavi, Haifeng Wang, Lei Fan, Jenny S. Choy, Ghassan Kassab, Seungik Baek, Lik-Chuan Lee

    Abstract: Physics-based computer models based on numerical solution of the governing equations generally cannot make rapid predictions, which in turn, limits their applications in the clinic. To address this issue, we developed a physics-informed neural network (PINN) model that encodes the physics of a closed-loop blood circulation system embedding a left ventricle (LV). The PINN model is trained to satisf… ▽ More

    Submitted 14 January, 2024; originally announced January 2024.

  35. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  36. arXiv:2312.09799  [pdf, other

    eess.IV cs.AI cs.CV

    IQNet: Image Quality Assessment Guided Just Noticeable Difference Prefiltering For Versatile Video Coding

    Authors: Yu-Han Sun, Chiang Lo-Hsuan Lee, Tian-Sheuan Chang

    Abstract: Image prefiltering with just noticeable distortion (JND) improves coding efficiency in a visual lossless way by filtering the perceptually redundant information prior to compression. However, real JND cannot be well modeled with inaccurate masking equations in traditional approaches or image-level subject tests in deep learning approaches. Thus, this paper proposes a fine-grained JND prefiltering… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

  37. arXiv:2312.08153  [pdf, other

    physics.comp-ph cs.LG

    $ρ$-Diffusion: A diffusion-based density estimation framework for computational physics

    Authors: Maxwell X. Cai, Kin Long Kelvin Lee

    Abstract: In physics, density $ρ(\cdot)$ is a fundamentally important scalar function to model, since it describes a scalar field or a probability density function that governs a physical process. Modeling $ρ(\cdot)$ typically scales poorly with parameter space, however, and quickly becomes prohibitively difficult and computationally expensive. One promising avenue to bypass this is to leverage the capabili… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

    Comments: 6 pages, 2 figures, accepted for publication at the NeurIPS 2023 workshop "Machine Learning and the Physical Sciences"

  38. arXiv:2311.17671  [pdf, other

    astro-ph.GA

    The ALMA-CRISTAL survey: Widespread dust-obscured star formation in typical star-forming galaxies at z=4-6

    Authors: Ikki Mitsuhashi, Ken-ichi Tadaki, Ryota Ikeda, Rodrigo Herrera-Camus, Manuel Aravena, Ilse De Looze, Natascha M. Förster Schreiber, Jorge González-López, Justin Spilker, Roberto J. Assef, Rychard Bouwens, Loreto Barcos-Munoz, Jack Birkin, Rebecca A. A. Bowler, Gabriela Calistro Rivera, Rebecca Davies, Elisabete Da Cunha, Tanio Díaz-Santos, Andrea Ferrara, Deanne Fisher, Lilian L. Lee, Juno Li, Dieter Lutz, Monica Relaño, Thorsten Naab , et al. (7 additional authors not shown)

    Abstract: We present the morphological parameters and global properties of dust-obscured star formation in typical star-forming galaxies at z=4-6. Among 26 galaxies composed of 20 galaxies observed by the Cycle-8 ALMA Large Program, CRISTAL, and six galaxies from archival data, we have individually detected rest-frame 158$μ$m dust continuum emission from 19 galaxies, nine of which are reported for the first… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

  39. arXiv:2311.15452  [pdf, other

    cond-mat.soft physics.flu-dyn

    Buckling instability in a chain of sticky bubbles

    Authors: Carmen L. Lee, Kari Dalnoki-Veress

    Abstract: A slender object undergoing an axial compression will buckle to alleviate the stress. Typically the morphology of the deformed object depends on the bending stiffness for solids, or the viscoelastic properties for liquid threads. We study a chain of uniform sticky air bubbles that rise due to buoyancy through an aqueous bath. A buckling instability of the bubble chain with a characteristic wavelen… ▽ More

    Submitted 30 May, 2024; v1 submitted 26 November, 2023; originally announced November 2023.

    Comments: 7 pages, 5 figures

  40. arXiv:2311.15294  [pdf, other

    cs.SI cs.CY

    A Study of Partisan News Sharing in the Russian invasion of Ukraine

    Authors: Yiming Zhu, Ehsan-Ul Haq, Gareth Tyson, Lik-Hang Lee, Yuyang Wang, Pan Hui

    Abstract: Since the Russian invasion of Ukraine, a large volume of biased and partisan news has been spread via social media platforms. As this may lead to wider societal issues, we argue that understanding how partisan news sharing impacts users' communication is crucial for better governance of online communities. In this paper, we perform a measurement study of partisan news sharing. We aim to characteri… ▽ More

    Submitted 26 November, 2023; originally announced November 2023.

  41. arXiv:2311.03210  [pdf, other

    cs.DC

    Quantum Task Offloading with the OpenMP API

    Authors: Joseph K. L. Lee, Oliver T. Brown, Mark Bull, Martin Ruefenacht, Johannes Doerfert, Michael Klemm, Martin Schulz

    Abstract: Most of the widely used quantum programming languages and libraries are not designed for the tightly coupled nature of hybrid quantum-classical algorithms, which run on quantum resources that are integrated on-premise with classical HPC infrastructure. We propose a programming model using the API provided by OpenMP to target quantum devices, which provides an easy-to-use and efficient interface fo… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: Poster extended abstract for Supercomputing 2023 (SC23)

  42. A Collaborative Filtering-Based Two Stage Model with Item Dependency for Course Recommendation

    Authors: Eric L. Lee, Tsung-Ting Kuo, Shou-De Lin

    Abstract: Recommender systems have been studied for decades with numerous promising models been proposed. Among them, Collaborative Filtering (CF) models are arguably the most successful one due to its high accuracy in recommendation and elimination of privacy-concerned personal meta-data from training. This paper extends the usage of CF-based model to the task of course recommendation. We point out several… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: 8 pages, 2 figures, 2017 IEEE International Conference on Data Science and Advanced Analytics (DSAA)

    Journal ref: In 2017 IEEE DSAA, pp. 496-503. IEEE, 2017

  43. arXiv:2310.09129  [pdf, other

    cs.LG stat.ML

    Computing Marginal and Conditional Divergences between Decomposable Models with Applications

    Authors: Loong Kuan Lee, Geoffrey I. Webb, Daniel F. Schmidt, Nico Piatkowski

    Abstract: The ability to compute the exact divergence between two high-dimensional distributions is useful in many applications but doing so naively is intractable. Computing the alpha-beta divergence -- a family of divergences that includes the Kullback-Leibler divergence and Hellinger distance -- between the joint distribution of two decomposable models, i.e chordal Markov networks, can be done in time ex… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

    Comments: 10 pages, 8 figures, Accepted at the IEEE International Conference on Data Mining (ICDM) 2023

  44. arXiv:2310.08864  [pdf, other

    cs.RO

    Open X-Embodiment: Robotic Learning Datasets and RT-X Models

    Authors: Open X-Embodiment Collaboration, Abby O'Neill, Abdul Rehman, Abhinav Gupta, Abhiram Maddukuri, Abhishek Gupta, Abhishek Padalkar, Abraham Lee, Acorn Pooley, Agrim Gupta, Ajay Mandlekar, Ajinkya Jain, Albert Tung, Alex Bewley, Alex Herzog, Alex Irpan, Alexander Khazatsky, Anant Rai, Anchit Gupta, Andrew Wang, Andrey Kolobov, Anikait Singh, Animesh Garg, Aniruddha Kembhavi, Annie Xie , et al. (267 additional authors not shown)

    Abstract: Large, high-capacity models trained on diverse datasets have shown remarkable successes on efficiently tackling downstream applications. In domains from NLP to Computer Vision, this has led to a consolidation of pretrained models, with general pretrained backbones serving as a starting point for many applications. Can such a consolidation happen in robotics? Conventionally, robotic learning method… ▽ More

    Submitted 1 June, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: Project website: https://robotics-transformer-x.github.io

  45. arXiv:2310.08738  [pdf, other

    cs.LG q-bio.GN

    Splicing Up Your Predictions with RNA Contrastive Learning

    Authors: Philip Fradkin, Ruian Shi, Bo Wang, Brendan Frey, Leo J. Lee

    Abstract: In the face of rapidly accumulating genomic data, our understanding of the RNA regulatory code remains incomplete. Recent self-supervised methods in other domains have demonstrated the ability to learn rules underlying the data-generating process such as sentence structure in language. Inspired by this, we extend contrastive learning techniques to genomic data by utilizing functional similarities… ▽ More

    Submitted 17 October, 2023; v1 submitted 12 October, 2023; originally announced October 2023.

  46. arXiv:2310.07864  [pdf, other

    cond-mat.mtrl-sci physics.comp-ph

    Towards Foundation Models for Materials Science: The Open MatSci ML Toolkit

    Authors: Kin Long Kelvin Lee, Carmelo Gonzales, Matthew Spellings, Mikhail Galkin, Santiago Miret, Nalini Kumar

    Abstract: Artificial intelligence and machine learning have shown great promise in their ability to accelerate novel materials discovery. As researchers and domain scientists seek to unify and consolidate chemical knowledge, the case for models with potential to generalize across different tasks within materials science - so-called "foundation models" - grows with ambitions. This manuscript reviews our rece… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: 17 pages, 7 figures, 1 table. Accepted paper/presentation at the AI4Science workshop at Super Computing '23

  47. arXiv:2310.02206  [pdf, other

    cs.LG stat.ML

    Chunking: Continual Learning is not just about Distribution Shift

    Authors: Thomas L. Lee, Amos Storkey

    Abstract: Work on continual learning (CL) has thus far largely focused on the problems arising from shifts in the data distribution. However, CL can be decomposed into two sub-problems: (a) shifts in the data distribution, and (b) dealing with the fact that the data is split into chunks and so only a part of the data is available to be trained on at any point in time. In this work, we look at the latter sub… ▽ More

    Submitted 11 July, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: Published at the 3rd Conference on Lifelong Learning Agents (CoLLAs), 2024

  48. arXiv:2309.14449  [pdf, other

    astro-ph.GA

    Explaining the Chemical Inventory of Orion KL through Machine Learning

    Authors: Haley N. Scolati, Anthony J. Remijan, Eric Herbst, Brett A. McGuire, Kin Long Kelvin Lee

    Abstract: The interplay of the chemistry and physics that exists within astrochemically relevant sources can only be fully appreciated if we can gain a holistic understanding of their chemical inventories. Previous work by Lee et al. (2021) demonstrated the capabilities of simple regression models to reproduce the abundances of the chemical inventory of the Taurus Molecular Cloud 1 (TMC-1), as well as provi… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

    Comments: 14 pages; 6 figures, 1 table in the main text. 0 figures, 1 table in the appendix. Accepted for publication in The Astrophysical Journal. Molecular dataset for machine learning can be found in the Zenodo repository here: https://zenodo.org/record/7675609

  49. arXiv:2309.12460  [pdf

    cs.LG cs.AI cs.CE cs.CL cs.CV

    Multimodal Deep Learning for Scientific Imaging Interpretation

    Authors: Abdulelah S. Alshehri, Franklin L. Lee, Shihu Wang

    Abstract: In the domain of scientific imaging, interpreting visual data often demands an intricate combination of human expertise and deep comprehension of the subject materials. This study presents a novel methodology to linguistically emulate and subsequently evaluate human-like interactions with Scanning Electron Microscopy (SEM) images, specifically of glass materials. Leveraging a multimodal deep learn… ▽ More

    Submitted 25 September, 2023; v1 submitted 21 September, 2023; originally announced September 2023.

    Report number: NTR208745

  50. arXiv:2309.11816  [pdf, other

    cs.HC

    Designing Loving-Kindness Meditation in Virtual Reality for Long-Distance Romantic Relationships

    Authors: Xian Wang, Xiaoyu Mo, Lik-Hang Lee, Xiaoying Wei, Xiaofu Jin, Mingming Fan, Pan Hui

    Abstract: Loving-kindness meditation (LKM) is used in clinical psychology for couples' relationship therapy, but physical isolation can make the relationship more strained and inaccessible to LKM. Virtual reality (VR) can provide immersive LKM activities for long-distance couples. However, no suitable commercial VR applications for couples exist to engage in LKM activities of long-distance. This paper organ… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.