Showing 1–50 of 211 results for author: Wong, N

  1. arXiv:2407.13623  [pdf, other]

    cs.CL cs.AI

    Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies

    Authors: Chaofan Tao, Qian Liu, Longxu Dou, Niklas Muennighoff, Zhongwei Wan, Ping Luo, Min Lin, Ngai Wong

    Abstract: Research on scaling large language models (LLMs) has primarily focused on model parameters and training data size, overlooking the role of vocabulary size. Intuitively, larger vocabularies enable more efficient tokenization by representing sentences with fewer tokens, but they also increase the risk of under-fitting representations for rare tokens. We investigate how vocabulary size impacts LLM…

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: 11 pages

  2. arXiv:2407.10154  [pdf, ps, other]

    math.OA math.FA

    Ternary rings of operators and their linking von Neumann algebras

    Authors: Liguang Wang, Ngai-Ching Wong

    Abstract: In this short note, we show that a von Neumann algebra can be written as the linking von Neumann algebra of a $W^\ast$-ternary ring of operators ($W^\ast$-TRO, in short), if and only if, it contains no abelian direct summand. We also provide some new characterizations for nuclear TROs and $W^\ast$-exact TROs.

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: 5 pages

    MSC Class: 46L10; 46L50

  3. arXiv:2407.10150  [pdf, ps, other]

    math.OA math.FA

    Operational 2-local automorphisms/derivations

    Authors: Liguang Wang, Ngai-Ching Wong

    Abstract: Let $φ: A\to A$ be a (not necessarily linear, additive or continuous) map of a standard operator algebra. Suppose for any $a,b\in A$ there is an algebra automorphism $θ_{a,b}$ of $A$ such that $φ(a)φ(b) = θ_{a,b}(ab)$. We show that either $φ$ or $-φ$ is a linear Jordan homomorphism. Similar results are obtained when any of the following conditions is satisfied: …

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: 10 pages; to appear in J. Nonlinear and Convex Analysis

    MSC Class: 46L10; 46L50

  4. arXiv:2406.13572  [pdf, other]

    quant-ph

    Entanglement source and quantum memory analysis for zero added-loss multiplexing

    Authors: Jeffrey H. Shapiro, Michael G. Raymer, Clark Embleton, Franco N. C. Wong, Brian J. Smith

    Abstract: High-rate, high-fidelity entanglement distribution is essential to the creation of a quantum internet, but recent achievements in fiber and satellite-based entanglement distribution fall far short of what is needed. Chen et al. [Phys. Rev. Appl. 19, 054209 (2023)] proposed a means for dramatically increasing entanglement-distribution rates via zero added-loss multiplexing (ZALM). ZALM's quantum tr…

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 26 pages, 15 figures, 1 table

  5. arXiv:2406.11909  [pdf, other]

    cs.LG cs.AI

    Mixture-of-Subspaces in Low-Rank Adaptation

    Authors: Taiqiang Wu, Jiahao Wang, Zhe Zhao, Ngai Wong

    Abstract: In this paper, we introduce a subspace-inspired Low-Rank Adaptation (LoRA) method, which is computationally efficient, easy to implement, and readily applicable to large language, multimodal, and diffusion models. Initially, we equivalently decompose the weights of LoRA into two subspaces, and find that simply mixing them can enhance performance. To study such a phenomenon, we revisit it through a…

    Submitted 5 July, 2024; v1 submitted 16 June, 2024; originally announced June 2024.

    Comments: work in progress

  6. arXiv:2406.03063  [pdf, other]

    quant-ph

    In-operando microwave scattering-parameter calibrated measurement of a Josephson travelling wave parametric amplifier

    Authors: S. H. Shin, M. Stanley, W. N. Wong, T. Sweetnam, A. Elarabi, T. Lindström, N. M. Ridler, S. E. de Graaf

    Abstract: Superconducting travelling wave parametric amplifiers (TWPAs) are broadband near-quantum limited microwave amplifiers commonly used for qubit readout and a wide range of other applications in quantum technologies. The performance of these amplifiers depends on achieving impedance matching to minimise reflected signals. Here we apply a microwave calibration technique to extract the S-parameters of…

    Submitted 5 June, 2024; originally announced June 2024.

  7. arXiv:2405.12398  [pdf, other]

    cs.LG

    ASMR: Activation-sharing Multi-resolution Coordinate Networks For Efficient Inference

    Authors: Jason Chun Lok Li, Steven Tin Sui Luo, Le Xu, Ngai Wong

    Abstract: Coordinate network or implicit neural representation (INR) is a fast-emerging method for encoding natural signals (such as images and videos) with the benefits of a compact neural representation. While numerous methods have been proposed to increase the encoding capabilities of an INR, an often overlooked aspect is the inference efficiency, usually measured in multiply-accumulate (MAC) count. This…

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: ICLR 2024 (v3: 21 pages, 11 figures, Project Page: https://github.com/stevolopolis/asmr.git)

  8. arXiv:2405.10531  [pdf, other]

    cs.LG cs.CV

    Nonparametric Teaching of Implicit Neural Representations

    Authors: Chen Zhang, Steven Tin Sui Luo, Jason Chun Lok Li, Yik-Chung Wu, Ngai Wong

    Abstract: We investigate the learning of implicit neural representation (INR) using an overparameterized multilayer perceptron (MLP) via a novel nonparametric teaching perspective. The latter offers an efficient example selection framework for teaching nonparametrically defined (viz. non-closed-form) target functions, such as image functions defined by 2D grids of pixels. To address the costly training of I…

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: ICML 2024 (24 pages, 13 figures)

  9. arXiv:2405.08804  [pdf, other]

    astro-ph.HE gr-qc hep-th

    Photon Ring Interferometric Signatures Beyond The Universal Regime

    Authors: He Jia, Eliot Quataert, Alexandru Lupsasca, George N. Wong

    Abstract: We calculate the interferometric signatures of black hole photon rings beyond the universal regime by perturbatively including the effects of finite ring width. Our approach first slices a thick ring into a series of thin rings, each of which falls within the universal regime. We thus calculate the visibility of the thick ring by aggregating the contributions from each thin ring, and then perturba…

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: 10+6 pages, 7+3 figures, to be submitted

  10. arXiv:2405.05573  [pdf, other]

    cs.CV cs.CR

    Poisoning-based Backdoor Attacks for Arbitrary Target Label with Positive Triggers

    Authors: Binxiao Huang, Jason Chun Lok, Chang Liu, Ngai Wong

    Abstract: Poisoning-based backdoor attacks expose vulnerabilities in the data preparation stage of deep neural network (DNN) training. The DNNs trained on the poisoned dataset will be embedded with a backdoor, making them behave well on clean data while outputting malicious predictions whenever a trigger is applied. To exploit the abundant information contained in the input data to output label mapping, our…

    Submitted 9 May, 2024; originally announced May 2024.

  11. arXiv:2405.02356  [pdf, other]

    cs.LG cs.AI

    Stochastic Multivariate Universal-Radix Finite-State Machine: a Theoretically and Practically Elegant Nonlinear Function Approximator

    Authors: Xincheng Feng, Guodong Shen, Jianhao Hu, Meng Li, Ngai Wong

    Abstract: Nonlinearities are crucial for capturing complex input-output relationships, especially in deep neural networks. However, nonlinear functions often incur various hardware and compute overheads. Meanwhile, stochastic computing (SC) has emerged as a promising approach to tackle this challenge by trading output precision for hardware simplicity. To this end, this paper proposes a first-of-its-kind sto…

    Submitted 2 May, 2024; originally announced May 2024.

  12. arXiv:2404.10179  [pdf, other]

    cs.RO cs.AI cs.HC cs.LG

    Scaling Instructable Agents Across Many Simulated Worlds

    Authors: SIMA Team, Maria Abi Raad, Arun Ahuja, Catarina Barros, Frederic Besse, Andrew Bolt, Adrian Bolton, Bethanie Brownfield, Gavin Buttimore, Max Cant, Sarah Chakera, Stephanie C. Y. Chan, Jeff Clune, Adrian Collister, Vikki Copeman, Alex Cullum, Ishita Dasgupta, Dario de Cesare, Julia Di Trapani, Yani Donchev, Emma Dunleavy, Martin Engelcke, Ryan Faulkner, Frankie Garcia, Charles Gbadamosi, et al. (68 additional authors not shown)

    Abstract: Building embodied AI systems that can follow arbitrary language instructions in any 3D environment is a key challenge for creating general AI. Accomplishing this goal requires learning to ground language in perception and embodied actions, in order to accomplish complex tasks. The Scalable, Instructable, Multiworld Agent (SIMA) project tackles this by training agents to follow free-form instructio…

    Submitted 17 April, 2024; v1 submitted 13 March, 2024; originally announced April 2024.

  13. arXiv:2404.02657  [pdf, other]

    cs.CL cs.AI

    Rethinking Kullback-Leibler Divergence in Knowledge Distillation for Large Language Models

    Authors: Taiqiang Wu, Chaofan Tao, Jiahao Wang, Zhe Zhao, Ngai Wong

    Abstract: Kullback-Leibler divergence has been widely used in Knowledge Distillation (KD) to compress Large Language Models (LLMs). Contrary to prior assertions that reverse Kullback-Leibler (RKL) divergence is mode-seeking and thus preferable over the mean-seeking forward Kullback-Leibler (FKL) divergence, this study empirically and theoretically demonstrates that neither mode-seeking nor mean-seeking prope…

    Submitted 16 June, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

    Comments: work in progress

  14. arXiv:2403.19238  [pdf, other]

    cs.CV cs.AI eess.IV

    Taming Lookup Tables for Efficient Image Retouching

    Authors: Sidi Yang, Binxiao Huang, Mingdeng Cao, Yatai Ji, Hanzhong Guo, Ngai Wong, Yujiu Yang

    Abstract: The widespread use of high-definition screens in edge devices, such as end-user cameras, smartphones, and televisions, is spurring a significant demand for image enhancement. Existing enhancement models often optimize for high performance while falling short of reducing hardware inference time and power consumption, especially on edge devices with constrained computing and storage resources. To th…

    Submitted 13 July, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

    Comments: Accepted by ECCV 2024

  15. arXiv:2402.14866  [pdf, other]

    cs.LG cs.AI cs.CL

    APTQ: Attention-aware Post-Training Mixed-Precision Quantization for Large Language Models

    Authors: Ziyi Guan, Hantao Huang, Yupeng Su, Hong Huang, Ngai Wong, Hao Yu

    Abstract: Large Language Models (LLMs) have greatly advanced the natural language processing paradigm. However, the high computational load and huge model sizes pose a grand challenge for deployment on edge devices. To this end, we propose APTQ (Attention-aware Post-Training Mixed-Precision Quantization) for LLMs, which considers not only the second-order information of each layer's weights, but also, for t…

    Submitted 15 April, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: 6 pages, 2 figures, published at DAC 2024: 61st IEEE/ACM Design Automation Conference (DAC'24)

  16. arXiv:2402.11417  [pdf, other]

    cs.CL cs.AI cs.LG

    LoRETTA: Low-Rank Economic Tensor-Train Adaptation for Ultra-Low-Parameter Fine-Tuning of Large Language Models

    Authors: Yifan Yang, Jiajun Zhou, Ngai Wong, Zheng Zhang

    Abstract: Various parameter-efficient fine-tuning (PEFT) techniques have been proposed to enable computationally efficient fine-tuning while maintaining model performance. However, existing PEFT methods are still limited by the growing number of trainable parameters with the rapid deployment of Large Language Models (LLMs). To address this challenge, we present LoRETTA, an ultra-parameter-efficient framewor…

    Submitted 17 February, 2024; originally announced February 2024.

  17. arXiv:2402.00927  [pdf, other]

    astro-ph.HE astro-ph.GA

    Ordered magnetic fields around the 3C 84 central black hole

    Authors: G. F. Paraschos, J.-Y. Kim, M. Wielgus, J. Röder, T. P. Krichbaum, E. Ros, I. Agudo, I. Myserlis, M. Moscibrodzka, E. Traianou, J. A. Zensus, L. Blackburn, C.-K. Chan, S. Issaoun, M. Janssen, M. D. Johnson, V. L. Fish, K. Akiyama, A. Alberdi, W. Alef, J. C. Algaba, R. Anantua, K. Asada, R. Azulay, U. Bach, et al. (258 additional authors not shown)

    Abstract: 3C 84 is a nearby radio source with a complex total intensity structure, showing linear polarisation and spectral patterns. A detailed investigation of the central engine region necessitates the use of VLBI above the hitherto available maximum frequency of 86 GHz. Using ultrahigh resolution VLBI observations at the highest available frequency of 228 GHz, we aim to directly detect compact structures a…

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: 15 pages, 6 figures, published in A&A

    Journal ref: Issue: A&A Volume 682, February 2024; Article number: L3; Number of pages: 15

  18. arXiv:2312.17018  [pdf, other]

    cs.CV cs.LG

    Learning Spatially Collaged Fourier Bases for Implicit Neural Representation

    Authors: Jason Chun Lok Li, Chang Liu, Binxiao Huang, Ngai Wong

    Abstract: Existing approaches to Implicit Neural Representation (INR) can be interpreted as a global scene representation via a linear combination of Fourier bases of different frequencies. However, such universal basis functions can limit the representation capability in local regions where a specific component is unnecessary, resulting in unpleasant artifacts. To this end, we introduce a learnable spatial…

    Submitted 28 December, 2023; originally announced December 2023.

    Comments: 11 pages, 13 figures, Accepted at the 38th AAAI Conference on Artificial Intelligence (AAAI-24)

  19. arXiv:2312.16172  [pdf, other]

    astro-ph.HE

    Balanced Turbulence and the Helicity Barrier in Black Hole Accretion

    Authors: George N. Wong, Lev Arzamasskiy

    Abstract: Horizon-scale observations from the Event Horizon Telescope (EHT) have enabled precision study of supermassive black hole accretion. Contemporary accretion modeling often treats the inflowing plasma as a single, thermal fluid, but microphysical kinetic effects can lead to significant deviations from this idealized picture. We investigate how the helicity barrier influences EHT-accessible electroma…

    Submitted 26 December, 2023; originally announced December 2023.

    Comments: 19 pages, 10 figures, accepted for publication in ApJ

  20. arXiv:2312.09922  [pdf, other]

    cs.CV cs.AI

    A Unifying Tensor View for Lightweight CNNs

    Authors: Jason Chun Lok Li, Rui Lin, Jiajun Zhou, Edmund Yin Mun Lam, Ngai Wong

    Abstract: Although the decomposition of convolutional kernels for lightweight CNNs is well studied, existing works that rely on tensor network diagrams or hyperdimensional abstraction lack geometric intuition. This work devises a new perspective by linking a 3D-reshaped kernel tensor to its various slice-wise and rank-1 decompositions, permitting a straightforward connection between various tensor approxim…

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: 4 pages, 3 figures, accepted in 2023 IEEE 15th International Conference on ASIC (ASICON 2023)

  21. arXiv:2312.06101  [pdf, other]

    eess.IV cs.CV

    Hundred-Kilobyte Lookup Tables for Efficient Single-Image Super-Resolution

    Authors: Binxiao Huang, Jason Chun Lok Li, Jie Ran, Boyu Li, Jiajun Zhou, Dahai Yu, Ngai Wong

    Abstract: Conventional super-resolution (SR) schemes make heavy use of convolutional neural networks (CNNs), which involve intensive multiply-accumulate (MAC) operations, and require specialized hardware such as graphics processing units. This contradicts the regime of edge AI that often runs on devices strained by power, computing, and storage resources. Such a challenge has motivated a series of lookup ta…

    Submitted 8 May, 2024; v1 submitted 10 December, 2023; originally announced December 2023.

  22. arXiv:2311.08125  [pdf, other]

    cs.LG

    Lite it fly: An All-Deformable-Butterfly Network

    Authors: Rui Lin, Jason Chun Lok Li, Jiajun Zhou, Binxiao Huang, Jie Ran, Ngai Wong

    Abstract: Most deep neural networks (DNNs) consist fundamentally of convolutional and/or fully connected layers, wherein the linear transform can be cast as the product between a filter matrix and a data matrix obtained by arranging feature tensors into columns. The recently proposed deformable butterfly (DeBut) decomposes the filter matrix into generalized, butterfly-like factors, thus achieving network compr…

    Submitted 14 November, 2023; originally announced November 2023.

    Comments: 7 pages, 3 figures, accepted as a brief paper in IEEE Transactions on Neural Networks and Learning Systems (TNNLS)

  23. arXiv:2311.04388  [pdf, other]

    astro-ph.HE gr-qc hep-th physics.flu-dyn physics.plasm-ph

    The $230$ GHz Variability of Numerical Models of Sagittarius A* I. Parameter Surveys on Varying Ion-electron Temperature Ratios Under Strongly Magnetized Conditions

    Authors: Ho-Sang Chan, Chi-kwan Chan, Ben S. Prather, George N. Wong, Charles Gammie

    Abstract: The $230$ GHz lightcurves of Sagittarius A* (Sgr A*) predicted by general relativistic magnetohydrodynamics (GRMHD) and ray-tracing (GRRT) models in Event Horizon Telescope Collaboration et al. (2022) have higher variability $M_{ΔT}$ compared to observations. In this series of papers, we explore the origin of such large brightness variability. In this first paper, we performed large GRRT parameter…

    Submitted 4 February, 2024; v1 submitted 7 November, 2023; originally announced November 2023.

    Comments: 15 Pages, 9 Figures

  24. arXiv:2309.05234  [pdf]

    quant-ph

    High-dimensional time-frequency entanglement in a singly-filtered biphoton frequency comb

    Authors: Xiang Cheng, Kai-Chi Chang, Murat Can Sarihan, Andrew Mueller, Maria Spiropulu, Matthew D. Shaw, Boris Korzh, Andrei Faraon, Franco N. C. Wong, Jeffrey H. Shapiro, Chee Wei Wong

    Abstract: High-dimensional quantum entanglement is a cornerstone for advanced technology enabling large-scale noise-tolerant quantum systems, fault-tolerant quantum computing, and distributed quantum networks. The recently developed biphoton frequency comb (BFC) provides a powerful platform for high-dimensional quantum information processing in its spectral and temporal quantum modes. Here we propose and ge…

    Submitted 11 September, 2023; v1 submitted 11 September, 2023; originally announced September 2023.

    Comments: 30 pages, 4 figures

  25. arXiv:2308.15381  [pdf, other]

    astro-ph.HE astro-ph.IM

    A search for pulsars around Sgr A* in the first Event Horizon Telescope dataset

    Authors: Pablo Torne, Kuo Liu, Ralph P. Eatough, Jompoj Wongphechauxsorn, James M. Cordes, Gregory Desvignes, Mariafelicia De Laurentis, Michael Kramer, Scott M. Ransom, Shami Chatterjee, Robert Wharton, Ramesh Karuppusamy, Lindy Blackburn, Michael Janssen, Chi-kwan Chan, Geoffrey B. Crew, Lynn D. Matthews, Ciriaco Goddi, Helge Rottmann, Jan Wagner, Salvador Sanchez, Ignacio Ruiz, Federico Abbate, Geoffrey C. Bower, Juan J. Salamanca, et al. (261 additional authors not shown)

    Abstract: The Event Horizon Telescope (EHT) observed in 2017 the supermassive black hole at the center of the Milky Way, Sagittarius A* (Sgr A*), at a frequency of 228.1 GHz ($λ$=1.3 mm). The fundamental physics tests that even a single pulsar orbiting Sgr A* would enable motivate searching for pulsars in EHT datasets. The high observing frequency means that pulsars - which typically exhibit steep emission…

    Submitted 29 August, 2023; originally announced August 2023.

    Comments: 33 pages, 7 figures, 6 Tables. Accepted for publication in ApJ

  26. arXiv:2307.06372  [pdf, other]

    astro-ph.HE gr-qc

    Black Hole Polarimetry I: A Signature of Electromagnetic Energy Extraction

    Authors: Andrew Chael, Alexandru Lupsasca, George N. Wong, Eliot Quataert

    Abstract: In 1977, Blandford and Znajek showed that the electromagnetic field surrounding a rotating black hole can harvest its spin energy and use it to power a collimated astrophysical jet, such as the one launched from the center of the elliptical galaxy M87. Today, interferometric observations with the Event Horizon Telescope (EHT) are delivering high-resolution, event-horizon-scale, polarimetric images…

    Submitted 14 November, 2023; v1 submitted 12 July, 2023; originally announced July 2023.

    Comments: 35 pages, 5 figures. Published in ApJ

  27. arXiv:2307.05293  [pdf, other]

    astro-ph.HE astro-ph.IM

    Demonstrating Photon Ring Existence with Single-Baseline Polarimetry

    Authors: Daniel C. M. Palumbo, George N. Wong, Andrew A. Chael, Michael D. Johnson

    Abstract: Images of supermassive black hole accretion flows contain features of both curved spacetime and plasma structure. Inferring properties of the spacetime from images requires modeling the plasma properties, and vice versa. The Event Horizon Telescope Collaboration has imaged near-horizon millimeter emission from both Messier 87* (M87*) and Sagittarius A* (Sgr A*) with very-long-baseline interferomet…

    Submitted 11 July, 2023; originally announced July 2023.

    Comments: 14 pages, 7 figures, Accepted to ApJL

  28. arXiv:2306.14262  [pdf, other]

    cs.CV

    A Spectral Perspective towards Understanding and Improving Adversarial Robustness

    Authors: Binxiao Huang, Rui Lin, Chaofan Tao, Ngai Wong

    Abstract: Deep neural networks (DNNs) are incredibly vulnerable to crafted, imperceptible adversarial perturbations. While adversarial training (AT) has proven to be an effective defense approach, the AT mechanism for robustness improvement is not fully understood. This work investigates AT from a spectral perspective, adding new insights to the design of effective defenses. In particular, we show that AT i…

    Submitted 25 June, 2023; originally announced June 2023.

  29. arXiv:2306.14099  [pdf]

    physics.ins-det physics.optics quant-ph

    High-precision and low-latency widefield diamond quantum sensing with neuromorphic vision sensors

    Authors: Zhiyuan Du, Madhav Gupta, Feng Xu, Kai Zhang, Jiahua Zhang, Yan Zhou, Yiyao Liu, Zhenyu Wang, Jorg Wrachtrup, Ngai Wong, Can Li, Zhiqin Chu

    Abstract: During the past decade, interest has grown significantly in developing ultrasensitive widefield diamond magnetometry for various applications. Despite attempts to improve the adoption of conventional frame-based sensors, achieving high temporal resolution and sensitivity simultaneously remains a key challenge. This is largely due to the transfer and processing of massive amounts of sensor data to…

    Submitted 24 June, 2023; originally announced June 2023.

    Comments: 21 pages, 4 figures

  30. arXiv:2306.12824  [pdf, ps, other]

    math.FA

    Weighted composition operators preserving various Lipschitz constants

    Authors: Ching-Jou Liao, Chih-Neng Liu, Jung-Hui Liu, Ngai-Ching Wong

    Abstract: Let $\mathrm{Lip}(X)$, $\mathrm{Lip}^b(X)$, $\mathrm{Lip}^{\mathrm{loc}}(X)$ and $\mathrm{Lip}^\mathrm{pt}(X)$ be the vector spaces of Lipschitz, bounded Lipschitz, locally Lipschitz and pointwise Lipschitz (real-valued) functions defined on a metric space $(X, d_X)$, respectively. We show that if a weighted composition operator $Tf=h\cdot f\circ \varphi$ defines a bijection between such vec…

    Submitted 22 June, 2023; originally announced June 2023.

    Comments: to appear in "Annals of Mathematical Sciences and Applications"

    MSC Class: 46B04; 51F30; 26A16

  31. arXiv:2306.11149  [pdf, other]

    eess.SP

    Overcoming Beam Squint in Dual-Wideband mmWave MIMO Channel Estimation: A Bayesian Multi-Band Sparsity Approach

    Authors: Le Xu, Lei Cheng, Ngai Wong, Yik-Chung Wu, H. Vincent Poor

    Abstract: The beam squint effect, which manifests in different steering matrices in different sub-bands, has been widely considered a challenge in millimeter wave (mmWave) multi-input multi-output (MIMO) channel estimation. Existing methods either require specific forms of the precoding/combining matrix, which restrict their general practicality, or simply ignore the beam squint effect by only making use of…

    Submitted 19 June, 2023; originally announced June 2023.

  32. arXiv:2306.11123  [pdf, other]

    eess.SP cs.CV

    To Fold or Not to Fold: Graph Regularized Tensor Train for Visual Data Completion

    Authors: Le Xu, Lei Cheng, Ngai Wong, Yik-Chung Wu

    Abstract: Tensor train (TT) representation has achieved tremendous success in visual data completion tasks, especially when it is combined with tensor folding. However, folding an image or video tensor breaks the original data structure, leading to local information loss as nearby pixels may be assigned into different dimensions and become far away from each other. In this paper, to fully preserve the local…

    Submitted 19 June, 2023; originally announced June 2023.

  33. arXiv:2305.15365  [pdf, other]

    cs.CV

    Boundary Attention Mapping (BAM): Fine-grained saliency maps for segmentation of Burn Injuries

    Authors: Mahla Abdolahnejad, Justin Lee, Hannah Chan, Alex Morzycki, Olivier Ethier, Anthea Mo, Peter X. Liu, Joshua N. Wong, Colin Hong, Rakesh Joshi

    Abstract: Burn injuries can result from mechanisms such as thermal, chemical, and electrical insults. A prompt and accurate assessment of burns is essential for deciding definitive clinical treatments. Currently, the primary approach for burn assessments, via visual and tactile observations, is approximately 60%-80% accurate. The gold standard is biopsy and a close second would be non-invasive methods like…

    Submitted 24 May, 2023; originally announced May 2023.

  34. A chip-scale polarization-spatial-momentum quantum SWAP gate in silicon nanophotonics

    Authors: Xiang Cheng, Kai-Chi Chang, Zhenda Xie, Murat Can Sarihan, Yoo Seung Lee, Yongnan Li, XinAn Xu, Abhinav Kumar Vinod, Serdar Kocaman, Mingbin Yu, Patrick Guo-Qiang Lo, Dim-Lee Kwong, Jeffrey H. Shapiro, Franco N. C. Wong, Chee Wei Wong

    Abstract: Recent progress in quantum computing and networking enables high-performance large-scale quantum processors by connecting different quantum modules. Optical quantum systems show advantages in both computing and communications, and integrated quantum photonics further increases the level of scaling and complexity. Here we demonstrate an efficient SWAP gate that deterministically swaps a photon's po…

    Submitted 16 May, 2023; originally announced May 2023.

    Comments: 25 pages, 4 figures

    Journal ref: Nat. Photon. 17, 656-665 (2023)

  35. arXiv:2305.09098  [pdf, other]

    cs.CL cs.LG

    Weight-Inherited Distillation for Task-Agnostic BERT Compression

    Authors: Taiqiang Wu, Cheng Hou, Shanshan Lao, Jiayi Li, Ngai Wong, Zhe Zhao, Yujiu Yang

    Abstract: Knowledge Distillation (KD) is a predominant approach for BERT compression. Previous KD-based methods focus on designing extra alignment losses for the student model to mimic the behavior of the teacher model. These methods transfer the knowledge in an indirect way. In this paper, we propose a novel Weight-Inherited Distillation (WID), which directly transfers knowledge from the teacher. WID does…

    Submitted 20 March, 2024; v1 submitted 15 May, 2023; originally announced May 2023.

    Comments: 9 pages, 4 figures, NAACL 2024 Findings

  36. arXiv:2304.03804  [pdf, other]

    astro-ph.HE gr-qc

    Mahakala: a Python-based Modular Ray-tracing and Radiative Transfer Algorithm for Curved Space-times

    Authors: Aniket Sharma, Lia Medeiros, Chi-kwan Chan, Goni Halevi, Patrick D. Mullen, James M. Stone, George N. Wong

    Abstract: We introduce Mahakala, a Python-based, modular, radiative ray-tracing code for curved space-times. We employ Google's JAX framework for accelerated automatic differentiation, which can efficiently compute Christoffel symbols directly from the metric, allowing the user to easily and quickly simulate photon trajectories through non-Kerr metrics. JAX also enables Mahakala to run in parallel on both C…

    Submitted 7 April, 2023; originally announced April 2023.

    Comments: 15 pages, 11 figures

  37. arXiv:2303.15522  [pdf, other]

    astro-ph.HE

    $κ$monty: a Monte Carlo Compton Scattering code including non-thermal electrons

    Authors: Jordy Davelaar, Benjamin R. Ryan, George N. Wong, Thomas Bronzwaer, Hector Olivares, Monika Mościbrodzka, Charles F. Gammie, Heino Falcke

    Abstract: Low-luminosity active galactic nuclei are strong sources of X-ray emission produced by Compton scattering originating from the accretion flows surrounding their supermassive black holes. The shape and energy of the resulting spectrum depend on the shape of the underlying electron distribution function (DF). In this work, we present an extended version of the grmonty code, called $κ$monty. The grmo…

    Submitted 2 October, 2023; v1 submitted 27 March, 2023; originally announced March 2023.

    Comments: 12 pages, 9 figures, accepted for publication in MNRAS

  38. arXiv:2303.14893  [pdf, other]

    cs.CV

    Context-Aware Transformer for 3D Point Cloud Automatic Annotation

    Authors: Xiaoyan Qian, Chang Liu, Xiaojuan Qi, Siew-Chong Tan, Edmund Lam, Ngai Wong

    Abstract: 3D automatic annotation has received increased attention since manually annotating 3D point clouds is laborious. However, existing methods are usually complicated, e.g., pipelined training for 3D foreground/background segmentation, cylindrical object proposals, and point completion. Furthermore, they often overlook the inter-object feature relation that is particularly informative to hard samples…

    Submitted 26 March, 2023; originally announced March 2023.

  39. arXiv:2303.13763  [pdf, other]

    cs.LG cs.AI

    Edge-free but Structure-aware: Prototype-Guided Knowledge Distillation from GNNs to MLPs

    Authors: Taiqiang Wu, Zhe Zhao, Jiahao Wang, Xingyu Bai, Lei Wang, Ngai Wong, Yujiu Yang

    Abstract: Distilling high-accuracy Graph Neural Networks (GNNs) to low-latency multilayer perceptrons (MLPs) on graph tasks has become a hot research topic. However, MLPs rely exclusively on the node features and fail to capture the graph structural information. Previous methods address this issue by processing graph edges into extra inputs for MLPs, but such graph structures may be unavailable for various…

    Submitted 27 March, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

    Comments: 8 pages, 4 figures, 9 tables

  40. Comparison of Polarized Radiative Transfer Codes used by the EHT Collaboration

    Authors: Ben S. Prather, Jason Dexter, Monika Moscibrodzka, Hung-Yi Pu, Thomas Bronzwaer, Jordy Davelaar, Ziri Younsi, Charles F. Gammie, Roman Gold, George N. Wong, Kazunori Akiyama, Antxon Alberdi, Walter Alef, Juan Carlos Algaba, Richard Anantua, Keiichi Asada, Rebecca Azulay, Uwe Bach, Anne-Kathrin Baczko, David Ball, Mislav Baloković, John Barrett, Michi Bauböck, Bradford A. Benson, Dan Bintley, et al. (248 additional authors not shown)

    Abstract: Interpretation of resolved polarized images of black holes by the Event Horizon Telescope (EHT) requires predictions of the polarized emission observable by an Earth-based instrument for a particular model of the black hole accretion system. Such predictions are generated by general relativistic radiative transfer (GRRT) codes, which integrate the equations of polarized radiative transfer in curve…

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: Accepted for publication in ApJ

  41. DyBit: Dynamic Bit-Precision Numbers for Efficient Quantized Neural Network Inference

    Authors: Jiajun Zhou, Jiajun Wu, Yizhao Gao, Yuhao Ding, Chaofan Tao, Boyu Li, Fengbin Tu, Kwang-Ting Cheng, Hayden Kwok-Hay So, Ngai Wong

    Abstract: To accelerate the inference of deep neural networks (DNNs), quantization with low-bitwidth numbers is actively researched. A prominent challenge is to quantize the DNN models into low-bitwidth numbers without significant accuracy degradation, especially at very low bitwidths (< 8 bits). This work targets an adaptive data representation with variable-length encoding called DyBit. DyBit can dynamica…

    Submitted 24 February, 2023; originally announced February 2023.

  42. arXiv:2302.11170  [pdf, ps, other]

    math.FA

    Linear maps preserving matrices annihilated by a fixed polynomial

    Authors: Chi-Kwong Li, Ming-Cheng Tsai, Ya-Shu Wang, Ngai-Ching Wong

    Abstract: Let ${\bf M}_n(\mathbb{F})$ be the algebra of $n\times n$ matrices over an arbitrary field $\mathbb{F}$. We consider linear maps $Φ: {\bf M}_n(\mathbb{F}) \rightarrow {\bf M}_r(\mathbb{F})$ preserving matrices annihilated by a fixed polynomial $f(x) = (x-a_1)\cdots (x-a_m)$ with $m\ge 2$ distinct zeroes $a_1, a_2, \ldots, a_m \in \mathbb{F}$; namely,…

    Submitted 22 February, 2023; originally announced February 2023.

  43. arXiv:2212.12732  [pdf, other]

    cs.CV

    Frequency Regularization for Improving Adversarial Robustness

    Authors: Binxiao Huang, Chaofan Tao, Rui Lin, Ngai Wong

    Abstract: Deep neural networks are incredibly vulnerable to crafted, human-imperceptible adversarial perturbations. Although adversarial training (AT) has proven to be an effective defense approach, we find that the AT-trained models heavily rely on the input low-frequency content for judgment, accounting for the low standard accuracy. To close the large gap between the standard and robust accuracies during…

    Submitted 24 December, 2022; originally announced December 2022.

    Comments: accepted by AAAI 2023 workshop

  44. arXiv:2212.04852  [pdf, other]

    astro-ph.HE astro-ph.GA astro-ph.IM

    Using Machine Learning to Link Black Hole Accretion Flows with Spatially Resolved Polarimetric Observables

    Authors: Richard Qiu, Angelo Ricarte, Ramesh Narayan, George N. Wong, Andrew Chael, Daniel Palumbo

    Abstract: We introduce a new library of 535,194 model images of the supermassive black holes and Event Horizon Telescope (EHT) targets Sgr A* and M87*, computed by performing general relativistic radiative transfer calculations on general relativistic magnetohydrodynamics simulations. Then, to infer underlying black hole and accretion flow parameters (spin, inclination, ion-to-electron temperature ratio, an…

    Submitted 9 February, 2023; v1 submitted 9 December, 2022; originally announced December 2022.

    Comments: 24 pages, 27 figures

  45. arXiv:2211.11602  [pdf, other]

    cs.LG cs.HC cs.MA

    Improving Multimodal Interactive Agents with Reinforcement Learning from Human Feedback

    Authors: Josh Abramson, Arun Ahuja, Federico Carnevale, Petko Georgiev, Alex Goldin, Alden Hung, Jessica Landon, Jirka Lhotka, Timothy Lillicrap, Alistair Muldal, George Powell, Adam Santoro, Guy Scully, Sanjana Srivastava, Tamara von Glehn, Greg Wayne, Nathaniel Wong, Chen Yan, Rui Zhu

    Abstract: An important goal in artificial intelligence is to create agents that can both interact naturally with humans and learn from their feedback. Here we demonstrate how to use reinforcement learning from human feedback (RLHF) to improve upon simulated, embodied agents trained to a base level of competency with imitation learning. First, we collected data of humans interacting with agents in a simulate…

    Submitted 21 November, 2022; originally announced November 2022.

  46. arXiv:2211.06541  [pdf, other]

    astro-ph.HE astro-ph.GA

    Emission Modeling in the EHT-ngEHT Age

    Authors: Richard Anantua, Joaquín Dúran, Nathan Ngata, Lani Oramas, Razieh Emami, Angelo Ricarte, Brandon Curd, Jan Röder, Avery Broderick, Jeremy Wayland, George N. Wong, Sean Ressler

    Abstract: This work proposes a methodology to test phenomenologically-motivated emission processes that account for the flux and polarization distribution and global structure of the 230 GHz sources imaged by the Event Horizon Telescope (EHT): Messier (M)87* and Sagittarius (Sgr) A*. We introduce to general relativistic magnetohydrodynamic (GRMHD) simulations some novel models to bridge the largely uncertai…

    Submitted 11 November, 2022; originally announced November 2022.

    Comments: 21 pages, 9 figures

  47. arXiv:2210.08701  [pdf, other]

    cs.LG cs.CV

    ODG-Q: Robust Quantization via Online Domain Generalization

    Authors: Chaofan Tao, Ngai Wong

    Abstract: Quantizing neural networks to low-bitwidth is important for model deployment on resource-limited edge hardware. Although a quantized network has a smaller model size and memory footprint, it is fragile to adversarial attacks. However, few methods study the robustness and training efficiency of quantized networks. To this end, we propose a new method by recasting robust quantization as an online do…

    Submitted 16 October, 2022; originally announced October 2022.

  48. arXiv:2210.01218  [pdf, other]

    astro-ph.GA astro-ph.CO astro-ph.HE

    Unraveling Twisty Linear Polarization Morphologies in Black Hole Images

    Authors: Razieh Emami, Angelo Ricarte, George N. Wong, Daniel Palumbo, Dominic Chang, Sheperd S. Doeleman, Avery Broaderick, Ramesh Narayan, Maciek Wielgus, Lindy Blackburn, Ben S. Prather, Andrew A. Chael, Richard Anantua, Koushik Chatterjee, Ivan Marti-Vidal, Jose L. Gomez, Kazunori Akiyama, Matthew Liska, Lars Hernquist, Grant Tremblay, Mark Vogelsberger, Charles Alcock, Randall Smith, James Steiner, Paul Tiede, et al. (1 additional author not shown)

    Abstract: We investigate general relativistic magnetohydrodynamic simulations (GRMHD) to determine the physical origin of the twisty patterns of linear polarization seen in spatially resolved black hole images and explain their morphological dependence on black hole spin. By characterising the observed emission with a simple analytic ring model, we find that the twisty morphology is determined by the magnet…

    Submitted 28 March, 2023; v1 submitted 3 October, 2022; originally announced October 2022.

    Comments: 26 pages, 20 figures, accepted for publication in ApJ

  49. arXiv:2209.14412  [pdf]

    cond-mat.mes-hall

    Persistent Enhancement of Exciton Diffusivity in CsPbBr3 Nanocrystal Solids

    Authors: Wenbi Shcherbakov-Wu, Seryio Saris, Thomas Sheehan, Narumi Nagaya Wong, Eric R. Powers, Franziska Krieg, Maksym V. Kovalenko, Adam P. Willard, William A. Tisdale

    Abstract: In semiconductors, exciton or charge carrier diffusivity is typically described as an inherent material property. Here, we show that the transport of excitons (i.e., bound electron-hole pairs) in CsPbBr3 perovskite nanocrystals (NCs) depends markedly on how recently those NCs were occupied by a previous exciton. Using fluence- and repetition-rate-dependent transient photoluminescence microscopy, w…

    Submitted 28 September, 2022; originally announced September 2022.

    Comments: 45 pages, 16 figures

  50. arXiv:2208.13571  [pdf, other]

    cs.LG cs.AI

    PECAN: A Product-Quantized Content Addressable Memory Network

    Authors: Jie Ran, Rui Lin, Jason Chun Lok Li, Jiajun Zhou, Ngai Wong

    Abstract: A novel deep neural network (DNN) architecture is proposed wherein the filtering and linear transform are realized solely with product quantization (PQ). This results in a natural implementation via content addressable memory (CAM), which transcends regular DNN layer operations and requires only simple table lookup. Two schemes are developed for the end-to-end PQ prototype training, namely, throug…

    Submitted 13 August, 2022; originally announced August 2022.