Skip to main content

Showing 1–24 of 24 results for author: Law, K

  1. arXiv:2406.13578  [pdf, other

    cs.CL

    Enhancing Distractor Generation for Multiple-Choice Questions with Retrieval Augmented Pretraining and Knowledge Graph Integration

    Authors: Han-Cheng Yu, Yu-An Shih, Kin-Man Law, Kai-Yu Hsieh, Yu-Chen Cheng, Hsin-Chih Ho, Zih-An Lin, Wen-Chuan Hsu, Yao-Chung Fan

    Abstract: In this paper, we tackle the task of distractor generation (DG) for multiple-choice questions. Our study introduces two key designs. First, we propose \textit{retrieval augmented pretraining}, which involves refining the language model pretraining to align it more closely with the downstream task of DG. Second, we explore the integration of knowledge graphs to enhance the performance of DG. Throug… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: Findings at ACL 2024

  2. arXiv:2402.06173  [pdf, other

    stat.ML cs.LG stat.CO

    SMC Is All You Need: Parallel Strong Scaling

    Authors: Xinzhu Liang, Joseph M. Lukens, Sanjaya Lohani, Brian T. Kirby, Thomas A. Searles, Kody J. H. Law

    Abstract: The Bayesian posterior distribution can only be evaluated up-to a constant of proportionality, which makes simulation and consistent estimation challenging. Classical consistent Bayesian methods such as sequential Monte Carlo (SMC) and Markov chain Monte Carlo (MCMC) have unbounded time complexity requirements. We develop a fully parallel sequential Monte Carlo (pSMC) method which provably deliver… ▽ More

    Submitted 2 June, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: 23 pages, 17 figures

  3. arXiv:2402.02111  [pdf, other

    stat.ML cs.LG math.OC math.PR stat.CO stat.ME

    Accelerating Look-ahead in Bayesian Optimization: Multilevel Monte Carlo is All you Need

    Authors: Shangda Yang, Vitaly Zankin, Maximilian Balandat, Stefan Scherer, Kevin Carlberg, Neil Walton, Kody J. H. Law

    Abstract: We leverage multilevel Monte Carlo (MLMC) to improve the performance of multi-step look-ahead Bayesian optimization (BO) methods that involve nested expectations and maximizations. Often these expectations must be computed by Monte Carlo (MC). The complexity rate of naive MC degrades for nested operations, whereas MLMC is capable of achieving the canonical MC convergence rate for this type of prob… ▽ More

    Submitted 25 June, 2024; v1 submitted 3 February, 2024; originally announced February 2024.

    Comments: Preprint ICML 2024

  4. arXiv:2302.02506  [pdf

    cs.LG cs.AI

    Generating Dispatching Rules for the Interrupting Swap-Allowed Blocking Job Shop Problem Using Graph Neural Network and Reinforcement Learning

    Authors: Vivian W. H. Wong, Sang Hun Kim, Junyoung Park, Jinkyoo Park, Kincho H. Law

    Abstract: The interrupting swap-allowed blocking job shop problem (ISBJSSP) is a complex scheduling problem that is able to model many manufacturing planning and logistics applications realistically by addressing both the lack of storage capacity and unforeseen production interruptions. Subjected to random disruptions due to machine malfunction or maintenance, industry production settings often choose to ad… ▽ More

    Submitted 28 September, 2023; v1 submitted 5 February, 2023; originally announced February 2023.

    Comments: 14 pages, 10 figures. Supplementary Material not included

  5. arXiv:2208.12830  [pdf, other

    stat.ML cs.LG stat.CO

    Mixtures of Gaussian Process Experts with SMC$^2$

    Authors: Teemu Härkönen, Sara Wade, Kody Law, Lassi Roininen

    Abstract: Gaussian processes are a key component of many flexible statistical and machine learning models. However, they exhibit cubic computational complexity and high memory constraints due to the need of inverting and storing a full covariance matrix. To circumvent this, mixtures of Gaussian process experts have been considered where data points are assigned to independent experts, reducing the complexit… ▽ More

    Submitted 26 August, 2022; originally announced August 2022.

  6. arXiv:2208.07243  [pdf, other

    stat.ML cs.LG math.OC

    Exponential Concentration in Stochastic Approximation

    Authors: Kody Law, Neil Walton, Shangda Yang

    Abstract: We analyze the behavior of stochastic approximation algorithms where iterates, in expectation, progress towards an objective at each step. When progress is proportional to the step size of the algorithm, we prove exponential concentration bounds. These tail-bounds contrast asymptotic normality results, which are more frequently associated with stochastic approximation. The methods that we develop… ▽ More

    Submitted 24 March, 2024; v1 submitted 15 August, 2022; originally announced August 2022.

    Comments: 35 pages, 11 Figures

  7. Efficient Burst Raw Denoising with Variance Stabilization and Multi-frequency Denoising Network

    Authors: Dasong Li, Yi Zhang, Ka Lung Law, Xiaogang Wang, Hongwei Qin, Hongsheng Li

    Abstract: With the growing popularity of smartphones, capturing high-quality images is of vital importance to smartphones. The cameras of smartphones have small apertures and small sensor cells, which lead to the noisy images in low light environment. Denoising based on a burst of multiple frames generally outperforms single frame denoising but with the larger compututional cost. In this paper, we propose a… ▽ More

    Submitted 10 May, 2022; originally announced May 2022.

    Comments: Accepted for publication in International Journal of Computer Vision

    Journal ref: IJCV 2022

  8. arXiv:2203.13718  [pdf, other

    cs.CV cond-mat.mtrl-sci physics.comp-ph

    Digital Fingerprinting of Microstructures

    Authors: Michael D. White, Alexander Tarakanov, Christopher P. Race, Philip J. Withers, Kody J. H. Law

    Abstract: Finding efficient means of fingerprinting microstructural information is a critical step towards harnessing data-centric machine learning approaches. A statistical framework is systematically developed for compressed characterisation of a population of images, which includes some classical computer vision methods as special cases. The focus is on materials microstructure. The ultimate purpose is t… ▽ More

    Submitted 22 January, 2024; v1 submitted 25 March, 2022; originally announced March 2022.

  9. arXiv:2111.14358  [pdf, other

    cs.CV

    IDR: Self-Supervised Image Denoising via Iterative Data Refinement

    Authors: Yi Zhang, Dasong Li, Ka Lung Law, Xiaogang Wang, Hongwei Qin, Hongsheng Li

    Abstract: The lack of large-scale noisy-clean image pairs restricts supervised denoising methods' deployment in actual applications. While existing unsupervised methods are able to learn image denoising without ground-truth clean images, they either show poor performance or work under impractical settings (e.g., paired noisy images). In this paper, we present a practical unsupervised image denoising method… ▽ More

    Submitted 22 March, 2022; v1 submitted 29 November, 2021; originally announced November 2021.

    Comments: CVPR2022; code & dataset: https://github.com/zhangyi-3/IDR

  10. arXiv:2104.05237  [pdf, other

    cs.CV eess.IV

    Neural Camera Simulators

    Authors: Hao Ouyang, Zifan Shi, Chenyang Lei, Ka Lung Law, Qifeng Chen

    Abstract: We present a controllable camera simulator based on deep neural networks to synthesize raw image data under different camera settings, including exposure time, ISO, and aperture. The proposed simulator includes an exposure module that utilizes the principle of modern lens designs for correcting the luminance level. It also contains a noise module using the noise level function and an aperture modu… ▽ More

    Submitted 9 August, 2021; v1 submitted 12 April, 2021; originally announced April 2021.

    Comments: Accepted to CVPR2021

  11. arXiv:2101.08993  [pdf

    eess.IV cs.CV

    Automatic Volumetric Segmentation of Additive Manufacturing Defects with 3D U-Net

    Authors: Vivian Wen Hui Wong, Max Ferguson, Kincho H. Law, Yung-Tsun Tina Lee, Paul Witherell

    Abstract: Segmentation of additive manufacturing (AM) defects in X-ray Computed Tomography (XCT) images is challenging, due to the poor contrast, small sizes and variation in appearance of defects. Automatic segmentation can, however, provide quality control for additive manufacturing. Over recent years, three-dimensional convolutional neural networks (3D CNNs) have performed well in the volumetric segmenta… ▽ More

    Submitted 22 January, 2021; originally announced January 2021.

    Comments: Accepted by AAAI 2020 Spring Symposia

    Journal ref: AAAI 2020 Spring Symposia, Stanford, CA, USA, Mar 23-25, 2020

  12. arXiv:2101.05808  [pdf, other

    cond-mat.mtrl-sci cs.LG stat.AP

    Materials Fingerprinting Classification

    Authors: Adam Spannaus, Kody J. H. Law, Piotr Luszczek, Farzana Nasrin, Cassie Putman Micucci, Peter K. Liaw, Louis J. Santodonato, David J. Keffer, Vasileios Maroulas

    Abstract: Significant progress in many classes of materials could be made with the availability of experimentally-derived large datasets composed of atomic identities and three-dimensional coordinates. Methods for visualizing the local atomic structure, such as atom probe tomography (APT), which routinely generate datasets comprised of millions of atoms, are an important step in realizing this goal. However… ▽ More

    Submitted 14 January, 2021; originally announced January 2021.

  13. Fast Deep Mixtures of Gaussian Process Experts

    Authors: Clement Etienam, Kody Law, Sara Wade, Vitaly Zankin

    Abstract: Mixtures of experts have become an indispensable tool for flexible modelling in a supervised learning context, allowing not only the mean function but the entire density of the output to change with the inputs. Sparse Gaussian processes (GP) have shown promise as a leading candidate for the experts in such models, and in this article, we propose to design the gating network for selecting the exper… ▽ More

    Submitted 30 November, 2023; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: 22 pages, 28 figures, to be published in Machine Learning journal

    Journal ref: Machine Learning (2024)

  14. arXiv:1912.12197  [pdf, ps, other

    eess.SP cs.IT cs.LG

    Experimental Demonstration of Learned Time-Domain Digital Back-Propagation

    Authors: Eric Sillekens, Wenting Yi, Daniel Semrau, Alessandro Ottino, Boris Karanov, Sujie Zhou, Kevin Law, Jack Chen, Domanic Lavery, Lidia Galdino, Polina Bayvel, Robert I. Killey

    Abstract: We present the first experimental demonstration of learned time-domain digital back-propagation (DBP), in 64-GBd dual-polarization 64-QAM signal transmission over 1014 km. Performance gains were comparable to those obtained with conventional, higher complexity, frequency-domain DBP.

    Submitted 23 December, 2019; originally announced December 2019.

  15. Building Information Modeling and Classification by Visual Learning At A City Scale

    Authors: Qian Yu, Chaofeng Wang, Barbaros Cetiner, Stella X. Yu, Frank Mckenna, Ertugrul Taciroglu, Kincho H. Law

    Abstract: In this paper, we provide two case studies to demonstrate how artificial intelligence can empower civil engineering. In the first case, a machine learning-assisted framework, BRAILS, is proposed for city-scale building information modeling. Building information modeling (BIM) is an efficient way of describing buildings, which is essential to architecture, engineering, and construction. Our propose… ▽ More

    Submitted 20 July, 2020; v1 submitted 14 October, 2019; originally announced October 2019.

    Comments: 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada

  16. arXiv:1905.06220  [pdf, other

    cs.LG stat.ML

    Cluster, Classify, Regress: A General Method For Learning Discountinous Functions

    Authors: David E. Bernholdt, Mark R. Cianciosa, Clement Etienam, David L. Green, Kody J. H. Law, J. M. Park

    Abstract: This paper presents a method for solving the supervised learning problem in which the output is highly nonlinear and discontinuous. It is proposed to solve this problem in three stages: (i) cluster the pairs of input-output data points, resulting in a label for each point; (ii) classify the data, where the corresponding label is the output; and finally (iii) perform one separate regression for eac… ▽ More

    Submitted 16 May, 2019; v1 submitted 15 May, 2019; originally announced May 2019.

    Comments: 12 files,6 figures

  17. arXiv:1903.03989  [pdf

    stat.ML cs.CV cs.LG

    Uncertainty Propagation in Deep Neural Network Using Active Subspace

    Authors: Weiqi Ji, Zhuyin Ren, Chung K. Law

    Abstract: The inputs of deep neural network (DNN) from real-world data usually come with uncertainties. Yet, it is challenging to propagate the uncertainty in the input features to the DNN predictions at a low computational cost. This work employs a gradient-based subspace method and response surface technique to accelerate the uncertainty propagation in DNN. Specifically, the active subspace method is empl… ▽ More

    Submitted 11 January, 2020; v1 submitted 10 March, 2019; originally announced March 2019.

    Comments: Add link to github repo

  18. arXiv:1808.02518  [pdf, other

    cs.CV

    Detection and Segmentation of Manufacturing Defects with Convolutional Neural Networks and Transfer Learning

    Authors: Max Ferguson, Ronay Ak, Yung-Tsun Tina Lee, Kincho H. Law

    Abstract: Quality control is a fundamental component of many manufacturing processes, especially those involving casting or welding. However, manual quality control procedures are often time-consuming and error-prone. In order to meet the growing demand for high-quality products, the use of intelligent visual inspection systems is becoming essential in production lines. Recently, Convolutional Neural Networ… ▽ More

    Submitted 2 September, 2018; v1 submitted 7 August, 2018; originally announced August 2018.

  19. arXiv:1805.08551  [pdf

    eess.SY cs.RO

    Robust Model Predictive Control for Autonomous Vehicles/Self Driving Cars

    Authors: Che Kun Law, Darshit Dalal, Stephen Shearrow

    Abstract: A robust Model Predictive Control (MPC) approach for controlling front steering of an autonomous vehicle is presented in this paper. We present various approaches to increase the robustness of model predictive control by using weight tuning, a successive on-line linearization of a nonlinear vehicle model to track position error and successive on-line linearization to track velocity error. Results… ▽ More

    Submitted 22 May, 2018; originally announced May 2018.

    Comments: 12 pages,9 figures

  20. arXiv:1802.04520   

    cs.AI

    Learning Robust and Adaptive Real-World Continuous Control Using Simulation and Transfer Learning

    Authors: M Ferguson, K. H. Law

    Abstract: We use model-free reinforcement learning, extensive simulation, and transfer learning to develop a continuous control algorithm that has good zero-shot performance in a real physical environment. We train a simulated agent to act optimally across a set of similar environments, each with dynamics drawn from a prior distribution. We propose that the agent is able to adjust its actions almost immedia… ▽ More

    Submitted 8 March, 2018; v1 submitted 13 February, 2018; originally announced February 2018.

    Comments: The paper has several technical errors. Rather than correct these errors we have chosen to significantly reformulate the work

  21. arXiv:1701.01657  [pdf, other

    cs.RO astro-ph.IM

    Autonomous Multirobot Excavation for Lunar Applications

    Authors: Jekanthan Thangavelautham, Kenneth Law, Terence Fu, Nader Abu El Samid, Alexander D. S. Smith, Gabriele M. T. D'Eleuterio

    Abstract: In this paper, a control approach called Artificial Neural Tissue (ANT) is applied to multirobot excavation for lunar base preparation tasks including clearing landing pads and burying of habitat modules. We show for the first time, a team of autonomous robots excavating a terrain to match a given 3D blueprint. Constructing mounds around landing pads will provide physical shielding from debris dur… ▽ More

    Submitted 6 January, 2017; originally announced January 2017.

    Comments: 38 pages, 32 figures, archive of journal article, in Robotica, 2017

  22. Transmit Beamforming for Interference Exploitation in the Underlay Cognitive Radio Z-channel

    Authors: Ka Lung Law, Christos Masouros, Marius Pesavento

    Abstract: This paper introduces novel transmit beamforming approaches for the cognitive radio (CR) Z-channel. The proposed transmission schemes exploit non-causal information about the interference at the SBS to re-design the CR beamforming optimization problem. This is done with the objective to improve the quality of service (QoS) of secondary users by taking advantage of constructive interference in the… ▽ More

    Submitted 21 June, 2016; originally announced June 2016.

  23. Rank-Two Beamforming and Power Allocation in Multicasting Relay Networks

    Authors: Adrian Schad, Ka L. Law, Marius Pesavento

    Abstract: In this paper, we propose a novel single-group multicasting relay beamforming scheme. We assume a source that transmits common messages via multiple amplify-and-forward relays to multiple destinations. To increase the number of degrees of freedom in the beamforming design, the relays process two received signals jointly and transmit the Alamouti space-time block code over two different beams. Furt… ▽ More

    Submitted 17 February, 2015; originally announced February 2015.

  24. General Rank Multiuser Downlink Beamforming With Shaping Constraints Using Real-valued OSTBC

    Authors: Ka Lung Law, Xin Wen, Minh Thanh Vu, Marius Pesavento

    Abstract: In this paper we consider optimal multiuser downlink beamforming in the presence of a massive number of arbitrary quadratic shaping constraints. We combine beamforming with full-rate high dimensional real-valued orthogonal space time block coding (OSTBC) to increase the number of beamforming weight vectors and associated degrees of freedom in the beamformer design. The original multi-constraint be… ▽ More

    Submitted 17 February, 2015; v1 submitted 16 February, 2015; originally announced February 2015.