Showing 1–5 of 5 results for author: Hu, M Y

  1. arXiv:2405.13349

    cs.DC

    Building a Verifiable Logical Clock for P2P Networks

    Authors: Guangda Sun, Tianyang Tao, Yanpei Guo, Michael Yiqing Hu, Jialin Li

    Abstract: Logical clocks are a fundamental tool to establish causal ordering of events in a distributed system. They have been applied in weakly consistent storage systems, causally ordered broadcast, distributed snapshots, deadlock detection, and distributed system debugging. However, prior logical clock constructs fail to work in an open network with Byzantine participants. In this work, we present Chrono…

    Submitted 22 May, 2024; originally announced May 2024.

  2. arXiv:2404.06214

    cs.CL

    [Call for Papers] The 2nd BabyLM Challenge: Sample-efficient pretraining on a developmentally plausible corpus

    Authors: Leshem Choshen, Ryan Cotterell, Michael Y. Hu, Tal Linzen, Aaron Mueller, Candace Ross, Alex Warstadt, Ethan Wilcox, Adina Williams, Chengxu Zhuang

    Abstract: After last year's successful BabyLM Challenge, the competition will be hosted again in 2024/2025. The overarching goals of the challenge remain the same; however, some of the competition rules will be different. The big changes for this year's competition are as follows: First, we replace the loose track with a paper track, which allows (for example) non-model-based submissions, novel cognitively-…

    Submitted 9 April, 2024; originally announced April 2024.

  3. arXiv:2402.03618

    cs.AI cs.CL q-bio.NC

    Comparing Abstraction in Humans and Large Language Models Using Multimodal Serial Reproduction

    Authors: Sreejan Kumar, Raja Marjieh, Byron Zhang, Declan Campbell, Michael Y. Hu, Umang Bhatt, Brenden Lake, Thomas L. Griffiths

    Abstract: Humans extract useful abstractions of the world from noisy sensory data. Serial reproduction allows us to study how people construe the world through a paradigm similar to the game of telephone, where one person observes a stimulus and reproduces it for the next to form a chain of reproductions. Past serial reproduction experiments typically employ a single sensory modality, but humans often commu…

    Submitted 5 February, 2024; originally announced February 2024.

  4. arXiv:2308.09543

    cs.LG

    Latent State Models of Training Dynamics

    Authors: Michael Y. Hu, Angelica Chen, Naomi Saphra, Kyunghyun Cho

    Abstract: The impact of randomness on model training is poorly understood. How do differences in data order and initialization actually manifest in the model, such that some training runs outperform others or converge faster? Furthermore, how can we interpret the resulting training dynamics and the phase transitions that characterize different trajectories? To understand the effect of randomness on the dyna…

    Submitted 19 January, 2024; v1 submitted 18 August, 2023; originally announced August 2023.

    Comments: Accepted at TMLR 2023. Updated Jan 19, 2024 with erratum

  5. arXiv:2205.11558

    cs.AI

    Using Natural Language and Program Abstractions to Instill Human Inductive Biases in Machines

    Authors: Sreejan Kumar, Carlos G. Correa, Ishita Dasgupta, Raja Marjieh, Michael Y. Hu, Robert D. Hawkins, Nathaniel D. Daw, Jonathan D. Cohen, Karthik Narasimhan, Thomas L. Griffiths

    Abstract: Strong inductive biases give humans the ability to quickly learn to perform a variety of tasks. Although meta-learning is a method to endow neural networks with useful inductive biases, agents trained by meta-learning may sometimes acquire very different strategies from humans. We show that co-training these agents on predicting representations from natural language task descriptions and programs…

    Submitted 5 February, 2023; v1 submitted 23 May, 2022; originally announced May 2022.

    Comments: In Proceedings of the 36th Conference on Neural Information Processing Systems (NeurIPS 2022), winner of Outstanding Paper Award