Showing 1–40 of 40 results for author: Mitchell, A

  1. arXiv:2406.05914  [pdf, other]

    eess.AS cs.SD eess.SP

    Soundscape Captioning using Sound Affective Quality Network and Large Language Model

    Authors: Yuanbo Hou, Qiaoqiao Ren, Andrew Mitchell, Wenwu Wang, Jian Kang, Tony Belpaeme, Dick Botteldooren

    Abstract: We live in a rich and varied acoustic world, which is experienced by individuals or communities as a soundscape. Computational auditory scene analysis, disentangling acoustic scenes by detecting and classifying events, focuses on objective attributes of sounds, such as their category and temporal characteristics, ignoring the effect of sounds on people and failing to explore the relationship betwe…

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: Code: https://github.com/Yuanbo2020/SoundSCaper

  2. arXiv:2405.19452  [pdf, other]

    cs.RO cs.LG

    Gaitor: Learning a Unified Representation Across Gaits for Real-World Quadruped Locomotion

    Authors: Alexander L. Mitchell, Wolfgang Merkt, Aristotelis Papatheodorou, Ioannis Havoutis, Ingmar Posner

    Abstract: The current state-of-the-art in quadruped locomotion is able to produce robust motion for terrain traversal but requires the segmentation of a desired robot trajectory into a discrete set of locomotion skills such as trot and crawl. In contrast, in this work we demonstrate the feasibility of learning a single, unified representation for quadruped locomotion enabling continuous blending between gai…

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 10 pages, 8 figures, 2 tables

  3. arXiv:2402.12366  [pdf, other]

    cs.LG cs.AI cs.CL

    A Critical Evaluation of AI Feedback for Aligning Large Language Models

    Authors: Archit Sharma, Sedrick Keh, Eric Mitchell, Chelsea Finn, Kushal Arora, Thomas Kollar

    Abstract: Reinforcement learning with AI feedback (RLAIF) is a popular paradigm for improving the instruction-following abilities of powerful pre-trained language models. RLAIF first performs supervised fine-tuning (SFT) using demonstrations from a teacher model and then further fine-tunes the model with reinforcement learning (RL), using feedback from a critic model. While recent popular open-source models…

    Submitted 19 February, 2024; originally announced February 2024.

  4. arXiv:2311.09030  [pdf]

    eess.AS cs.SD

    AI-based soundscape analysis: Jointly identifying sound sources and predicting annoyance

    Authors: Yuanbo Hou, Qiaoqiao Ren, Huizhong Zhang, Andrew Mitchell, Francesco Aletta, Jian Kang, Dick Botteldooren

    Abstract: Soundscape studies typically attempt to capture the perception and understanding of sonic environments by surveying users. However, for long-term monitoring or assessing interventions, sound-signal-based approaches are required. To this end, most previous research focused on psycho-acoustic quantities or automatic sound recognition. Few attempts were made to include appraisal (e.g., in circumplex…

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: The Journal of the Acoustical Society of America, 154 (5), 3145

    Journal ref: The Journal of the Acoustical Society of America, 154, 3145 (2023)

  5. arXiv:2311.08401  [pdf, other]

    cs.CL cs.AI cs.LG

    Fine-tuning Language Models for Factuality

    Authors: Katherine Tian, Eric Mitchell, Huaxiu Yao, Christopher D. Manning, Chelsea Finn

    Abstract: The fluency and creativity of large pre-trained language models (LLMs) have led to their widespread use, sometimes even as a replacement for traditional search engines. Yet language models are prone to making convincing but factually inaccurate claims, often referred to as 'hallucinations.' These errors can inadvertently spread misinformation or harmfully perpetuate misconceptions. Further, manual…

    Submitted 14 November, 2023; originally announced November 2023.

  6. arXiv:2311.02791  [pdf, other]

    cs.CV

    MirrorCalib: Utilizing Human Pose Information for Mirror-based Virtual Camera Calibration

    Authors: Longyun Liao, Rong Zheng, Andrew Mitchell

    Abstract: In this paper, we present the novel task of estimating the extrinsic parameters of a virtual camera relative to a real camera in exercise videos with a mirror. This task poses a significant challenge in scenarios where the views from the real and mirrored cameras have no overlap or share salient features. To address this issue, prior knowledge of a human body and 2D joint locations are utilized to…

    Submitted 17 May, 2024; v1 submitted 5 November, 2023; originally announced November 2023.

    Comments: Accepted by AVSS2024

  7. arXiv:2310.12962  [pdf, other]

    cs.CL cs.AI cs.LG

    An Emulator for Fine-Tuning Large Language Models using Small Language Models

    Authors: Eric Mitchell, Rafael Rafailov, Archit Sharma, Chelsea Finn, Christopher D. Manning

    Abstract: Widely used language models (LMs) are typically built by scaling up a two-stage training pipeline: a pre-training stage that uses a very large, diverse dataset of text and a fine-tuning (sometimes, 'alignment') stage that uses targeted examples or other specifications of desired behaviors. While it has been hypothesized that knowledge and skills come from pre-training, and fine-tuning mostly filte…

    Submitted 19 October, 2023; originally announced October 2023.

  8. arXiv:2310.06074  [pdf, other]

    cs.RO

    Momentum-Aware Trajectory Optimisation using Full-Centroidal Dynamics and Implicit Inverse Kinematics

    Authors: Aristotelis Papatheodorou, Wolfgang Merkt, Alexander L. Mitchell, Ioannis Havoutis

    Abstract: The current state-of-the-art gradient-based optimisation frameworks are able to produce impressive dynamic manoeuvres such as linear and rotational jumps. However, these methods, which optimise over the full rigid-body dynamics of the robot, often require precise foothold locations apriori, while real-time performance is not guaranteed without elaborate regularisation and tuning of the cost functi…

    Submitted 15 March, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

  9. arXiv:2310.03121  [pdf]

    physics.chem-ph cs.LG

    OpenMM 8: Molecular Dynamics Simulation with Machine Learning Potentials

    Authors: Peter Eastman, Raimondas Galvelis, Raúl P. Peláez, Charlles R. A. Abreu, Stephen E. Farr, Emilio Gallicchio, Anton Gorenko, Michael M. Henry, Frank Hu, Jing Huang, Andreas Krämer, Julien Michel, Joshua A. Mitchell, Vijay S. Pande, João PGLM Rodrigues, Jaime Rodriguez-Guerra, Andrew C. Simmonett, Sukrit Singh, Jason Swails, Philip Turner, Yuanqing Wang, Ivy Zhang, John D. Chodera, Gianni De Fabritiis, Thomas E. Markland

    Abstract: Machine learning plays an important and growing role in molecular simulation. The newest version of the OpenMM molecular dynamics toolkit introduces new features to support the use of machine learning potentials. Arbitrary PyTorch models can be added to a simulation and used to compute forces and energy. A higher-level interface allows users to easily model their molecules of interest with general…

    Submitted 29 November, 2023; v1 submitted 4 October, 2023; originally announced October 2023.

    Comments: 16 pages, 5 figures

    ACM Class: J.2; J.3

  10. arXiv:2308.11980  [pdf, other]

    eess.AS cs.SD

    Joint Prediction of Audio Event and Annoyance Rating in an Urban Soundscape by Hierarchical Graph Representation Learning

    Authors: Yuanbo Hou, Siyang Song, Cheng Luo, Andrew Mitchell, Qiaoqiao Ren, Weicheng Xie, Jian Kang, Wenwu Wang, Dick Botteldooren

    Abstract: Sound events in daily life carry rich information about the objective world. The composition of these sounds affects the mood of people in a soundscape. Most previous approaches only focus on classifying and detecting audio events and scenes, but may ignore their perceptual quality that may impact humans' listening mood for the environment, e.g. annoyance. To this end, this paper proposes a novel…

    Submitted 23 August, 2023; originally announced August 2023.

    Comments: INTERSPEECH 2023, Code and models: https://github.com/Yuanbo2020/HGRL

  11. arXiv:2305.18290  [pdf, other]

    cs.LG cs.AI cs.CL

    Direct Preference Optimization: Your Language Model is Secretly a Reward Model

    Authors: Rafael Rafailov, Archit Sharma, Eric Mitchell, Stefano Ermon, Christopher D. Manning, Chelsea Finn

    Abstract: While large-scale unsupervised language models (LMs) learn broad world knowledge and some reasoning skills, achieving precise control of their behavior is difficult due to the completely unsupervised nature of their training. Existing methods for gaining such steerability collect human labels of the relative quality of model generations and fine-tune the unsupervised LM to align with these prefere…

    Submitted 13 December, 2023; v1 submitted 29 May, 2023; originally announced May 2023.

  12. arXiv:2305.15076  [pdf, other]

    cs.CL

    Meta-Learning Online Adaptation of Language Models

    Authors: Nathan Hu, Eric Mitchell, Christopher D. Manning, Chelsea Finn

    Abstract: Large language models encode impressively broad world knowledge in their parameters. However, the knowledge in static language models falls out of date, limiting the model's effective "shelf life." While online fine-tuning can reduce this degradation, we find that naively fine-tuning on a stream of documents leads to a low level of information uptake. We hypothesize that online fine-tuning does no…

    Submitted 20 October, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023 Camera Ready

  13. arXiv:2305.14975  [pdf, other]

    cs.CL

    Just Ask for Calibration: Strategies for Eliciting Calibrated Confidence Scores from Language Models Fine-Tuned with Human Feedback

    Authors: Katherine Tian, Eric Mitchell, Allan Zhou, Archit Sharma, Rafael Rafailov, Huaxiu Yao, Chelsea Finn, Christopher D. Manning

    Abstract: A trustworthy real-world prediction system should produce well-calibrated confidence scores; that is, its confidence in an answer should be indicative of the likelihood that the answer is correct, enabling deferral to an expert in cases of low-confidence predictions. Recent studies have shown that unsupervised pre-training produces large language models (LMs) whose conditional probabilities are re…

    Submitted 24 October, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023 Camera Ready

  14. A Study of Generative Large Language Model for Medical Research and Healthcare

    Authors: Cheng Peng, Xi Yang, Aokun Chen, Kaleb E Smith, Nima PourNejatian, Anthony B Costa, Cheryl Martin, Mona G Flores, Ying Zhang, Tanja Magoc, Gloria Lipori, Duane A Mitchell, Naykky S Ospina, Mustafa M Ahmed, William R Hogan, Elizabeth A Shenkman, Yi Guo, Jiang Bian, Yonghui Wu

    Abstract: There is enormous enthusiasm and concerns in using large language models (LLMs) in healthcare, yet current assumptions are all based on general-purpose LLMs such as ChatGPT. This study develops a clinical generative LLM, GatorTronGPT, using 277 billion words of mixed clinical and English text with a GPT-3 architecture of 20 billion parameters. GatorTronGPT improves biomedical natural language proc…

    Submitted 22 May, 2023; originally announced May 2023.

  15. arXiv:2303.17853  [pdf, other]

    physics.pop-ph astro-ph.HE cs.CL

    Can AI Put Gamma-Ray Astrophysicists Out of a Job?

    Authors: Samuel T. Spencer, Vikas Joshi, Alison M. W. Mitchell

    Abstract: In what will likely be a litany of generative-model-themed arXiv submissions celebrating April the 1st, we evaluate the capacity of state-of-the-art transformer models to create a paper detailing the detection of a Pulsar Wind Nebula with a non-existent Imaging Atmospheric Cherenkov Telescope (IACT) Array. We do this to evaluate the ability of such models to interpret astronomical observations and…

    Submitted 4 April, 2023; v1 submitted 31 March, 2023; originally announced March 2023.

  16. arXiv:2301.11305  [pdf, other]

    cs.CL cs.AI

    DetectGPT: Zero-Shot Machine-Generated Text Detection using Probability Curvature

    Authors: Eric Mitchell, Yoonho Lee, Alexander Khazatsky, Christopher D. Manning, Chelsea Finn

    Abstract: The increasing fluency and widespread usage of large language models (LLMs) highlight the desirability of corresponding tools aiding detection of LLM-generated text. In this paper, we identify a property of the structure of an LLM's probability function that is useful for such detection. Specifically, we demonstrate that text sampled from an LLM tends to occupy negative curvature regions of the mo…

    Submitted 23 July, 2023; v1 submitted 26 January, 2023; originally announced January 2023.

    Comments: ICML 2023

  17. arXiv:2211.11875  [pdf, other]

    cs.CL cs.AI

    Enhancing Self-Consistency and Performance of Pre-Trained Language Models through Natural Language Inference

    Authors: Eric Mitchell, Joseph J. Noh, Siyan Li, William S. Armstrong, Ananth Agarwal, Patrick Liu, Chelsea Finn, Christopher D. Manning

    Abstract: While large pre-trained language models are powerful, their predictions often lack logical consistency across test inputs. For example, a state-of-the-art Macaw question-answering (QA) model answers 'Yes' to 'Is a sparrow a bird?' and 'Does a bird have feet?' but answers 'No' to 'Does a sparrow have feet?'. To address this failure mode, we propose a framework, Consistency Correction through Relati…

    Submitted 21 November, 2022; originally announced November 2022.

    Comments: 16 pages. EMNLP 2022 Camera Ready. See https://ericmitchell.ai/emnlp-2022-concord/ for code and data

  18. arXiv:2209.03252  [pdf]

    physics.optics cs.ET

    Neuromorphic computing using wavelength-division multiplexing

    Authors: Xingyuan Xu, Weiwei Han, Mengxi Tan, Yang Sun, Yang Li, Jiayang Wu, Roberto Morandotti, Arnan Mitchell, Kun Xu, David J. Moss

    Abstract: Optical neural networks (ONNs), or optical neuromorphic hardware accelerators, have the potential to dramatically enhance the computing power and energy efficiency of mainstream electronic processors, due to their ultralarge bandwidths of up to 10s of terahertz together with their analog architecture that avoids the need for reading and writing data back and forth. Different multiplexing technique…

    Submitted 7 September, 2022; originally announced September 2022.

    Comments: 13 pages, 8 figures, 160 references

    Journal ref: IEEE Journal of Selected Topics in Quantum Electronics Volume 28, Early Access (2022)

  19. arXiv:2207.06245   

    eess.SP cs.NE

    Hitless memory-reconfigurable photonic reservoir computing architecture

    Authors: Mohab Abdalla, Clément Zrounba, Raphael Cardoso, Paul Jimenez, Guanghui Ren, Andreas Boes, Arnan Mitchell, Alberto Bosio, Ian O'Connor, Fabio Pavanello

    Abstract: Reservoir computing is an analog bio-inspired computation model for efficiently processing time-dependent signals, the photonic implementations of which promise a combination of massive parallel information processing, low power consumption, and high speed operation. However, most implementations, especially for the case of time-delay reservoir computing (TDRC), require signal attenuation in the r…

    Submitted 17 May, 2023; v1 submitted 13 July, 2022; originally announced July 2022.

    Comments: The paper has been withdrawn by the authors due to their belief that the arguments and results presented in the paper are not mature enough, and includes a slight error

  20. arXiv:2206.06520  [pdf, other]

    cs.AI cs.CL

    Memory-Based Model Editing at Scale

    Authors: Eric Mitchell, Charles Lin, Antoine Bosselut, Christopher D. Manning, Chelsea Finn

    Abstract: Even the largest neural networks make errors, and once-correct predictions can become invalid as the world changes. Model editors make local updates to the behavior of base (pre-trained) models to inject updated knowledge or correct undesirable behaviors. Existing model editors have shown promise, but also suffer from insufficient expressiveness: they struggle to accurately model an edit's intende…

    Submitted 13 June, 2022; originally announced June 2022.

    Comments: ICML 2022. Project site at https://sites.google.com/view/serac-editing

  21. arXiv:2205.01179  [pdf, other]

    cs.RO cs.LG

    VAE-Loco: Versatile Quadruped Locomotion by Learning a Disentangled Gait Representation

    Authors: Alexander L. Mitchell, Wolfgang Merkt, Mathieu Geisert, Siddhant Gangapurwala, Martin Engelcke, Oiwi Parker Jones, Ioannis Havoutis, Ingmar Posner

    Abstract: Quadruped locomotion is rapidly maturing to a degree where robots are able to realise highly dynamic manoeuvres. However, current planners are unable to vary key gait parameters of the in-swing feet midair. In this work we address this limitation and show that it is pivotal in increasing controller robustness by learning a latent space capturing the key stance phases constituting a particular gait…

    Submitted 12 July, 2023; v1 submitted 2 May, 2022; originally announced May 2022.

    Comments: 16 pages, 13 figures, 1 table, accepted by IEEE Transactions on Robotics (T-RO) as an extended paper. arXiv admin note: substantial text overlap with arXiv:2112.04809

  22. arXiv:2203.03540  [pdf]

    cs.CL cs.AI cs.LG

    GatorTron: A Large Clinical Language Model to Unlock Patient Information from Unstructured Electronic Health Records

    Authors: Xi Yang, Aokun Chen, Nima PourNejatian, Hoo Chang Shin, Kaleb E Smith, Christopher Parisien, Colin Compas, Cheryl Martin, Mona G Flores, Ying Zhang, Tanja Magoc, Christopher A Harle, Gloria Lipori, Duane A Mitchell, William R Hogan, Elizabeth A Shenkman, Jiang Bian, Yonghui Wu

    Abstract: There is an increasing interest in developing artificial intelligence (AI) systems to process and interpret electronic health records (EHRs). Natural language processing (NLP) powered by pretrained language models is the key technology for medical AI systems utilizing clinical narratives. However, there are few clinical language models, the largest of which trained in the clinical domain is compar…

    Submitted 16 December, 2022; v1 submitted 2 February, 2022; originally announced March 2022.

    Comments: 24 pages, 2 figures, 3 tables

  23. arXiv:2112.04809  [pdf, other]

    cs.RO cs.LG

    Next Steps: Learning a Disentangled Gait Representation for Versatile Quadruped Locomotion

    Authors: Alexander L. Mitchell, Wolfgang Merkt, Mathieu Geisert, Siddhant Gangapurwala, Martin Engelcke, Oiwi Parker Jones, Ioannis Havoutis, Ingmar Posner

    Abstract: Quadruped locomotion is rapidly maturing to a degree where robots now routinely traverse a variety of unstructured terrains. However, while gaits can be varied typically by selecting from a range of pre-computed styles, current planners are unable to vary key gait parameters continuously while the robot is in motion. The synthesis, on-the-fly, of gaits with unexpected operational characteristics o…

    Submitted 29 March, 2022; v1 submitted 9 December, 2021; originally announced December 2021.

    Comments: 7 pages, 4 figures, accepted at IEEE International Conference on Robotics and Automation (ICRA), 2022

  24. arXiv:2110.11309  [pdf, other]

    cs.LG cs.AI cs.CL

    Fast Model Editing at Scale

    Authors: Eric Mitchell, Charles Lin, Antoine Bosselut, Chelsea Finn, Christopher D. Manning

    Abstract: While large pre-trained models have enabled impressive results on a variety of downstream tasks, the largest existing models still make errors, and even accurate predictions may become outdated over time. Because detecting all such failures at training time is impossible, enabling both developers and end users of such models to correct inaccurate outputs while leaving the model otherwise intact is…

    Submitted 13 June, 2022; v1 submitted 21 October, 2021; originally announced October 2021.

    Comments: ICLR 2022. View implementation and additional project info at https://sites.google.com/view/mend-editing

  25. arXiv:2011.07393  [pdf]

    cs.NE physics.app-ph physics.optics

    11 TeraFLOPs per second photonic convolutional accelerator for deep learning optical neural networks

    Authors: Xingyuan Xu, Mengxi Tan, Bill Corcoran, Jiayang Wu, Andreas Boes, Thach G. Nguyen, Sai T. Chu, Brent E. Little, Damien G. Hicks, Roberto Morandotti, Arnan Mitchell, David J. Moss

    Abstract: Convolutional neural networks (CNNs), inspired by biological visual cortex systems, are a powerful category of artificial neural networks that can extract the hierarchical features of raw data to greatly reduce the network parametric complexity and enhance the predicting accuracy. They are of significant interest for machine learning tasks such as computer vision, speech recognition, playing board…

    Submitted 14 November, 2020; originally announced November 2020.

    Comments: 21 pages, 9 figures, 39 references

    Journal ref: Nature, Volume 589 Issue 7840. pages 44-51 (2021)

  26. arXiv:2008.06043  [pdf, other]

    cs.LG cs.AI stat.ML

    Offline Meta-Reinforcement Learning with Advantage Weighting

    Authors: Eric Mitchell, Rafael Rafailov, Xue Bin Peng, Sergey Levine, Chelsea Finn

    Abstract: This paper introduces the offline meta-reinforcement learning (offline meta-RL) problem setting and proposes an algorithm that performs well in this setting. Offline meta-RL is analogous to the widely successful supervised learning strategy of pre-training a model on a large batch of fixed, pre-collected data (possibly from various tasks) and fine-tuning the model to a new task with relatively lit…

    Submitted 21 July, 2021; v1 submitted 13 August, 2020; originally announced August 2020.

    Comments: ICML 2021; for code & project info, see http://sites.google.com/view/macaw-metarl

  27. arXiv:2007.01520  [pdf, other]

    cs.RO cs.LG

    First Steps: Latent-Space Control with Semantic Constraints for Quadruped Locomotion

    Authors: Alexander L. Mitchell, Martin Engelcke, Oiwi Parker Jones, David Surovik, Siddhant Gangapurwala, Oliwier Melon, Ioannis Havoutis, Ingmar Posner

    Abstract: Traditional approaches to quadruped control frequently employ simplified, hand-derived models. This significantly reduces the capability of the robot since its effective kinematic range is curtailed. In addition, kinodynamic constraints are often non-differentiable and difficult to implement in an optimisation approach. In this work, these challenges are addressed by framing quadruped control as o…

    Submitted 20 November, 2020; v1 submitted 3 July, 2020; originally announced July 2020.

    Comments: 8 pages, 7 figures, accepted at IROS 2020

  28. arXiv:2006.07981  [pdf, other]

    cs.CV

    Geodesic-HOF: 3D Reconstruction Without Cutting Corners

    Authors: Ziyun Wang, Eric A. Mitchell, Volkan Isler, Daniel D. Lee

    Abstract: Single-view 3D object reconstruction is a challenging fundamental problem in computer vision, largely due to the morphological diversity of objects in the natural world. In particular, high curvature regions are not always captured effectively by methods trained using only set-based loss functions, resulting in reconstructions short-circuiting the surface or cutting corners. In particular, high cu…

    Submitted 14 June, 2020; originally announced June 2020.

  29. arXiv:2004.11948  [pdf, other]

    cs.CE

    An active learning high-throughput microstructure calibration framework for solving inverse structure-process problems in materials informatics

    Authors: Anh Tran, John A. Mitchell, Laura P. Swiler, Tim Wildey

    Abstract: Determining a process-structure-property relationship is the holy grail of materials science, where both computational prediction in the forward direction and materials design in the inverse direction are essential. Problems in materials design are often considered in the context of process-property linkage by bypassing the materials structure, or in the context of structure-property linkage as in…

    Submitted 24 April, 2020; originally announced April 2020.

    Comments: Acta Materialia

  30. arXiv:2003.01347  [pdf]

    physics.optics cs.ET

    Single photonic perceptron based on a soliton crystal Kerr microcomb for high-speed, scalable, optical neural networks

    Authors: Xingyuan Xu, Mengxi Tan, Bill Corcoran, Jiayang Wu, Thach G. Nguyen, Andreas Boes, Sai T. Chu, Brent E. Little, Roberto Morandotti, Arnan Mitchell, Damien G. Hicks, David J. Moss

    Abstract: Optical artificial neural networks (ONNs), analog computing hardware tailored for machine learning, have significant potential for ultra-high computing speed and energy efficiency. We propose a new approach to architectures for ONNs based on integrated Kerr micro-comb sources that is programmable, highly scalable and capable of reaching ultra-high speeds. We experimentally demonstrate the building…

    Submitted 3 March, 2020; originally announced March 2020.

    Comments: 18 pages, 7 Figures, 62 References

  31. arXiv:2002.09676  [pdf, other]

    cs.RO cs.LG eess.SY

    Guided Constrained Policy Optimization for Dynamic Quadrupedal Robot Locomotion

    Authors: Siddhant Gangapurwala, Alexander Mitchell, Ioannis Havoutis

    Abstract: Deep reinforcement learning (RL) uses model-free techniques to optimize task-specific control policies. Despite having emerged as a promising approach for complex problems, RL is still hard to use reliably for real-world applications. Apart from challenges such as precise reward function tuning, inaccurate sensing and actuation, and non-deterministic response, existing RL methods do not guarantee…

    Submitted 22 February, 2020; originally announced February 2020.

    Comments: 8 pages, 8 figures, 5 tables, 1 algorithm, accepted to IEEE Robotics and Automation Letters (RA-L), January 2020 with presentation at International Conference on Robotics and Automation (ICRA) 2020

  32. arXiv:1907.10388  [pdf, other]

    cs.LG cs.CV cs.RO stat.ML

    Higher-Order Function Networks for Learning Composable 3D Object Representations

    Authors: Eric Mitchell, Selim Engin, Volkan Isler, Daniel D Lee

    Abstract: We present a new approach to 3D object representation where a neural network encodes the geometry of an object directly into the weights and biases of a second 'mapping' network. This mapping network can be used to reconstruct an object by applying its encoded transformation to points randomly sampled from a simple geometric space, such as the unit sphere. We study the effectiveness of our method…

    Submitted 6 April, 2020; v1 submitted 24 July, 2019; originally announced July 2019.

    Comments: To be published in International Conference on Learning Representations (ICLR 2020) [https://openreview.net/forum?id=HJgfDREKDB]; 19 pages

  33. arXiv:1904.02643  [pdf, other]

    cs.CV

    Siamese Encoding and Alignment by Multiscale Learning with Self-Supervision

    Authors: Eric Mitchell, Stefan Keselj, Sergiy Popovych, Davit Buniatyan, H. Sebastian Seung

    Abstract: We propose a method of aligning a source image to a target image, where the transform is specified by a dense vector field. The two images are encoded as feature hierarchies by siamese convolutional nets. Then a hierarchy of aligner modules computes the transform in a coarse-to-fine recursion. Each module receives as input the transform that was computed by the module at the level above, aligns th…

    Submitted 4 April, 2019; originally announced April 2019.

  34. arXiv:1903.10605  [pdf, other]

    cs.AI

    Q-Learning for Continuous Actions with Cross-Entropy Guided Policies

    Authors: Riley Simmons-Edler, Ben Eisner, Eric Mitchell, Sebastian Seung, Daniel Lee

    Abstract: Off-Policy reinforcement learning (RL) is an important class of methods for many problem domains, such as robotics, where the cost of collecting data is high and on-policy methods are consequently intractable. Standard methods for applying Q-learning to continuous-valued action domains involve iteratively sampling the Q-function to find a good action (e.g. via hill-climbing), or by learning a poli…

    Submitted 1 July, 2019; v1 submitted 25 March, 2019; originally announced March 2019.

  35. arXiv:1902.08767  [pdf, other]

    cs.GR cs.CG

    VoroCrust: Voronoi Meshing Without Clipping

    Authors: Ahmed Abdelkader, Chandrajit L. Bajaj, Mohamed S. Ebeida, Ahmed H. Mahmoud, Scott A. Mitchell, John D. Owens, Ahmad A. Rushdi

    Abstract: Polyhedral meshes are increasingly becoming an attractive option with particular advantages over traditional meshes for certain applications. What has been missing is a robust polyhedral meshing algorithm that can handle broad classes of domains exhibiting arbitrarily curved boundaries and sharp features. In addition, the power of primal-dual mesh pairs, exemplified by Voronoi-Delaunay meshes, has…

    Submitted 22 November, 2023; v1 submitted 23 February, 2019; originally announced February 2019.

    Comments: 18 pages (including appendix), 18 figures. Version without compressed images available on https://www.sandia.gov/app/uploads/sites/217/2023/09/VoroCrust.pdf. Supplemental materials available on https://www.sandia.gov/app/uploads/sites/217/2023/09/VoroCrust_supplemental_materials.pdf

    ACM Class: I.3.5

    Journal ref: ACM Transactions on Graphics, Vol. 39, No. 3, Article No. 23 (May 2020)

  36. arXiv:1803.06078  [pdf, other]

    cs.CG

    Sampling Conditions for Conforming Voronoi Meshing by the VoroCrust Algorithm

    Authors: Ahmed Abdelkader, Chandrajit L. Bajaj, Mohamed S. Ebeida, Ahmed H. Mahmoud, Scott A. Mitchell, John D. Owens, Ahmad A. Rushdi

    Abstract: We study the problem of decomposing a volume bounded by a smooth surface into a collection of Voronoi cells. Unlike the dual problem of conforming Delaunay meshing, a principled solution to this problem for generic smooth surfaces remained elusive. VoroCrust leverages ideas from $α$-shapes and the power crust algorithm to produce unweighted Voronoi cells conforming to the surface, yielding the fir…

    Submitted 14 April, 2018; v1 submitted 16 March, 2018; originally announced March 2018.

    Comments: polished up version, results essentially unchanged

    ACM Class: I.3.5

  37. arXiv:1606.00800  [pdf, other]

    stat.ML cs.CV cs.SI q-bio.NC

    Multi-View Treelet Transform

    Authors: Brian A. Mitchell, Linda R. Petzold

    Abstract: Current multi-view factorization methods make assumptions that are not acceptable for many kinds of data, and in particular, for graphical data with hierarchical structure. At the same time, current hierarchical methods work only in the single-view setting. We generalize the Treelet Transform to the Multi-View Treelet Transform (MVTT) to allow for the capture of hierarchical structure when multipl…

    Submitted 17 June, 2016; v1 submitted 2 June, 2016; originally announced June 2016.

  38. arXiv:1501.05992  [pdf, other]

    astro-ph.IM cs.CE

    The Murchison Widefield Array Correlator

    Authors: S. M. Ord, B. Crosse, D. Emrich, D. Pallot, R. B. Wayth, M. A. Clark, S. E. Tremblay, W. Arcus, D. Barnes, M. Bell, G. Bernardi, N. D. R. Bhat, J. D. Bowman, F. Briggs, J. D. Bunton, R. J. Cappallo, B. E. Corey, A. A. Deshpande, L. deSouza, A. Ewell-Wice, L. Feng, R. Goeke, L. J. Greenhill, B. J. Hazelton, D. Herne, et al. (42 additional authors not shown)

    Abstract: The Murchison Widefield Array (MWA) is a Square Kilometre Array (SKA) Precursor. The telescope is located at the Murchison Radio-astronomy Observatory (MRO) in Western Australia (WA). The MWA consists of 4096 dipoles arranged into 128 dual polarisation aperture arrays forming a connected element interferometer that cross-correlates signals from all 256 inputs. A hybrid approach to the correlation…

    Submitted 23 January, 2015; originally announced January 2015.

    Comments: 17 pages, 9 figures. Accepted for publication in PASA. Some figures altered to meet astro-ph submission requirements

  39. Spoke-Darts for High-Dimensional Blue-Noise Sampling

    Authors: Scott A. Mitchell, Mohamed S. Ebeida, Muhammad A. Awad, Chonhyon Park, Anjul Patney, Ahmad A. Rushdi, Laura P. Swiler, Dinesh Manocha, Li-Yi Wei

    Abstract: Blue noise sampling has proved useful for many graphics applications, but remains underexplored in high-dimensional spaces due to the difficulty of generating distributions and proving properties about them. We present a blue noise sampling method with good quality and performance across different dimensions. The method, spoke-dart sampling, shoots rays from prior samples and selects samples from…

    Submitted 13 June, 2018; v1 submitted 5 August, 2014; originally announced August 2014.

    Comments: 19 pages, 22 figures

    Report number: SAND2018-5611 J

    Journal ref: ACM Transactions on Graphics (TOG), Volume 37, Issue 2, May 2018, Article No. 22

  40. k-d Darts: Sampling by k-Dimensional Flat Searches

    Authors: Mohamed S. Ebeida, Anjul Patney, Scott A. Mitchell, Keith R. Dalbey, Andrew A. Davidson, John D. Owens

    Abstract: We formalize the notion of sampling a function using k-d darts. A k-d dart is a set of independent, mutually orthogonal, k-dimensional subspaces called k-d flats. Each dart has d choose k flats, aligned with the coordinate axes for efficiency. We show that k-d darts are useful for exploring a function's properties, such as estimating its integral, or finding an exemplar above a threshold. We descr…

    Submitted 15 February, 2013; originally announced February 2013.

    Comments: 19 pages, 16 figures

    ACM Class: I.3.5

    Journal ref: Transactions on Graphics, vol. 33, no. 1 (Jan 2014) pp. 3:1–3:16