Skip to main content

Showing 1–50 of 91 results for author: Ho, D

  1. arXiv:2406.13847  [pdf, other

    cs.CV

    Locating and measuring marine aquaculture production from space: a computer vision approach in the French Mediterranean

    Authors: Sebastian Quaade, Andrea Vallebueno, Olivia D. N. Alcabes, Kit T. Rodolfa, Daniel E. Ho

    Abstract: Aquaculture production -- the cultivation of aquatic plants and animals -- has grown rapidly since the 1990s, but sparse, self-reported and aggregate production data limits the effective understanding and monitoring of the industry's trends and potential risks. Building on a manual survey of aquaculture production from remote sensing imagery, we train a computer vision model to identify marine aqu… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  2. arXiv:2406.12165  [pdf, other

    cs.CL

    Statistical Uncertainty in Word Embeddings: GloVe-V

    Authors: Andrea Vallebueno, Cassandra Handan-Nader, Christopher D. Manning, Daniel E. Ho

    Abstract: Static word embeddings are ubiquitous in computational social science applications and contribute to practical decision-making in a variety of fields including law and healthcare. However, assessing the statistical uncertainty in downstream conclusions drawn from word embedding statistics has remained challenging. When using only point estimates for embeddings, researchers have no streamlined way… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  3. arXiv:2405.20362  [pdf, other

    cs.CL cs.CY

    Hallucination-Free? Assessing the Reliability of Leading AI Legal Research Tools

    Authors: Varun Magesh, Faiz Surani, Matthew Dahl, Mirac Suzgun, Christopher D. Manning, Daniel E. Ho

    Abstract: Legal practice has witnessed a sharp rise in products incorporating artificial intelligence (AI). Such tools are designed to assist with a wide range of core legal tasks, from search and summarization of caselaw to document drafting. But the large language models used in these tools are prone to "hallucinate," or make up false information, making their use risky in high-stakes domains. Recently, c… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: Our dataset, tool outputs, and labels will be made available upon publication. This version of the manuscript (May 30, 2024) is updated to reflect an evaluation of Westlaw's AI-Assisted Research

  4. arXiv:2404.02127  [pdf, other

    cs.CL cs.AI cs.LG

    FLawN-T5: An Empirical Examination of Effective Instruction-Tuning Data Mixtures for Legal Reasoning

    Authors: Joel Niklaus, Lucia Zheng, Arya D. McCarthy, Christopher Hahn, Brian M. Rosen, Peter Henderson, Daniel E. Ho, Garrett Honke, Percy Liang, Christopher Manning

    Abstract: Instruction tuning is an important step in making language models useful for direct user interaction. However, many legal tasks remain out of reach for most open LLMs and there do not yet exist any large scale instruction datasets for the domain. This critically limits research in this application area. In this work, we curate LawInstruct, a large legal instruction dataset, covering 17 jurisdictio… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    MSC Class: 68T50 ACM Class: I.2

  5. arXiv:2403.07918  [pdf, other

    cs.CY cs.AI cs.LG

    On the Societal Impact of Open Foundation Models

    Authors: Sayash Kapoor, Rishi Bommasani, Kevin Klyman, Shayne Longpre, Ashwin Ramaswami, Peter Cihon, Aspen Hopkins, Kevin Bankston, Stella Biderman, Miranda Bogen, Rumman Chowdhury, Alex Engler, Peter Henderson, Yacine Jernite, Seth Lazar, Stefano Maffulli, Alondra Nelson, Joelle Pineau, Aviya Skowron, Dawn Song, Victor Storchan, Daniel Zhang, Daniel E. Ho, Percy Liang, Arvind Narayanan

    Abstract: Foundation models are powerful technologies: how they are released publicly directly shapes their societal impact. In this position paper, we focus on open foundation models, defined here as those with broadly available model weights (e.g. Llama 2, Stable Diffusion XL). We identify five distinctive properties (e.g. greater customizability, poor monitoring) of open foundation models that lead to bo… ▽ More

    Submitted 27 February, 2024; originally announced March 2024.

  6. arXiv:2402.18182  [pdf, ps, other

    cs.DL

    Handling Open Research Data within the Max Planck Society -- Looking Closer at the Year 2020

    Authors: Martin Boosen, Michael Franke, Yves Vincent Grossmann, Sy Dat Ho, Larissa Leiminger, Jan Matthiesen

    Abstract: This paper analyses the practice of publishing research data within the Max Planck Society in the year 2020. The central finding of the study is that up to 40\% of the empirical text publications had research data available. The aggregation of the available data is predominantly analysed. There are differences between the sections of the Max Planck Society but they are not as great as one might ex… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

  7. arXiv:2402.02008  [pdf, other

    cs.CL cs.AI

    How well do LLMs cite relevant medical references? An evaluation framework and analyses

    Authors: Kevin Wu, Eric Wu, Ally Cassasola, Angela Zhang, Kevin Wei, Teresa Nguyen, Sith Riantawan, Patricia Shi Riantawan, Daniel E. Ho, James Zou

    Abstract: Large language models (LLMs) are currently being used to answer medical questions across a variety of clinical domains. Recent top-performing commercial LLMs, in particular, are also capable of citing sources to support their responses. In this paper, we ask: do the sources that LLMs generate actually support the claims that they make? To answer this, we propose three contributions. First, as expe… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

  8. arXiv:2401.01301  [pdf, other

    cs.CL cs.AI cs.CY

    Large Legal Fictions: Profiling Legal Hallucinations in Large Language Models

    Authors: Matthew Dahl, Varun Magesh, Mirac Suzgun, Daniel E. Ho

    Abstract: Do large language models (LLMs) know the law? These models are increasingly being used to augment legal practice, education, and research, yet their revolutionary potential is threatened by the presence of hallucinations -- textual output that is not consistent with legal facts. We present the first systematic evidence of these hallucinations, documenting LLMs' varying performance across jurisdict… ▽ More

    Submitted 21 June, 2024; v1 submitted 2 January, 2024; originally announced January 2024.

  9. arXiv:2310.01679  [pdf, other

    cs.LG cs.CY stat.ML

    Estimating and Implementing Conventional Fairness Metrics With Probabilistic Protected Features

    Authors: Hadi Elzayn, Emily Black, Patrick Vossler, Nathanael Jo, Jacob Goldin, Daniel E. Ho

    Abstract: The vast majority of techniques to train fair models require access to the protected attribute (e.g., race, gender), either at train time or in production. However, in many important applications this protected attribute is largely unavailable. In this paper, we develop methods for measuring and reducing fairness violations in a setting with limited access to protected attribute labels. Specifical… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

  10. arXiv:2309.17337  [pdf, other

    cs.LG cs.AI cs.CY

    Toward Operationalizing Pipeline-aware ML Fairness: A Research Agenda for Developing Practical Guidelines and Tools

    Authors: Emily Black, Rakshit Naidu, Rayid Ghani, Kit T. Rodolfa, Daniel E. Ho, Hoda Heidari

    Abstract: While algorithmic fairness is a thriving area of research, in practice, mitigating issues of bias often gets reduced to enforcing an arbitrarily chosen fairness metric, either by enforcing fairness constraints during the optimization step, post-processing model outputs, or by manipulating the training data. Recent work has called on the ML community to take a more holistic approach to tackle fairn… ▽ More

    Submitted 29 September, 2023; originally announced September 2023.

    Comments: EAAMO'23 (Archival)

  11. arXiv:2308.11462  [pdf, other

    cs.CL cs.AI cs.CY

    LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models

    Authors: Neel Guha, Julian Nyarko, Daniel E. Ho, Christopher Ré, Adam Chilton, Aditya Narayana, Alex Chohlas-Wood, Austin Peters, Brandon Waldon, Daniel N. Rockmore, Diego Zambrano, Dmitry Talisman, Enam Hoque, Faiz Surani, Frank Fagan, Galit Sarfaty, Gregory M. Dickinson, Haggai Porat, Jason Hegland, Jessica Wu, Joe Nudell, Joel Niklaus, John Nay, Jonathan H. Choi, Kevin Tobia , et al. (15 additional authors not shown)

    Abstract: The advent of large language models (LLMs) and their adoption by the legal community has given rise to the question: what types of legal reasoning can LLMs perform? To enable greater study of this question, we present LegalBench: a collaboratively constructed legal reasoning benchmark consisting of 162 tasks covering six different types of legal reasoning. LegalBench was built through an interdisc… ▽ More

    Submitted 20 August, 2023; originally announced August 2023.

    Comments: 143 pages, 79 tables, 4 figures

  12. arXiv:2306.09237  [pdf, other

    cs.CL cs.AI cs.LG

    SCALE: Scaling up the Complexity for Advanced Language Model Evaluation

    Authors: Vishvaksenan Rasiah, Ronja Stern, Veton Matoshi, Matthias Stürmer, Ilias Chalkidis, Daniel E. Ho, Joel Niklaus

    Abstract: Recent strides in Large Language Models (LLMs) have saturated many NLP benchmarks (even professional domain-specific ones), emphasizing the need for novel, more challenging novel ones to properly assess LLM capabilities. In this paper, we introduce a novel NLP benchmark that poses challenges to current LLMs across four key dimensions: processing long documents (up to 50K tokens), utilizing domain… ▽ More

    Submitted 1 September, 2023; v1 submitted 15 June, 2023; originally announced June 2023.

    MSC Class: 68T50 ACM Class: I.2

  13. arXiv:2306.02069  [pdf, other

    cs.CL cs.AI cs.LG

    MultiLegalPile: A 689GB Multilingual Legal Corpus

    Authors: Joel Niklaus, Veton Matoshi, Matthias Stürmer, Ilias Chalkidis, Daniel E. Ho

    Abstract: Large, high-quality datasets are crucial for training Large Language Models (LLMs). However, so far, there are few datasets available for specialized critical domains such as law and the available ones are often only for the English language. We curate and release MultiLegalPile, a 689GB corpus in 24 languages from 17 jurisdictions. The MultiLegalPile corpus, which includes diverse legal data sour… ▽ More

    Submitted 19 May, 2024; v1 submitted 3 June, 2023; originally announced June 2023.

    Comments: Accepted to ACL 2024

    MSC Class: 68T50 ACM Class: I.2

  14. arXiv:2305.03270  [pdf, other

    cs.RO

    Deep RL at Scale: Sorting Waste in Office Buildings with a Fleet of Mobile Manipulators

    Authors: Alexander Herzog, Kanishka Rao, Karol Hausman, Yao Lu, Paul Wohlhart, Mengyuan Yan, Jessica Lin, Montserrat Gonzalez Arenas, Ted Xiao, Daniel Kappler, Daniel Ho, Jarek Rettinghouse, Yevgen Chebotar, Kuang-Huei Lee, Keerthana Gopalakrishnan, Ryan Julian, Adrian Li, Chuyuan Kelly Fu, Bob Wei, Sangeetha Ramesh, Khem Holden, Kim Kleiven, David Rendleman, Sean Kirmani, Jeff Bingham , et al. (15 additional authors not shown)

    Abstract: We describe a system for deep reinforcement learning of robotic manipulation skills applied to a large-scale real-world task: sorting recyclables and trash in office buildings. Real-world deployment of deep RL policies requires not only effective training algorithms, but the ability to bootstrap real-world training and enable broad generalization. To this end, our system combines scalable deep RL… ▽ More

    Submitted 5 May, 2023; originally announced May 2023.

    Comments: Published at Robotics: Science and Systems 2023

  15. Potential for allocative harm in an environmental justice data tool

    Authors: Benjamin Q. Huynh, Elizabeth T. Chin, Allison Koenecke, Derek Ouyang, Daniel E. Ho, Mathew V. Kiang, David H. Rehkopf

    Abstract: Neighborhood-level screening algorithms are increasingly being deployed to inform policy decisions. We evaluate one such algorithm, CalEnviroScreen - designed to promote environmental justice and used to guide hundreds of millions of dollars in public funding annually - assessing its potential for allocative harm. We observe the model to be sensitive to subjective model decisions, with 16% of trac… ▽ More

    Submitted 12 April, 2023; v1 submitted 12 April, 2023; originally announced April 2023.

    Journal ref: Nat Mach Intell 6, 187-194 (2024)

  16. arXiv:2303.02580  [pdf, other

    stat.AP cs.CY

    Estimating Racial Disparities When Race is Not Observed

    Authors: Cory McCartan, Robin Fisher, Jacob Goldin, Daniel E. Ho, Kosuke Imai

    Abstract: The estimation of racial disparities in various fields is often hampered by the lack of individual-level racial information. In many cases, the law prohibits the collection of such information to prevent direct racial discrimination. As a result, analysts have frequently adopted Bayesian Improved Surname Geocoding (BISG) and its variants, which combine individual names and addresses with Census da… ▽ More

    Submitted 16 April, 2024; v1 submitted 4 March, 2023; originally announced March 2023.

    Comments: 28 pages, 9 figures, plus references and appendices

  17. arXiv:2302.04334  [pdf, other

    cs.RO cs.AI

    Asking for Help: Failure Prediction in Behavioral Cloning through Value Approximation

    Authors: Cem Gokmen, Daniel Ho, Mohi Khansari

    Abstract: Recent progress in end-to-end Imitation Learning approaches has shown promising results and generalization capabilities on mobile manipulation tasks. Such models are seeing increasing deployment in real-world settings, where scaling up requires robots to be able to operate with high autonomy, i.e. requiring as little human supervision as possible. In order to avoid the need for one-on-one human su… ▽ More

    Submitted 8 February, 2023; originally announced February 2023.

    Comments: Accepted to the 2023 IEEE International Conference on Robotics and Automation (ICRA 2023)

    ACM Class: I.2.9

  18. arXiv:2211.07590  [pdf, other

    cs.CV

    Stain-invariant self supervised learning for histopathology image analysis

    Authors: Alexandre Tiard, Alex Wong, David Joon Ho, Yangchao Wu, Eliram Nof, Alvin C. Goh, Stefano Soatto, Saad Nadeem

    Abstract: We present a self-supervised algorithm for several classification tasks within hematoxylin and eosin (H&E) stained images of breast cancer. Our method is robust to stain variations inherent to the histology images acquisition process, which has limited the applicability of automated analysis tools. We address this problem by imposing constraints a learnt latent space which leverages stain normaliz… ▽ More

    Submitted 7 September, 2023; v1 submitted 14 November, 2022; originally announced November 2022.

  19. arXiv:2209.06120  [pdf, ps, other

    cs.AI

    LegalBench: Prototyping a Collaborative Benchmark for Legal Reasoning

    Authors: Neel Guha, Daniel E. Ho, Julian Nyarko, Christopher Ré

    Abstract: Can foundation models be guided to execute tasks involving legal reasoning? We believe that building a benchmark to answer this question will require sustained collaborative efforts between the computer science and legal communities. To that end, this short paper serves three purposes. First, we describe how IRAC-a framework legal scholars use to distinguish different types of legal reasoning-can… ▽ More

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: 13 pages, 7 tables

  20. arXiv:2208.11747  [pdf, other

    cs.LG

    Entropy Regularization for Population Estimation

    Authors: Ben Chugg, Peter Henderson, Jacob Goldin, Daniel E. Ho

    Abstract: Entropy regularization is known to improve exploration in sequential decision-making problems. We show that this same mechanism can also lead to nearly unbiased and lower-variance estimates of the mean reward in the optimize-and-estimate structured bandit setting. Mean reward estimation (i.e., population estimation) tasks have recently been shown to be essential for public policy settings where le… ▽ More

    Submitted 24 August, 2022; originally announced August 2022.

  21. Detecting Environmental Violations with Satellite Imagery in Near Real Time: Land Application under the Clean Water Act

    Authors: Ben Chugg, Nicolas Rothbacher, Alex Feng, Xiaoqi Long, Daniel E. Ho

    Abstract: This paper introduces a new, highly consequential setting for the use of computer vision for environmental sustainability. Concentrated Animal Feeding Operations (CAFOs) (aka intensive livestock farms or "factory farms") produce significant manure and pollution. Dumping manure in the winter months poses significant environmental risks and violates environmental law in many states. Yet the federal… ▽ More

    Submitted 18 August, 2022; originally announced August 2022.

    Comments: Accepted to CIKM '22

  22. arXiv:2208.04910  [pdf

    cs.CV

    Deep Learning-Based Objective and Reproducible Osteosarcoma Chemotherapy Response Assessment and Outcome Prediction

    Authors: David Joon Ho, Narasimhan P. Agaram, Marc-Henri Jean, Stephanie D. Suser, Cynthia Chu, Chad M. Vanderbilt, Paul A. Meyers, Leonard H. Wexler, John H. Healey, Thomas J. Fuchs, Meera R. Hameed

    Abstract: Osteosarcoma is the most common primary bone cancer whose standard treatment includes pre-operative chemotherapy followed by resection. Chemotherapy response is used for predicting prognosis and further management of patients. Necrosis is routinely assessed post-chemotherapy from histology slides on resection specimens where necrosis ratio is defined as the ratio of necrotic tumor to overall tumor… ▽ More

    Submitted 9 August, 2022; originally announced August 2022.

  23. arXiv:2207.00220  [pdf, other

    cs.CL cs.CY

    Pile of Law: Learning Responsible Data Filtering from the Law and a 256GB Open-Source Legal Dataset

    Authors: Peter Henderson, Mark S. Krass, Lucia Zheng, Neel Guha, Christopher D. Manning, Dan Jurafsky, Daniel E. Ho

    Abstract: One concern with the rise of large language models lies with their potential for significant harm, particularly from pretraining on biased, obscene, copyrighted, and private information. Emerging ethical approaches have attempted to filter pretraining material, but such approaches have been ad hoc and failed to take context into account. We offer an approach to filtering grounded in law, which has… ▽ More

    Submitted 29 November, 2022; v1 submitted 1 July, 2022; originally announced July 2022.

    Comments: Presented at NeurIPS Datasets & Benchmarks (2022)

  24. arXiv:2206.10573  [pdf

    cs.CV q-bio.QM

    H&E-based Computational Biomarker Enables Universal EGFR Screening for Lung Adenocarcinoma

    Authors: Gabriele Campanella, David Ho, Ida Häggström, Anton S Becker, Jason Chang, Chad Vanderbilt, Thomas J Fuchs

    Abstract: Lung cancer is the leading cause of cancer death worldwide, with lung adenocarcinoma being the most prevalent form of lung cancer. EGFR positive lung adenocarcinomas have been shown to have high response rates to TKI therapy, underlying the essential nature of molecular testing for lung cancers. Despite current guidelines consider testing necessary, a large portion of patients are not routinely pr… ▽ More

    Submitted 21 June, 2022; originally announced June 2022.

  25. arXiv:2206.09875  [pdf, other

    cs.LG cs.CY

    Algorithmic Fairness and Vertical Equity: Income Fairness with IRS Tax Audit Models

    Authors: Emily Black, Hadi Elzayn, Alexandra Chouldechova, Jacob Goldin, Daniel E. Ho

    Abstract: This study examines issues of algorithmic fairness in the context of systems that inform tax audit selection by the United States Internal Revenue Service (IRS). While the field of algorithmic fairness has developed primarily around notions of treating like individuals alike, we instead explore the concept of vertical equity -- appropriately accounting for relevant differences across individuals -… ▽ More

    Submitted 20 June, 2022; originally announced June 2022.

  26. arXiv:2206.04737  [pdf, other

    cs.CY

    Outsider Oversight: Designing a Third Party Audit Ecosystem for AI Governance

    Authors: Inioluwa Deborah Raji, Peggy Xu, Colleen Honigsberg, Daniel E. Ho

    Abstract: Much attention has focused on algorithmic audits and impact assessments to hold developers and users of algorithmic systems accountable. But existing algorithmic accountability policy approaches have neglected the lessons from non-algorithmic domains: notably, the importance of interventions that allow for the effective participation of third parties. Our paper synthesizes lessons from other field… ▽ More

    Submitted 9 June, 2022; originally announced June 2022.

    Comments: Presented at 5th Annual ACM/AAAI AI Ethics and Society (AIES) conference

  27. arXiv:2204.11910  [pdf, other

    cs.LG cs.CY

    Integrating Reward Maximization and Population Estimation: Sequential Decision-Making for Internal Revenue Service Audit Selection

    Authors: Peter Henderson, Ben Chugg, Brandon Anderson, Kristen Altenburger, Alex Turk, John Guyton, Jacob Goldin, Daniel E. Ho

    Abstract: We introduce a new setting, optimize-and-estimate structured bandits. Here, a policy must select a batch of arms, each characterized by its own context, that would allow it to both maximize reward and maintain an accurate (ideally unbiased) population estimate of the reward. This setting is inherent to many public and private sector applications and often requires handling delayed feedback, small… ▽ More

    Submitted 24 January, 2023; v1 submitted 25 April, 2022; originally announced April 2022.

    Comments: Accepted to the Thirty-Seventh AAAI Conference On Artificial Intelligence (AAAI), 2023

  28. arXiv:2204.01691  [pdf, other

    cs.RO cs.CL cs.LG

    Do As I Can, Not As I Say: Grounding Language in Robotic Affordances

    Authors: Michael Ahn, Anthony Brohan, Noah Brown, Yevgen Chebotar, Omar Cortes, Byron David, Chelsea Finn, Chuyuan Fu, Keerthana Gopalakrishnan, Karol Hausman, Alex Herzog, Daniel Ho, Jasmine Hsu, Julian Ibarz, Brian Ichter, Alex Irpan, Eric Jang, Rosario Jauregui Ruano, Kyle Jeffrey, Sally Jesmonth, Nikhil J Joshi, Ryan Julian, Dmitry Kalashnikov, Yuheng Kuang, Kuang-Huei Lee , et al. (20 additional authors not shown)

    Abstract: Large language models can encode a wealth of semantic knowledge about the world. Such knowledge could be extremely useful to robots aiming to act upon high-level, temporally extended instructions expressed in natural language. However, a significant weakness of language models is that they lack real-world experience, which makes it difficult to leverage them for decision making within a given embo… ▽ More

    Submitted 16 August, 2022; v1 submitted 4 April, 2022; originally announced April 2022.

    Comments: See website at https://say-can.github.io/ V1. Initial Upload. V2. Added PaLM results. Added study about new capabilities (drawer manipulation, chain of thought prompting, multilingual instructions). Added an ablation study of language model size. Added an open-source version of \algname on a simulated tabletop environment. Improved readability

  29. arXiv:2203.15015  [pdf, other

    eess.IV cs.CV

    Deep Interactive Learning-based ovarian cancer segmentation of H&E-stained whole slide images to study morphological patterns of BRCA mutation

    Authors: David Joon Ho, M. Herman Chui, Chad M. Vanderbilt, Jiwon Jung, Mark E. Robson, Chan-Sik Park, Jin Roh, Thomas J. Fuchs

    Abstract: Deep learning has been widely used to analyze digitized hematoxylin and eosin (H&E)-stained histopathology whole slide images. Automated cancer segmentation using deep learning can be used to diagnose malignancy and to find novel morphological patterns to predict molecular subtypes. To train pixel-wise cancer segmentation models, manual annotation from pathologists is generally a bottleneck due to… ▽ More

    Submitted 28 March, 2022; originally announced March 2022.

  30. arXiv:2202.07600  [pdf, other

    cs.RO cs.LG

    Bayesian Imitation Learning for End-to-End Mobile Manipulation

    Authors: Yuqing Du, Daniel Ho, Alexander A. Alemi, Eric Jang, Mohi Khansari

    Abstract: In this work we investigate and demonstrate benefits of a Bayesian approach to imitation learning from multiple sensor inputs, as applied to the task of opening office doors with a mobile manipulator. Augmenting policies with additional sensor inputs, such as RGB + depth cameras, is a straightforward approach to improving robot perception capabilities, especially for tasks that may favor different… ▽ More

    Submitted 15 February, 2022; originally announced February 2022.

  31. arXiv:2202.05702  [pdf

    q-fin.ST cs.LG cs.NE

    Machine Learning for Stock Prediction Based on Fundamental Analysis

    Authors: Yuxuan Huang, Luiz Fernando Capretz, Danny Ho

    Abstract: Application of machine learning for stock prediction is attracting a lot of attention in recent years. A large amount of research has been conducted in this area and multiple existing results have shown that machine learning methods could be successfully used toward stock predicting using stocks historical data. Most of these existing approaches have focused on short term prediction using stocks h… ▽ More

    Submitted 26 January, 2022; originally announced February 2022.

    Comments: 10 pages. IEEE Symposium Series on Computational Intelligence, Orlando, Florida,USA, December 2021

  32. arXiv:2202.01862  [pdf, other

    cs.RO cs.LG

    Practical Imitation Learning in the Real World via Task Consistency Loss

    Authors: Mohi Khansari, Daniel Ho, Yuqing Du, Armando Fuentes, Matthew Bennice, Nicolas Sievers, Sean Kirmani, Yunfei Bai, Eric Jang

    Abstract: Recent work in visual end-to-end learning for robotics has shown the promise of imitation learning across a variety of tasks. Such approaches are expensive both because they require large amounts of real world training demonstrations and because identifying the best model to deploy in the real world requires time-consuming real-world evaluations. These challenges can be mitigated by simulation: by… ▽ More

    Submitted 7 March, 2022; v1 submitted 3 February, 2022; originally announced February 2022.

  33. arXiv:2112.10988  [pdf, other

    cs.CV cs.LG

    Mapping industrial poultry operations at scale with deep learning and aerial imagery

    Authors: Caleb Robinson, Ben Chugg, Brandon Anderson, Juan M. Lavista Ferres, Daniel E. Ho

    Abstract: Concentrated Animal Feeding Operations (CAFOs) pose serious risks to air, water, and public health, but have proven to be challenging to regulate. The U.S. Government Accountability Office notes that a basic challenge is the lack of comprehensive location information on CAFOs. We use the USDA's National Agricultural Imagery Program (NAIP) 1m/pixel aerial imagery to detect poultry CAFOs across the… ▽ More

    Submitted 21 December, 2021; originally announced December 2021.

  34. Beyond Ads: Sequential Decision-Making Algorithms in Law and Public Policy

    Authors: Peter Henderson, Ben Chugg, Brandon Anderson, Daniel E. Ho

    Abstract: We explore the promises and challenges of employing sequential decision-making algorithms -- such as bandits, reinforcement learning, and active learning -- in law and public policy. While such algorithms have well-characterized performance in the private sector (e.g., online advertising), the tendency to naively apply algorithms motivated by one domain, often online advertisements, can be called… ▽ More

    Submitted 29 November, 2022; v1 submitted 13 December, 2021; originally announced December 2021.

    Comments: Version 1 presented at Causal Inference Challenges in Sequential Decision Making: Bridging Theory and Practice (2021), a NeurIPS 2021 Workshop; Version 2 presented at the 2nd ACM Symposium on Computer Science and Law (2022) (DOI: https://dl.acm.org/doi/10.1145/3511265.3550439)

  35. arXiv:2110.13306  [pdf, other

    cs.LG

    Reconciling Risk Allocation and Prevalence Estimation in Public Health Using Batched Bandits

    Authors: Ben Chugg, Daniel E. Ho

    Abstract: In many public health settings, there is a perceived tension between allocating resources to known vulnerable areas and learning about the overall prevalence of the problem. Inspired by a door-to-door Covid-19 testing program we helped design, we combine multi-armed bandit strategies and insights from sampling theory to demonstrate how to recover accurate prevalence estimates while continuing to a… ▽ More

    Submitted 25 October, 2021; originally announced October 2021.

    Comments: Published in Machine Learning in Public Health Workshop at NeurIPS 2021

  36. Automatic Recall of Software Lessons Learned for Software Project Managers

    Authors: Tamer Mohamed Abdellatif, Luiz Fernando Capretz, Danny Ho

    Abstract: Lessons learned (LL) records constitute the software organization memory of successes and failures. LL are recorded within the organization repository for future reference to optimize planning, gain experience, and elevate market competitiveness. However, manually searching this repository is a daunting task, so it is often disregarded. This can lead to the repetition of previous mistakes or even… ▽ More

    Submitted 11 October, 2021; originally announced October 2021.

    Journal ref: Information and Software Technology Journal, Volume 115, pp. 44-57, Elsevier, November 2019

  37. An Empirical Testing of Autonomous Vehicle Simulator System for Urban Driving

    Authors: John Seymour, Dac-Thanh-Chuong Ho, Quang-Hung Luu

    Abstract: Safety is one of the main challenges that prohibit autonomous vehicles (AV), requiring them to be well tested ahead of being allowed on the road. In comparison with road tests, simulators allow us to validate the AV conveniently and affordably. However, it remains unclear how to best use the AV-based simulator system for testing effectively. Our paper presents an empirical testing of AV simulator… ▽ More

    Submitted 10 September, 2021; v1 submitted 17 August, 2021; originally announced August 2021.

    Comments: 8 pages, 8 figures, 4 tables

    Report number: Paper-No54

    Journal ref: 2021 IEEE International Conference on Artificial Intelligence Testing (AITest)

  38. arXiv:2108.07258  [pdf, other

    cs.LG cs.AI cs.CY

    On the Opportunities and Risks of Foundation Models

    Authors: Rishi Bommasani, Drew A. Hudson, Ehsan Adeli, Russ Altman, Simran Arora, Sydney von Arx, Michael S. Bernstein, Jeannette Bohg, Antoine Bosselut, Emma Brunskill, Erik Brynjolfsson, Shyamal Buch, Dallas Card, Rodrigo Castellon, Niladri Chatterji, Annie Chen, Kathleen Creel, Jared Quincy Davis, Dora Demszky, Chris Donahue, Moussa Doumbouya, Esin Durmus, Stefano Ermon, John Etchemendy, Kawin Ethayarajh , et al. (89 additional authors not shown)

    Abstract: AI is undergoing a paradigm shift with the rise of models (e.g., BERT, DALL-E, GPT-3) that are trained on broad data at scale and are adaptable to a wide range of downstream tasks. We call these models foundation models to underscore their critically central yet incomplete character. This report provides a thorough account of the opportunities and risks of foundation models, ranging from their cap… ▽ More

    Submitted 12 July, 2022; v1 submitted 16 August, 2021; originally announced August 2021.

    Comments: Authored by the Center for Research on Foundation Models (CRFM) at the Stanford Institute for Human-Centered Artificial Intelligence (HAI). Report page with citation guidelines: https://crfm.stanford.edu/report.html

  39. arXiv:2107.04140  [pdf, other

    cs.AR

    First-Generation Inference Accelerator Deployment at Facebook

    Authors: Michael Anderson, Benny Chen, Stephen Chen, Summer Deng, Jordan Fix, Michael Gschwind, Aravind Kalaiah, Changkyu Kim, Jaewon Lee, Jason Liang, Haixin Liu, Yinghai Lu, Jack Montgomery, Arun Moorthy, Satish Nadathur, Sam Naghshineh, Avinash Nayak, Jongsoo Park, Chris Petersen, Martin Schatz, Narayanan Sundaram, Bangsheng Tang, Peter Tang, Amy Yang, Jiecao Yu , et al. (90 additional authors not shown)

    Abstract: In this paper, we provide a deep dive into the deployment of inference accelerators at Facebook. Many of our ML workloads have unique characteristics, such as sparse memory accesses, large model sizes, as well as high compute, memory and network bandwidth requirements. We co-designed a high-performance, energy-efficient inference accelerator platform based on these requirements. We describe the in… ▽ More

    Submitted 4 August, 2021; v1 submitted 8 July, 2021; originally announced July 2021.

  40. Context-Aware Legal Citation Recommendation using Deep Learning

    Authors: Zihan Huang, Charles Low, Mengqiu Teng, Hongyi Zhang, Daniel E. Ho, Mark S. Krass, Matthias Grabmair

    Abstract: Lawyers and judges spend a large amount of time researching the proper legal authority to cite while drafting decisions. In this paper, we develop a citation recommendation tool that can help improve efficiency in the process of opinion drafting. We train four types of machine learning models, including a citation-list based method (collaborative filtering) and three context-based methods (text si… ▽ More

    Submitted 20 June, 2021; originally announced June 2021.

    Comments: 10 pages published in Proceedings of ICAIL 2021; link to data here: https://reglab.stanford.edu/data/bva-case-citation-dataset ; code available here: https://github.com/TUMLegalTech/bva-citation-prediction

  41. Enhancing Environmental Enforcement with Near Real-Time Monitoring: Likelihood-Based Detection of Structural Expansion of Intensive Livestock Farms

    Authors: Ben Chugg, Brandon Anderson, Seiji Eicher, Sandy Lee, Daniel E. Ho

    Abstract: Much environmental enforcement in the United States has historically relied on either self-reported data or physical, resource-intensive, infrequent inspections. Advances in remote sensing and computer vision, however, have the potential to augment compliance monitoring by detecting early warning signs of noncompliance. We demonstrate a process for rapid identification of significant structural ex… ▽ More

    Submitted 2 August, 2021; v1 submitted 28 May, 2021; originally announced May 2021.

    Journal ref: International Journal of Applied Earth Observation and Geoinformation, Volume 103, 2021, 102463, ISSN 0303-2434

  42. arXiv:2104.10637  [pdf, ps, other

    cs.LG math.FA stat.ML

    Robust Kernel-based Distribution Regression

    Authors: Zhan Yu, Daniel W. C. Ho, Ding-Xuan Zhou

    Abstract: Regularization schemes for regression have been widely studied in learning theory and inverse problems. In this paper, we study distribution regression (DR) which involves two stages of sampling, and aims at regressing from probability measures to real-valued responses over a reproducing kernel Hilbert space (RKHS). Recently, theoretical analysis on DR has been carried out via kernel ridge regress… ▽ More

    Submitted 21 April, 2021; originally announced April 2021.

    Comments: 29 pages

  43. arXiv:2104.08671  [pdf, other

    cs.CL

    When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Dataset

    Authors: Lucia Zheng, Neel Guha, Brandon R. Anderson, Peter Henderson, Daniel E. Ho

    Abstract: While self-supervised learning has made rapid advances in natural language processing, it remains unclear when researchers should engage in resource-intensive domain-specific pretraining (domain pretraining). The law, puzzlingly, has yielded few documented instances of substantial gains to domain pretraining in spite of the fact that legal language is widely seen to be unique. We hypothesize that… ▽ More

    Submitted 5 July, 2021; v1 submitted 17 April, 2021; originally announced April 2021.

    Comments: ICAIL 2021. Code & data available at https://github.com/reglab/casehold

  44. arXiv:2103.09787  [pdf, other

    cs.CV

    Temporal Cluster Matching for Change Detection of Structures from Satellite Imagery

    Authors: Caleb Robinson, Anthony Ortiz, Juan M. Lavista Ferres, Brandon Anderson, Daniel E. Ho

    Abstract: Longitudinal studies are vital to understanding dynamic changes of the planet, but labels (e.g., buildings, facilities, roads) are often available only for a single point in time. We propose a general model, Temporal Cluster Matching (TCM), for detecting building changes in time series of remotely sensed imagery when footprint labels are observed only once. The intuition behind the model is that t… ▽ More

    Submitted 29 June, 2021; v1 submitted 17 March, 2021; originally announced March 2021.

    Comments: Published in ACM COMPASS 2021

  45. arXiv:2101.06005  [pdf, other

    cs.RO

    SimGAN: Hybrid Simulator Identification for Domain Adaptation via Adversarial Reinforcement Learning

    Authors: Yifeng Jiang, Tingnan Zhang, Daniel Ho, Yunfei Bai, C. Karen Liu, Sergey Levine, Jie Tan

    Abstract: As learning-based approaches progress towards automating robot controllers design, transferring learned policies to new domains with different dynamics (e.g. sim-to-real transfer) still demands manual effort. This paper introduces SimGAN, a framework to tackle domain adaptation by identifying a hybrid physics simulator to match the simulated trajectories to the ones from the target domain, using a… ▽ More

    Submitted 31 May, 2021; v1 submitted 15 January, 2021; originally announced January 2021.

    Comments: ICRA 2021, Code Available at: https://github.com/jyf588/SimGAN ; Accompanying Video: https://youtu.be/McKOGllO7nc

  46. arXiv:2012.14285  [pdf

    cs.CY cs.LG

    Affirmative Algorithms: The Legal Grounds for Fairness as Awareness

    Authors: Daniel E. Ho, Alice Xiang

    Abstract: While there has been a flurry of research in algorithmic fairness, what is less recognized is that modern antidiscrimination law may prohibit the adoption of such techniques. We make three contributions. First, we discuss how such approaches will likely be deemed "algorithmic affirmative action," posing serious legal risks of violating equal protection, particularly under the higher education juri… ▽ More

    Submitted 18 December, 2020; originally announced December 2020.

    Comments: 12 pages, 3 figures

    Journal ref: 10/30/20 U. Chi. L. Rev. Online 143, https://lawreviewblog.uchicago.edu/2020/10/30/aa-ho-xiang/

  47. arXiv:2011.11270  [pdf, other

    cs.RO cs.LG

    COCOI: Contact-aware Online Context Inference for Generalizable Non-planar Pushing

    Authors: Zhuo Xu, Wenhao Yu, Alexander Herzog, Wenlong Lu, Chuyuan Fu, Masayoshi Tomizuka, Yunfei Bai, C. Karen Liu, Daniel Ho

    Abstract: General contact-rich manipulation problems are long-standing challenges in robotics due to the difficulty of understanding complicated contact physics. Deep reinforcement learning (RL) has shown great potential in solving robot manipulation tasks. However, existing RL policies have limited adaptability to environments with diverse dynamics properties, which is pivotal in solving many contact-rich… ▽ More

    Submitted 23 November, 2020; originally announced November 2020.

  48. Leveraging Administrative Data for Bias Audits: Assessing Disparate Coverage with Mobility Data for COVID-19 Policy

    Authors: Amanda Coston, Neel Guha, Derek Ouyang, Lisa Lu, Alexandra Chouldechova, Daniel E. Ho

    Abstract: Anonymized smartphone-based mobility data has been widely adopted in devising and evaluating COVID-19 response strategies such as the targeting of public health resources. Yet little attention has been paid to measurement validity and demographic bias, due in part to the lack of documentation about which users are represented as well as the challenge of obtaining ground truth data on unique visits… ▽ More

    Submitted 15 April, 2021; v1 submitted 13 November, 2020; originally announced November 2020.

    Journal ref: Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency. pp. 173-184

  49. arXiv:2011.03148  [pdf, other

    cs.RO

    RetinaGAN: An Object-aware Approach to Sim-to-Real Transfer

    Authors: Daniel Ho, Kanishka Rao, Zhuo Xu, Eric Jang, Mohi Khansari, Yunfei Bai

    Abstract: The success of deep reinforcement learning (RL) and imitation learning (IL) in vision-based robotic manipulation typically hinges on the expense of large scale data collection. With simulation, data to train a policy can be collected efficiently at scale, but the visual gap between sim and real makes deployment in the real world difficult. We introduce RetinaGAN, a generative adversarial network (… ▽ More

    Submitted 3 July, 2021; v1 submitted 5 November, 2020; originally announced November 2020.

    Comments: International Conference on Robotics and Automation (ICRA) 2021

    ACM Class: I.2.9

  50. arXiv:2010.00204  [pdf, ps, other

    math.OC cs.LG eess.SY

    Robust Model-Free Learning and Control without Prior Knowledge

    Authors: Dimitar Ho, John Doyle

    Abstract: We present a simple model-free control algorithm that is able to robustly learn and stabilize an unknown discrete-time linear system with full control and state feedback subject to arbitrary bounded disturbance and noise sequences. The controller does not require any prior knowledge of the system dynamics, disturbances, or noise, yet it can guarantee robust stability and provides asymptotic and wo… ▽ More

    Submitted 1 October, 2020; originally announced October 2020.

    Comments: 16 pages, 7 figures

    Journal ref: 2019 IEEE 58th Conference on Decision and Control (CDC), Nice, France, 2019, pp. 4577-4582