subscribe to arXiv mailings

Towards Understanding Task-agnostic Debiasing Through the Lenses of Intrinsic Bias and Forgetfulness

Authors: Guangliang Liu, Milad Afshari, Xitong Zhang, Zhiyu Xue, Avrajit Ghosh, Bidhan Bashyal, Rongrong Wang, Kristen Johnson

Abstract: While task-agnostic debiasing provides notable generalizability and reduced reliance on downstream data, its impact on language modeling ability and the risk of relearning social biases from downstream task-specific data remain as the two most significant challenges when debiasing Pretrained Language Models (PLMs). The impact on language modeling ability can be alleviated given a high-quality and… ▽ More While task-agnostic debiasing provides notable generalizability and reduced reliance on downstream data, its impact on language modeling ability and the risk of relearning social biases from downstream task-specific data remain as the two most significant challenges when debiasing Pretrained Language Models (PLMs). The impact on language modeling ability can be alleviated given a high-quality and long-contextualized debiasing corpus, but there remains a deficiency in understanding the specifics of relearning biases. We empirically ascertain that the effectiveness of task-agnostic debiasing hinges on the quantitative bias level of both the task-specific data used for downstream applications and the debiased model. We empirically show that the lower bound of the bias level of the downstream fine-tuned model can be approximated by the bias level of the debiased model, in most practical cases. To gain more in-depth understanding about how the parameters of PLMs change during fine-tuning due to the forgetting issue of PLMs, we propose a novel framework which can Propagate Socially-fair Debiasing to Downstream Fine-tuning, ProSocialTuning. Our proposed framework can push the fine-tuned model to approach the bias lower bound during downstream fine-tuning, indicating that the ineffectiveness of debiasing can be alleviated by overcoming the forgetting issue through regularizing successfully debiased attention heads based on the PLMs' bias levels from stages of pretraining and debiasing. △ Less

Submitted 6 June, 2024; originally announced June 2024.

arXiv:2406.02378 [pdf, other]

On the Intrinsic Self-Correction Capability of LLMs: Uncertainty and Latent Concept

Authors: Guangliang Liu, Haitao Mao, Bochuan Cao, Zhiyu Xue, Kristen Johnson, Jiliang Tang, Rongrong Wang

Abstract: Large Language Models (LLMs) can improve their responses when instructed to do so, a capability known as self-correction. When these instructions lack specific details about the issues in the response, this is referred to as leveraging the intrinsic self-correction capability. The empirical success of self-correction can be found in various applications, e.g., text detoxification and social bias m… ▽ More Large Language Models (LLMs) can improve their responses when instructed to do so, a capability known as self-correction. When these instructions lack specific details about the issues in the response, this is referred to as leveraging the intrinsic self-correction capability. The empirical success of self-correction can be found in various applications, e.g., text detoxification and social bias mitigation. However, leveraging this self-correction capability may not always be effective, as it has the potential to revise an initially correct response into an incorrect one. In this paper, we endeavor to understand how and why leveraging the self-correction capability is effective. We identify that appropriate instructions can guide LLMs to a convergence state, wherein additional self-correction steps do not yield further performance improvements. We empirically demonstrate that model uncertainty and activated latent concepts jointly characterize the effectiveness of self-correction. Furthermore, we provide a mathematical formulation indicating that the activated latent concept drives the convergence of the model uncertainty and self-correction performance. Our analysis can also be generalized to the self-correction behaviors observed in Vision-Language Models (VLMs). Moreover, we highlight that task-agnostic debiasing can benefit from our principle in terms of selecting effective fine-tuning samples. Such initial success demonstrates the potential extensibility for better instruction tuning and safety alignment. △ Less

Submitted 4 June, 2024; originally announced June 2024.

Comments: 22 pages, 7 figures

arXiv:2406.01792 [pdf, other]

The SemGuS Toolkit

Authors: Keith J. C. Johnson, Andrew Reynolds, Thomas Reps, Loris D'Antoni

Abstract: Semantics-Guided Synthesis (SemGuS) is a programmable framework for defining synthesis problems in a domain- and solver-agnostic way. This paper presents the standardized SemGuS format, together with an open-source toolkit that provides a parser, a verifier, and enumerative SemGuS solvers. The paper also describes an initial set of SemGuS benchmarks, which form the basis for comparing SemGuS solve… ▽ More Semantics-Guided Synthesis (SemGuS) is a programmable framework for defining synthesis problems in a domain- and solver-agnostic way. This paper presents the standardized SemGuS format, together with an open-source toolkit that provides a parser, a verifier, and enumerative SemGuS solvers. The paper also describes an initial set of SemGuS benchmarks, which form the basis for comparing SemGuS solvers, and presents an evaluation of the baseline enumerative solvers. △ Less

Submitted 3 June, 2024; originally announced June 2024.

arXiv:2405.04507 [pdf, other]

New allometric models for the USA create a step-change in forest carbon estimation, modeling, and mapping

Authors: Lucas K. Johnson, Michael J. Mahoney, Grant Domke, Colin M. Beier

Abstract: The United States national forest inventory (NFI) serves as the foundation for forest aboveground biomass (AGB) and carbon accounting across the nation. These data enable design-based estimates of forest carbon stocks and stock-changes at state and regional levels, but also serve as inputs to model-based approaches for characterizing forest carbon stocks and stock-changes at finer resolutions. Alt… ▽ More The United States national forest inventory (NFI) serves as the foundation for forest aboveground biomass (AGB) and carbon accounting across the nation. These data enable design-based estimates of forest carbon stocks and stock-changes at state and regional levels, but also serve as inputs to model-based approaches for characterizing forest carbon stocks and stock-changes at finer resolutions. Although NFI tree and plot-level data are often treated as truth in these models, they are in fact estimates based on regional species-group models known collectively as the Component Ratio Method (CRM). In late 2023 the Forest Inventory and Analysis (FIA) program introduced a new National Scale Volume and Biomass Estimators (NSVB) system to replace CRM nationwide and offer more precise and accurate representations of forest AGB and carbon. Given the prevalence of model-based AGB studies relying on FIA, there is concern about the transferability of methods from CRM to NSVB models, as well as the comparability of existing CRM AGB products (e.g. maps) to new and forthcoming NSVB AGB products. To begin addressing these concerns we compared previously published CRM AGB maps to new maps produced using identical methods with NSVB AGB reference data. Our results suggest that models relying on passive satellite imagery (e.g. Landsat) provide acceptable estimates of point-in-time NSVB AGB and carbon stocks, but fail to accurately quantify growth in mature closed-canopy forests. We highlight that existing estimates, models, and maps based on FIA reference data are no longer compatible with NSVB, and recommend new methods as well as updated models and maps for accommodating this step-change. Our collective ability to adopt NSVB in our modeling and mapping workflows will help us provide the most accurate spatial forest carbon data possible in order to better inform local management and decision making. △ Less

Submitted 7 May, 2024; originally announced May 2024.

Comments: Manuscript: 16 pages, 7 figures; Supplements: 3 pages, 2 figures; Submitted to: Remote Sensing of Environment

arXiv:2401.14581 [pdf, other]

AVELA -- A Vision for Engineering Literacy & Access: Understanding Why Technology Alone Is Not Enough

Authors: Kyle Johnson, Vicente Arroyos, Celeste Garcia, Liban Hussein, Aisha Cora, Tsewone Melaku, Jay L. Cunningham, R. Benjamin Shapiro, Vikram Iyer

Abstract: Unequal technology access for Black and Latine communities has been a persistent economic, social justice, and human rights issue despite increased technology accessibility due to advancements in consumer electronics like phones, tablets, and computers. We contextualize socio-technical access inequalities for Black and Latine urban communities and find that many students are hesitant to engage wit… ▽ More Unequal technology access for Black and Latine communities has been a persistent economic, social justice, and human rights issue despite increased technology accessibility due to advancements in consumer electronics like phones, tablets, and computers. We contextualize socio-technical access inequalities for Black and Latine urban communities and find that many students are hesitant to engage with available technologies due to a lack of engaging support systems. We present a holistic student-led STEM engagement model through AVELA - A Vision for Engineering Literacy and Access leveraging culturally responsive lessons, mentor embodied community representation, and service learning. To evaluate the model's impact after 4 years of mentoring 200+ university student instructors in teaching to 2,500+ secondary school students in 100+ classrooms, we conducted 24 semi-structured interviews with college AnonymizedOrganization members. We identify access barriers and provide principled recommendations for designing future STEM education programs. △ Less

Submitted 29 January, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

Comments: This is the author's version of the work. It is posted here for personal use, not for redistribution

arXiv:2311.17969 [pdf, other]

Generation of a Compendium of Transcription Factor Cascades and Identification of Potential Therapeutic Targets using Graph Machine Learning

Authors: Sonish Sivarajkumar, Pratyush Tandale, Ankit Bhardwaj, Kipp W. Johnson, Anoop Titus, Benjamin S. Glicksberg, Shameer Khader, Kamlesh K. Yadav, Lakshminarayanan Subramanian

Abstract: Transcription factors (TFs) play a vital role in the regulation of gene expression thereby making them critical to many cellular processes. In this study, we used graph machine learning methods to create a compendium of TF cascades using data extracted from the STRING database. A TF cascade is a sequence of TFs that regulate each other, forming a directed path in the TF network. We constructed a k… ▽ More Transcription factors (TFs) play a vital role in the regulation of gene expression thereby making them critical to many cellular processes. In this study, we used graph machine learning methods to create a compendium of TF cascades using data extracted from the STRING database. A TF cascade is a sequence of TFs that regulate each other, forming a directed path in the TF network. We constructed a knowledge graph of 81,488 unique TF cascades, with the longest cascade consisting of 62 TFs. Our results highlight the complex and intricate nature of TF interactions, where multiple TFs work together to regulate gene expression. We also identified 10 TFs with the highest regulatory influence based on centrality measurements, providing valuable information for researchers interested in studying specific TFs. Furthermore, our pathway enrichment analysis revealed significant enrichment of various pathways and functional categories, including those involved in cancer and other diseases, as well as those involved in development, differentiation, and cell signaling. The enriched pathways identified in this study may have potential as targets for therapeutic intervention in diseases associated with dysregulation of transcription factors. We have released the dataset, knowledge graph, and graphML methods for the TF cascades, and created a website to display the results, which can be accessed by researchers interested in using this dataset. Our study provides a valuable resource for understanding the complex network of interactions between TFs and their regulatory roles in cellular processes. △ Less

Submitted 29 November, 2023; originally announced November 2023.

arXiv:2310.17588 [pdf, other]

PAC-tuning:Fine-tuning Pretrained Language Models with PAC-driven Perturbed Gradient Descent

Authors: Guangliang Liu, Zhiyu Xue, Xitong Zhang, Kristen Marie Johnson, Rongrong Wang

Abstract: Fine-tuning pretrained language models (PLMs) for downstream tasks is a large-scale optimization problem, in which the choice of the training algorithm critically determines how well the trained model can generalize to unseen test data, especially in the context of few-shot learning. To achieve good generalization performance and avoid overfitting, techniques such as data augmentation and pruning… ▽ More Fine-tuning pretrained language models (PLMs) for downstream tasks is a large-scale optimization problem, in which the choice of the training algorithm critically determines how well the trained model can generalize to unseen test data, especially in the context of few-shot learning. To achieve good generalization performance and avoid overfitting, techniques such as data augmentation and pruning are often applied. However, adding these regularizations necessitates heavy tuning of the hyperparameters of optimization algorithms, such as the popular Adam optimizer. In this paper, we propose a two-stage fine-tuning method, PAC-tuning, to address this optimization challenge. First, based on PAC-Bayes training, PAC-tuning directly minimizes the PAC-Bayes generalization bound to learn proper parameter distribution. Second, PAC-tuning modifies the gradient by injecting noise with the variance learned in the first stage into the model parameters during training, resulting in a variant of perturbed gradient descent (PGD). In the past, the few-shot scenario posed difficulties for PAC-Bayes training because the PAC-Bayes bound, when applied to large models with limited training data, might not be stringent. Our experimental results across 5 GLUE benchmark tasks demonstrate that PAC-tuning successfully handles the challenges of fine-tuning tasks and outperforms strong baseline methods by a visible margin, further confirming the potential to apply PAC training for any other settings where the Adam optimizer is currently used for training. △ Less

Submitted 26 October, 2023; originally announced October 2023.

Comments: Accepted to EMNLP23 main

arXiv:2310.13781 [pdf, other]

How Much Consistency Is Your Accuracy Worth?

Authors: Jacob K. Johnson, Ana Marasović

Abstract: Contrast set consistency is a robustness measurement that evaluates the rate at which a model correctly responds to all instances in a bundle of minimally different examples relying on the same knowledge. To draw additional insights, we propose to complement consistency with relative consistency -- the probability that an equally accurate model would surpass the consistency of the proposed model,… ▽ More Contrast set consistency is a robustness measurement that evaluates the rate at which a model correctly responds to all instances in a bundle of minimally different examples relying on the same knowledge. To draw additional insights, we propose to complement consistency with relative consistency -- the probability that an equally accurate model would surpass the consistency of the proposed model, given a distribution over possible consistencies. Models with 100% relative consistency have reached a consistency peak for their accuracy. We reflect on prior work that reports consistency in contrast sets and observe that relative consistency can alter the assessment of a model's consistency compared to another. We anticipate that our proposed measurement and insights will influence future studies aiming to promote consistent behavior in models. △ Less

Submitted 20 October, 2023; originally announced October 2023.

Comments: BlackboxNLP 2023 accepted paper camera-ready version; 6 pages main, 3 pages appendix

arXiv:2309.06725 [pdf, other]

doi 10.1126/scirobotics.adg4276

Solar-powered shape-changing origami microfliers

Authors: Kyle Johnson, Vicente Arroyos, Amélie Ferran, Tilboon Elberier, Raul Villanueva, Dennis Yin, Alberto Aliseda, Sawyer Fuller, Vikram Iyer, Shyamnath Gollakota

Abstract: Using wind to disperse microfliers that fall like seeds and leaves can help automate large-scale sensor deployments. Here, we present battery-free microfliers that can change shape in mid-air to vary their dispersal distance. We design origami microfliers using bi-stable leaf-out structures and uncover an important property: a simple change in the shape of these origami structures causes two drama… ▽ More Using wind to disperse microfliers that fall like seeds and leaves can help automate large-scale sensor deployments. Here, we present battery-free microfliers that can change shape in mid-air to vary their dispersal distance. We design origami microfliers using bi-stable leaf-out structures and uncover an important property: a simple change in the shape of these origami structures causes two dramatically different falling behaviors. When unfolded and flat, the microfliers exhibit a tumbling behavior that increases lateral displacement in the wind. When folded inward, their orientation is stabilized, resulting in a downward descent that is less influenced by wind. To electronically transition between these two shapes, we designed a low-power electromagnetic actuator that produces peak forces of up to 200 millinewtons within 25 milliseconds while powered by solar cells. We fabricated a circuit directly on the folded origami structure that includes a programmable microcontroller, Bluetooth radio, solar power harvesting circuit, a pressure sensor to estimate altitude and a temperature sensor. Outdoor evaluations show that our 414 milligram origami microfliers are able to electronically change their shape mid-air, travel up to 98 meters in a light breeze, and wirelessly transmit data via Bluetooth up to 60 meters away, using only power collected from the sun. △ Less

Submitted 13 September, 2023; originally announced September 2023.

Comments: This is the author's version of the work. It is posted here by permission of the AAAS for personal use, not for redistribution. The definitive version was published in Science Robotics on September 13, 2023. DOI: 10.1126/scirobotics.adg4276

arXiv:2309.04590 [pdf, other]

Robotic Defect Inspection with Visual and Tactile Perception for Large-scale Components

Authors: Arpit Agarwal, Abhiroop Ajith, Chengtao Wen, Veniamin Stryzheus, Brian Miller, Matthew Chen, Micah K. Johnson, Jose Luis Susa Rincon, Justinian Rosca, Wenzhen Yuan

Abstract: In manufacturing processes, surface inspection is a key requirement for quality assessment and damage localization. Due to this, automated surface anomaly detection has become a promising area of research in various industrial inspection systems. A particular challenge in industries with large-scale components, like aircraft and heavy machinery, is inspecting large parts with very small defect dim… ▽ More In manufacturing processes, surface inspection is a key requirement for quality assessment and damage localization. Due to this, automated surface anomaly detection has become a promising area of research in various industrial inspection systems. A particular challenge in industries with large-scale components, like aircraft and heavy machinery, is inspecting large parts with very small defect dimensions. Moreover, these parts can be of curved shapes. To address this challenge, we present a 2-stage multi-modal inspection pipeline with visual and tactile sensing. Our approach combines the best of both visual and tactile sensing by identifying and localizing defects using a global view (vision) and using the localized area for tactile scanning for identifying remaining defects. To benchmark our approach, we propose a novel real-world dataset with multiple metallic defect types per image, collected in the production environments on real aerospace manufacturing parts, as well as online robot experiments in two environments. Our approach is able to identify 85% defects using Stage I and identify 100% defects after Stage II. The dataset is publicly available at https://zenodo.org/record/8327713 △ Less

Submitted 8 September, 2023; originally announced September 2023.

Comments: This is a pre-print for International Conference on Intelligent Robots and Systems 2023 publication

arXiv:2308.06956 [pdf, ps, other]

Modular System Synthesis

Authors: Kanghee Park, Keith J. C. Johnson, Loris D'Antoni, Thomas Reps

Abstract: This paper describes a way to improve the scalability of program synthesis by exploiting modularity: larger programs are synthesized from smaller programs. The key issue is to make each "larger-created-from-smaller" synthesis sub-problem be of a similar nature, so that the kind of synthesis sub-problem that needs to be solved--and the size of each search space--has roughly the same character at ea… ▽ More This paper describes a way to improve the scalability of program synthesis by exploiting modularity: larger programs are synthesized from smaller programs. The key issue is to make each "larger-created-from-smaller" synthesis sub-problem be of a similar nature, so that the kind of synthesis sub-problem that needs to be solved--and the size of each search space--has roughly the same character at each level. This work holds promise for creating program-synthesis tools that have far greater capabilities than currently available tools, and opens new avenues for synthesis research: how synthesis tools should support modular system design, and how synthesis applications can best exploit such capabilities. △ Less

Submitted 14 August, 2023; originally announced August 2023.

arXiv:2306.11984 [pdf, ps, other]

TauPETGen: Text-Conditional Tau PET Image Synthesis Based on Latent Diffusion Models

Authors: Se-In Jang, Cristina Lois, Emma Thibault, J. Alex Becker, Yafei Dong, Marc D. Normandin, Julie C. Price, Keith A. Johnson, Georges El Fakhri, Kuang Gong

Abstract: In this work, we developed a novel text-guided image synthesis technique which could generate realistic tau PET images from textual descriptions and the subject's MR image. The generated tau PET images have the potential to be used in examining relations between different measures and also increasing the public availability of tau PET datasets. The method was based on latent diffusion models. Both… ▽ More In this work, we developed a novel text-guided image synthesis technique which could generate realistic tau PET images from textual descriptions and the subject's MR image. The generated tau PET images have the potential to be used in examining relations between different measures and also increasing the public availability of tau PET datasets. The method was based on latent diffusion models. Both textual descriptions and the subject's MR prior image were utilized as conditions during image generation. The subject's MR image can provide anatomical details, while the text descriptions, such as gender, scan time, cognitive test scores, and amyloid status, can provide further guidance regarding where the tau neurofibrillary tangles might be deposited. Preliminary experimental results based on clinical [18F]MK-6240 datasets demonstrate the feasibility of the proposed method in generating realistic tau PET images at different clinical stages. △ Less

Submitted 20 June, 2023; originally announced June 2023.

arXiv:2305.11122 [pdf, other]

doi 10.1063/5.0159406

Autonomous sputter synthesis of thin film nitrides with composition controlled by Bayesian optimization of optical plasma emission

Authors: Davi M. Febba, Kevin R. Talley, Kendal Johnson, Stephen Schaefer, Sage R. Bauers, John S. Mangum, Rebecca W. Smaha, Andriy Zakutayev

Abstract: Autonomous experimentation has emerged as an efficient approach to accelerate the pace of materials discovery. Although instruments for autonomous synthesis have become popular in molecular and polymer science, solution processing of hybrid materials and nanoparticles, examples of autonomous tools for physical vapor deposition are scarce yet important for the semiconductor industry. Here, we repor… ▽ More Autonomous experimentation has emerged as an efficient approach to accelerate the pace of materials discovery. Although instruments for autonomous synthesis have become popular in molecular and polymer science, solution processing of hybrid materials and nanoparticles, examples of autonomous tools for physical vapor deposition are scarce yet important for the semiconductor industry. Here, we report the design and implementation of an autonomous workflow for sputter deposition of thin films with controlled composition, leveraging a highly automated sputtering reactor custom-controlled by Python, optical emission spectroscopy (OES), and a Bayesian optimization algorithm. We modeled film composition, measured by x-ray fluorescence, as a linear function of emission lines monitored during the co-sputtering from elemental Zn and Ti targets in N$_2$ atmosphere. A Bayesian control algorithm, informed by OES, navigates the space of sputtering power to fabricate films with user-defined composition, by minimizing the absolute error between desired and measured emission signals. We validated our approach by autonomously fabricating Zn$_x$Ti$_{1-x}$N$_y$ films with deviations from the targeted cation composition within relative 3.5 %, even for 15 nm thin films, demonstrating that the proposed approach can reliably synthesize thin films with specific composition and minimal human interference. Moreover, the proposed method can be extended to more difficult synthesis experiments where plasma intensity depends non-linearly on pressure, or the elemental sticking coefficients strongly depend on the substrate temperature. △ Less

Submitted 10 August, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

arXiv:2304.02632 [pdf, other]

doi 10.1016/j.foreco.2023.121348

Mapping historical forest biomass for stock-change assessments at parcel to landscape scales

Authors: Lucas K. Johnson, Michael J. Mahoney, Madeleine L. Desrochers, Colin M. Beier

Abstract: Understanding historical forest dynamics, specifically changes in forest biomass and carbon stocks, has become critical for assessing current forest climate benefits and projecting future benefits under various policy, regulatory, and stewardship scenarios. Carbon accounting frameworks based exclusively on national forest inventories are limited to broad-scale estimates, but model-based approaches… ▽ More Understanding historical forest dynamics, specifically changes in forest biomass and carbon stocks, has become critical for assessing current forest climate benefits and projecting future benefits under various policy, regulatory, and stewardship scenarios. Carbon accounting frameworks based exclusively on national forest inventories are limited to broad-scale estimates, but model-based approaches that combine these inventories with remotely sensed data can yield contiguous fine-resolution maps of forest biomass and carbon stocks across landscapes over time. Here we describe a fundamental step in building a map-based stock-change framework: mapping historical forest biomass at fine temporal and spatial resolution (annual, 30m) across all of New York State (USA) from 1990 to 2019, using freely available data and open-source tools. Using Landsat imagery, US Forest Service Forest Inventory and Analysis (FIA) data, and off-the-shelf LiDAR collections we developed three modeling approaches for mapping historical forest aboveground biomass (AGB): training on FIA plot-level AGB estimates (direct), training on LiDAR-derived AGB maps (indirect), and an ensemble averaging predictions from the direct and indirect models. Model prediction surfaces (maps) were tested against FIA estimates at multiple scales. All three approaches produced viable outputs, yet tradeoffs were evident in terms of model complexity, map accuracy, saturation, and fine-scale pattern representation. The resulting map products can help identify where, when, and how forest carbon stocks are changing as a result of both anthropogenic and natural drivers alike. These products can thus serve as inputs to a wide range of applications including stock-change assessments, monitoring reporting and verification frameworks, and prioritizing parcels for protection or enrollment in improved management programs. △ Less

Submitted 5 April, 2023; originally announced April 2023.

Comments: Manuscript: 24 pages, 7 figures; Supplements: 12 pages, 5 figures; Submitted to Forest Ecology and Management

Journal ref: Forest Ecology and Management 546, (2023), 121348

arXiv:2210.14219 [pdf, other]

Redistributor: Transforming Empirical Data Distributions

Authors: Pavol Harar, Dennis Elbrächter, Monika Dörfler, Kory D. Johnson

Abstract: We present an algorithm and package, Redistributor, which forces a collection of scalar samples to follow a desired distribution. When given independent and identically distributed samples of some random variable $S$ and the continuous cumulative distribution function of some desired target $T$, it provably produces a consistent estimator of the transformation $R$ which satisfies $R(S)=T$ in distr… ▽ More We present an algorithm and package, Redistributor, which forces a collection of scalar samples to follow a desired distribution. When given independent and identically distributed samples of some random variable $S$ and the continuous cumulative distribution function of some desired target $T$, it provably produces a consistent estimator of the transformation $R$ which satisfies $R(S)=T$ in distribution. As the distribution of $S$ or $T$ may be unknown, we also include algorithms for efficiently estimating these distributions from samples. This allows for various interesting use cases in image processing, where Redistributor serves as a remarkably simple and easy-to-use tool that is capable of producing visually appealing results. For color correction it outperforms other model-based methods and excels in achieving photorealistic style transfer, surpassing deep learning methods in content preservation. The package is implemented in Python and is optimized to efficiently handle large datasets, making it also suitable as a preprocessing step in machine learning. The source code is available at https://github.com/paloha/redistributor. △ Less

Submitted 5 July, 2024; v1 submitted 25 October, 2022; originally announced October 2022.

Comments: 16 pages, 13 figures - Added more use cases and comparisons with other methods

arXiv:2210.08343 [pdf, other]

doi 10.1016/j.cma.2023.115930

Modular machine learning-based elastoplasticity: generalization in the context of limited data

Authors: Jan N. Fuhg, Craig M. Hamel, Kyle Johnson, Reese Jones, Nikolaos Bouklas

Abstract: The development of accurate constitutive models for materials that undergo path-dependent processes continues to be a complex challenge in computational solid mechanics. Challenges arise both in considering the appropriate model assumptions and from the viewpoint of data availability, verification, and validation. Recently, data-driven modeling approaches have been proposed that aim to establish s… ▽ More The development of accurate constitutive models for materials that undergo path-dependent processes continues to be a complex challenge in computational solid mechanics. Challenges arise both in considering the appropriate model assumptions and from the viewpoint of data availability, verification, and validation. Recently, data-driven modeling approaches have been proposed that aim to establish stress-evolution laws that avoid user-chosen functional forms by relying on machine learning representations and algorithms. However, these approaches not only require a significant amount of data but also need data that probes the full stress space with a variety of complex loading paths. Furthermore, they rarely enforce all necessary thermodynamic principles as hard constraints. Hence, they are in particular not suitable for low-data or limited-data regimes, where the first arises from the cost of obtaining the data and the latter from the experimental limitations of obtaining labeled data, which is commonly the case in engineering applications. In this work, we discuss a hybrid framework that can work on a variable amount of data by relying on the modularity of the elastoplasticity formulation where each component of the model can be chosen to be either a classical phenomenological or a data-driven model depending on the amount of available information and the complexity of the response. The method is tested on synthetic uniaxial data coming from simulations as well as cyclic experimental data for structural materials. The discovered material models are found to not only interpolate well but also allow for accurate extrapolation in a thermodynamically consistent manner far outside the domain of the training data. Training aspects and details of the implementation of these models into Finite Element simulations are discussed and analyzed. △ Less

Submitted 15 October, 2022; originally announced October 2022.

Comments: 36 pages, 25 figures

arXiv:2210.00377 [pdf, other]

CHARTOPOLIS: A Small-Scale Labor-art-ory for Research and Reflection on Autonomous Vehicles, Human-Robot Interaction, and Sociotechnical Imaginaries

Authors: Sangeet Sankaramangalam Ulhas, Aditya Ravichander, Kathryn A. Johnson, Theodore P. Pavlic, Lance Gharavi, Spring Berman

Abstract: CHARTOPOLIS is a multi-faceted sociotechnical testbed meant to aid in building connections among engineers, psychologists, anthropologists, ethicists, and artists. Superficially, it is an urban autonomous-vehicle testbed that includes both a physical environment for small-scale robotic vehicles as well as a high-fidelity virtual replica that provides extra flexibility by way of computer simulation… ▽ More CHARTOPOLIS is a multi-faceted sociotechnical testbed meant to aid in building connections among engineers, psychologists, anthropologists, ethicists, and artists. Superficially, it is an urban autonomous-vehicle testbed that includes both a physical environment for small-scale robotic vehicles as well as a high-fidelity virtual replica that provides extra flexibility by way of computer simulation. However, both environments have been developed to allow for participatory simulation with human drivers as well. Each physical vehicle can be remotely operated by human drivers that have a driver-seat point of view that immerses them within the small-scale testbed, and those same drivers can also pilot high-fidelity models of those vehicles in a virtual replica of the environment. Juxtaposing human driving performance across these two contexts will help identify to what extent human driving behaviors are sensorimotor responses or involve psychological engagement with a system that has physical, not virtual, side effects and consequences. Furthermore, through collaboration with artists, we have designed the physical testbed to make tangible the reality that technological advancement causes the history of a city to fork into multiple, parallel timelines that take place within populations whose increasing isolation effectively creates multiple independent cities in one. Ultimately, CHARTOPOLIS is meant to challenge engineers to take a more holistic view when designing autonomous systems, while also enabling them to gather novel data that will assist them in making these systems more trustworthy. △ Less

Submitted 1 October, 2022; originally announced October 2022.

Comments: Submission to 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2022) Workshop on Miniature Robot Platforms for Full Scale Autonomous Vehicle Research

MSC Class: 93C85 (Primary) 91Cxx (Secondary) ACM Class: I.2.9; J.4; J.5

arXiv:2209.06167 [pdf, other]

PET image denoising based on denoising diffusion probabilistic models

Authors: Kuang Gong, Keith A. Johnson, Georges El Fakhri, Quanzheng Li, Tinsu Pan

Abstract: Due to various physical degradation factors and limited counts received, PET image quality needs further improvements. The denoising diffusion probabilistic models (DDPM) are distribution learning-based models, which try to transform a normal distribution into a specific data distribution based on iterative refinements. In this work, we proposed and evaluated different DDPM-based methods for PET i… ▽ More Due to various physical degradation factors and limited counts received, PET image quality needs further improvements. The denoising diffusion probabilistic models (DDPM) are distribution learning-based models, which try to transform a normal distribution into a specific data distribution based on iterative refinements. In this work, we proposed and evaluated different DDPM-based methods for PET image denoising. Under the DDPM framework, one way to perform PET image denoising is to provide the PET image and/or the prior image as the network input. Another way is to supply the prior image as the input with the PET image included in the refinement steps, which can fit for scenarios of different noise levels. 120 18F-FDG datasets and 140 18F-MK-6240 datasets were utilized to evaluate the proposed DDPM-based methods. Quantification show that the DDPM-based frameworks with PET information included can generate better results than the nonlocal mean and Unet-based denoising methods. Adding additional MR prior in the model can help achieve better performance and further reduce the uncertainty during image denoising. Solely relying on MR prior while ignoring the PET information can result in large bias. Regional and surface quantification shows that employing MR prior as the network input while embedding PET image as a data-consistency constraint during inference can achieve the best performance. In summary, DDPM-based PET image denoising is a flexible framework, which can efficiently utilize prior information and achieve better performance than the nonlocal mean and Unet-based denoising methods. △ Less

Submitted 14 September, 2022; v1 submitted 13 September, 2022; originally announced September 2022.

Comments: 8 figures

arXiv:2205.08530 [pdf, other]

doi 10.1016/j.jag.2022.103059

Fine-resolution landscape-scale biomass mapping using a spatiotemporal patchwork of LiDAR coverages

Authors: Lucas K. Johnson, Michael J. Mahoney, Eddie Bevilacqua, Stephen V. Stehman, Grant Domke, Colin M. Beier

Abstract: Estimating forest AGB at large scales and fine spatial resolutions has become increasingly important for greenhouse gas accounting, monitoring, and verification efforts to mitigate climate change. Airborne LiDAR is highly valuable for modeling attributes of forest structure including AGB, yet most LiDAR collections take place at local or regional scales covering irregular, non-contiguous footprint… ▽ More Estimating forest AGB at large scales and fine spatial resolutions has become increasingly important for greenhouse gas accounting, monitoring, and verification efforts to mitigate climate change. Airborne LiDAR is highly valuable for modeling attributes of forest structure including AGB, yet most LiDAR collections take place at local or regional scales covering irregular, non-contiguous footprints, resulting in a patchwork of different landscape segments at various points in time. Here, as part of a statewide forest carbon assessment for New York State (USA), we addressed common obstacles in leveraging a LiDAR patchwork for AGB mapping at landscape scales, including selection of training data, the investigation of regional or coverage specific patterns in prediction error, and map agreement with field inventory across multiple scales. Three machine learning algorithms and an ensemble model were trained with FIA field measurements, airborne LiDAR, and topographic, climatic and cadastral geodata. Using a strict set of plot selection criteria, 801 FIA plots were selected with co-located point clouds drawn from a patchwork of 17 leaf-off LiDAR coverages (2014-2019). Our ensemble model was used to produce 30 m AGB prediction surfaces within a predictor-defined area of applicability (98% of LiDAR coverage), and the resulting AGB maps were compared with FIA plot-level and areal estimates at multiple scales of aggregation. Our model was overall accurate (% RMSE 22-45%; MAE 11.6-29.4 Mg ha$^{-1}$; ME 2.4-6.3 Mg ha$^{-1}$), explained 73-80% of field-observed variation, and yielded estimates that were consistent with FIA's design-based estimates (89% of estimates within FIA's 95% CI). We share practical solutions to challenges faced in using spatiotemporal patchworks of LiDAR to meet growing needs for AGB mapping in support of applications in forest carbon accounting and ecosystem. △ Less

Submitted 4 August, 2022; v1 submitted 17 May, 2022; originally announced May 2022.

Comments: Manuscript: 19 pages, 8 figures; Supplements: 13 pages, 4 figures; Submitted to: International Journal of Applied Earth Observation and Geodata, Earth Observations for Carbon Neutrality and Sustainable Development Goals Special Issue

Journal ref: Int J Appl Earth Obs Geoinf 114 (2022) 103059

arXiv:2205.05794 [pdf, other]

Deep-Learned Generators of Porosity Distributions Produced During Metal Additive Manufacturing

Authors: Francis Ogoke, Kyle Johnson, Michael Glinsky, Chris Laursen, Sharlotte Kramer, Amir Barati Farimani

Abstract: Laser Powder Bed Fusion has become a widely adopted method for metal Additive Manufacturing (AM) due to its ability to mass produce complex parts with increased local control. However, AM produced parts can be subject to undesirable porosity, negatively influencing the properties of printed components. Thus, controlling porosity is integral for creating effective parts. A precise understanding of… ▽ More Laser Powder Bed Fusion has become a widely adopted method for metal Additive Manufacturing (AM) due to its ability to mass produce complex parts with increased local control. However, AM produced parts can be subject to undesirable porosity, negatively influencing the properties of printed components. Thus, controlling porosity is integral for creating effective parts. A precise understanding of the porosity distribution is crucial for accurately simulating potential fatigue and failure zones. Previous research on generating synthetic porous microstructures have succeeded in generating parts with high density, isotropic porosity distributions but are often inapplicable to cases with sparser, boundary-dependent pore distributions. Our work bridges this gap by providing a method that considers these constraints by deconstructing the generation problem into its constitutive parts. A framework is introduced that combines Generative Adversarial Networks with Mallat Scattering Transform-based autocorrelation methods to construct novel realizations of the individual pore geometries and surface roughness, then stochastically reconstruct them to form realizations of a porous printed part. The generated parts are compared to the existing experimental porosity distributions based on statistical and dimensional metrics, such as nearest neighbor distances, pore volumes, pore anisotropies and scattering transform based auto-correlations. △ Less

Submitted 11 May, 2022; originally announced May 2022.

arXiv:2205.05047 [pdf, other]

doi 10.1080/01431161.2022.2155086

Classification and mapping of low-statured 'shrubland' cover types in post-agricultural landscapes of the US Northeast

Authors: Michael J Mahoney, Lucas K Johnson, Abigail Z Guinan, Colin M Beier

Abstract: Novel plant communities reshape landscapes and pose challenges for land cover classification and mapping that can constrain research and stewardship efforts. In the US Northeast, emergence of low-statured woody vegetation, or shrublands, instead of secondary forests in post-agricultural landscapes is well-documented by field studies, but poorly understood from a landscape perspective, which limits… ▽ More Novel plant communities reshape landscapes and pose challenges for land cover classification and mapping that can constrain research and stewardship efforts. In the US Northeast, emergence of low-statured woody vegetation, or shrublands, instead of secondary forests in post-agricultural landscapes is well-documented by field studies, but poorly understood from a landscape perspective, which limits the ability to systematically study and manage these lands. To address gaps in classification/mapping of low-statured cover types where they have been historically rare, we developed models to predict shrubland distributions at 30m resolution across New York State (NYS), using a stacked ensemble combining a random forest, gradient boosting machine, and artificial neural network to integrate remote sensing of structural (airborne LIDAR) and optical (satellite imagery) properties of vegetation cover. We first classified a 1m canopy height model (CHM), derived from a patchwork of available LIDAR coverages, to define shrubland presence/absence. Next, these non-contiguous maps were used to train a model ensemble based on temporally-segmented imagery to predict shrubland probability for the entire study landscape (NYS). Approximately 2.5% of the CHM coverage area was classified as shrubland. Models using Landsat predictors trained on the classified CHM were effective at identifying shrubland (test set AUC=0.893, real-world AUC=0.904), in discriminating between shrub/young forest and other cover classes, and produced qualitatively sensible maps, even when extending beyond the original training data. Our results suggest that incorporation of airborne LiDAR, even from a discontinuous patchwork of coverages, can improve land cover classification of historically rare but increasingly prevalent shrubland habitats across broader areas. △ Less

Submitted 21 December, 2022; v1 submitted 9 May, 2022; originally announced May 2022.

Comments: 43 pages (35 main text, 8 supplementary materials); 11 figures (10 main text, 1 supplementary materials), 10 tables (4 main text, 6 supplementary materials)

Journal ref: The International Journal of Remote Sensing 43(19-24), (2022), 7117-7138

arXiv:2106.14638 [pdf, ps, other]

Rate and Power Adaptation for Multihop Regenerative Relaying Systems

Authors: Elyes Balti, Brian K. Johnson

Abstract: In this work, we provide a global framework analysis of a multi-hop relaying systems wherein the transmitter (TX) communicates with the receiver (RX) through a set of intermediary relays deployed either in series or in parallel. Regenerative based relaying scheme is assumed such as the repetition-coded decoded-and-forward (DF) wherein the decoding is threshold-based. To reflect a wide range of fad… ▽ More In this work, we provide a global framework analysis of a multi-hop relaying systems wherein the transmitter (TX) communicates with the receiver (RX) through a set of intermediary relays deployed either in series or in parallel. Regenerative based relaying scheme is assumed such as the repetition-coded decoded-and-forward (DF) wherein the decoding is threshold-based. To reflect a wide range of fading, we introduce the generalized $H$-function (also termed as Fox-$H$ function) distribution model which enables the modeling of radio-frequency (RF) fading like Weibull and Gamma, as well as the free-space optic (FSO) such as the Double Generalized Gamma and Málaga fading. In this context, we introduce various power and rate adaptation policies based on the channel state information (CSI) availability at TX and RX. Finally, we address the effects of relaying topology, number of relays and fading model, etc, on the performance reliability of each link adaptation policy. △ Less

Submitted 6 July, 2021; v1 submitted 15 June, 2021; originally announced June 2021.

arXiv:2104.06575 [pdf, other]

doi 10.1016/j.commatsci.2021.110756

Five Degree-of-Freedom Property Interpolation of Arbitrary Grain Boundaries via Voronoi Fundamental Zone Octonion Framework

Authors: Sterling G. Baird, Eric R. Homer, David T. Fullwood, Oliver K. Johnson

Abstract: We introduce the Voronoi fundamental zone octonion interpolation framework for grain boundary (GB) structure-property models and surrogates. The VFZO framework offers an advantage over other five degree-of-freedom based property interpolation methods because it is constructed as a point set in a manifold. This means that directly computed Euclidean distances approximate the original octonion dista… ▽ More We introduce the Voronoi fundamental zone octonion interpolation framework for grain boundary (GB) structure-property models and surrogates. The VFZO framework offers an advantage over other five degree-of-freedom based property interpolation methods because it is constructed as a point set in a manifold. This means that directly computed Euclidean distances approximate the original octonion distance with significantly reduced computation runtime (~7 CPU minutes vs. 153 CPU days for a 50000x50000 pairwise-distance matrix). This increased efficiency facilitates lower interpolation error through the use of significantly more input data. We demonstrate grain boundary energy interpolation results for a non-smooth validation function and simulated bi-crystal datasets for Fe and Ni using four interpolation methods: barycentric interpolation, Gaussian process regression (GPR), inverse-distance weighting, and nearest-neighbor interpolation. These are evaluated for 50000 random input GBs and 10 000 random prediction GBs. The best performance was achieved with GPR, which resulted in a reduction of the root mean square error (RMSE) by 83.0% relative to RMSE of a constant, average model. Likewise, interpolation on a large, noisy, molecular statics Fe simulation dataset improves performance by 34.4% compared to 21.2% in prior work. Interpolation on a small, low-noise MS Ni simulation dataset is similar to interpolation results for the original octonion metric (57.6% vs. 56.4%). A vectorized, parallelized, MATLAB interpolation function (interp5DOF.m) and related routines are available in our VFZO repository (github.com/sgbaird-5dof/interp) which can be applied to other crystallographic point groups. The VFZO framework offers advantages for computing distances between GBs, estimating property values for arbitrary GBs, and modeling surrogates of computationally expensive 5DOF functions and simulations. △ Less

Submitted 13 April, 2021; originally announced April 2021.

Comments: main: 22 pages, 10 figures; appendices: 5 pages, 3 figures; supp: 13 pages, 12 figures

arXiv:2011.01428 [pdf, other]

Leaf-like Origami with Bistability for Self-Adaptive Grasping Motions

Authors: Hiromi Yasuda, Kyle Johnson, Vicente Arroyos, Koshiro Yamaguchi, Jordan R. Raney, Jinkyu Yang

Abstract: The leaf-like origami structure was inspired by geometric patterns found in nature, exhibiting unique transitions between open and closed shapes. With a bistable energy landscape, leaf-like origami is able to replicate the autonomous grasping of objects observed in biological systems like the Venus flytrap. We show uniform grasping motions of the leaf-like origami, as well as various non-uniform g… ▽ More The leaf-like origami structure was inspired by geometric patterns found in nature, exhibiting unique transitions between open and closed shapes. With a bistable energy landscape, leaf-like origami is able to replicate the autonomous grasping of objects observed in biological systems like the Venus flytrap. We show uniform grasping motions of the leaf-like origami, as well as various non-uniform grasping motions which arise from its multi-transformable nature. Grasping motions can be triggered with high tunability due to the structure's bistable energy landscape. We demonstrate the self-adaptive grasping motion by dropping a target object onto our paper prototype, which does not require an external power source to retain the capture of the object. We also explore the non-uniform grasping motions of the leaf-like structure by selectively controlling the creases, which reveals various unique grasping configurations that can be exploited for versatile, autonomous, and self-adaptive robotic operations. △ Less

Submitted 2 November, 2020; originally announced November 2020.

arXiv:2010.14779 [pdf, other]

Stochastic Geometry Analysis of Uplink Cellular Networks with FSO Backhauling: Cooperative Relaying Vs. Reflecting Surfaces

Authors: Elyes Balti, Brian K. Johnson

Abstract: In this work, we consider the performance analysis of the uplink cellular networks with free space optics (FSO) backhauling. The user equipment (UE) communicates with the nearest Base Station (BS) in first slot while in second slot, the BS converts the received radio frequency (RF) signal into FSO pulse and transmits to the data center. We adopt the Rayleigh fading for the uplink channels while th… ▽ More In this work, we consider the performance analysis of the uplink cellular networks with free space optics (FSO) backhauling. The user equipment (UE) communicates with the nearest Base Station (BS) in first slot while in second slot, the BS converts the received radio frequency (RF) signal into FSO pulse and transmits to the data center. We adopt the Rayleigh fading for the uplink channels while the FSO backhaul encompasses the turbulence-induced fading which follows the Málaga distribution, the weather pathloss, and the pointing errors fading which is distributed following the generalized Beckmann model. Further, we will compare the performances when the BS behaves as either a decode-and-forward (DF) relay or an intelligent reflecting surface (IRS). Next, we will propose an optimal design of the phase shifters of the IRS to minimize the interference and improve the rate to beat relaying. Capitalizing on this framework, we will derive the system performance metrics such as the coverage probability as well as the spectral efficiency. Focusing on high SNR, we will obtain the diversity gain to get engineering insights into the system performance and limitations. Finally, the analytical results are confirmed by Monte Carlo simulations. △ Less

Submitted 28 October, 2020; originally announced October 2020.

arXiv:2010.12166 [pdf, other]

MmWaves Cellular V2X for Cooperative Diversity Relay Fast Fading Channels

Authors: Elyes Balti, Brian K. Johnson

Abstract: In this work, we present a framework analysis of millimeter waves (mmWaves) vehicular communications systems. Communications between vehicles take place through a cooperative relay which acts as an intermediary base station (BS). The relay is equipped with multiple transmit and receive antennas and it employs decode-and-forward (DF) to process the signal. Also, the relay applies maximal ratio comb… ▽ More In this work, we present a framework analysis of millimeter waves (mmWaves) vehicular communications systems. Communications between vehicles take place through a cooperative relay which acts as an intermediary base station (BS). The relay is equipped with multiple transmit and receive antennas and it employs decode-and-forward (DF) to process the signal. Also, the relay applies maximal ratio combining (MRC), and maximal ratio transmission (MRT), respectively, to receive and forward the signal. As the vehicles' speeds are relatively high, the channel experiences a fast fading and this time variation is modeled following the Jakes' autocorrelation model. We also assume narrowband fading channel. Closed-form expressions of the reliability metrics such as the outage probability, the probability of error and the channel capacity are derived. Capitalizing on these performances, we derive the low and high power regimes for the capacity, and the high signal-to-interference-plus-noise-ratio (SINR) asymptotes for the outage and error probability to get full insights into the system gains such as the diversity and coding gains. △ Less

Submitted 23 October, 2020; originally announced October 2020.

arXiv:2009.06852 [pdf]

Miniaturized Circuitry for Capacitive Self-sensing and Closed-loop Control of Soft Electrostatic Transducers

Authors: Khoi Ly, Nicholas Kellaris, Dade McMorris, Brian K. Johnson, Eric Acome, Vani Sundaram, Mantas Naris, J. Sean Humbert, Mark E. Rentschler, Christoph Keplinger, Nikolaus Correll

Abstract: Soft robotics is a field of robotic system design characterized by materials and structures that exhibit large-scale deformation, high compliance, and rich multifunctionality. The incorporation of soft and deformable structures endows soft robotic systems with the compliance and resiliency that makes them well-adapted for unstructured and dynamic environments. While actuation mechanisms for soft r… ▽ More Soft robotics is a field of robotic system design characterized by materials and structures that exhibit large-scale deformation, high compliance, and rich multifunctionality. The incorporation of soft and deformable structures endows soft robotic systems with the compliance and resiliency that makes them well-adapted for unstructured and dynamic environments. While actuation mechanisms for soft robots vary widely, soft electrostatic transducers such as dielectric elastomer actuators (DEAs) and hydraulically amplified self-healing electrostatic (HASEL) actuators have demonstrated promise due to their muscle-like performance and capacitive self-sensing capabilities. Despite previous efforts to implement self-sensing in electrostatic transducers by overlaying sinusoidal low-voltage signals, these designs still require sensing high-voltage signals, requiring bulky components that prevent integration with miniature, untethered soft robots. We present a circuit design that eliminates the need for any high-voltage sensing components, thereby facilitating the design of simple, low cost circuits using off-the-shelf components. Using this circuit, we perform simultaneous sensing and actuation for a range of electrostatic transducers including circular DEAs and HASEL actuators and demonstrate accurate estimated displacements with errors under 4%. We further develop this circuit into a compact and portable system that couples HV actuation, sensing, and computation as a prototype towards untethered, multifunctional soft robotic systems. Finally, we demonstrate the capabilities of our self-sensing design through feedback-control of a robotic arm powered by Peano-HASEL actuators. △ Less

Submitted 14 September, 2020; originally announced September 2020.

Comments: 35 pages, 7 main figures, 7 supplementary figures, 3 supplementary videos, accepted to Soft Robotics 2020

arXiv:1909.09224 [pdf, other]

The Colliding Reciprocal Dance Problem: A Mitigation Strategy with Application to Automotive Active Safety Systems

Authors: Jeffrey Kane Johnson

Abstract: A reciprocal dance occurs when two mobile agents attempt to pass each other but incompatible interaction models result in repeated attempts to take mutually blocking actions. Often, such a situation simply results in deadlock. But in systems with significant inertial constraints, it can result in collision. This abstract presents this colliding variant of the reciprocal dance, how it arises, and a… ▽ More A reciprocal dance occurs when two mobile agents attempt to pass each other but incompatible interaction models result in repeated attempts to take mutually blocking actions. Often, such a situation simply results in deadlock. But in systems with significant inertial constraints, it can result in collision. This abstract presents this colliding variant of the reciprocal dance, how it arises, and a mitigation strategy that can improve safety without sacrificing flexibility. A demonstration of the concept is provided in the context of automotive active safety. △ Less

Submitted 19 September, 2019; originally announced September 2019.

Comments: Extended abstract submitted to the 2019 Northeast Robotics Colloquium

arXiv:1905.10634 [pdf, other]

Adaptive, Distribution-Free Prediction Intervals for Deep Networks

Authors: Danijel Kivaranovic, Kory D. Johnson, Hannes Leeb

Abstract: The machine learning literature contains several constructions for prediction intervals that are intuitively reasonable but ultimately ad-hoc in that they do not come with provable performance guarantees. We present methods from the statistics literature that can be used efficiently with neural networks under minimal assumptions with guaranteed performance. We propose a neural network that outputs… ▽ More The machine learning literature contains several constructions for prediction intervals that are intuitively reasonable but ultimately ad-hoc in that they do not come with provable performance guarantees. We present methods from the statistics literature that can be used efficiently with neural networks under minimal assumptions with guaranteed performance. We propose a neural network that outputs three values instead of a single point estimate and optimizes a loss function motivated by the standard quantile regression loss. We provide two prediction interval methods with finite sample coverage guarantees solely under the assumption that the observations are independent and identically distributed. The first method leverages the conformal inference framework and provides average coverage. The second method provides a new, stronger guarantee by conditioning on the observed data. Lastly, our loss function does not compromise the predictive accuracy of the network like other prediction interval methods. We demonstrate the ease of use of our procedures as well as its improvements over other methods on both simulated and real data. As most deep networks can easily be modified by our method to output predictions with valid prediction intervals, its use should become standard practice, much like reporting standard errors along with mean estimates. △ Less

Submitted 24 February, 2020; v1 submitted 25 May, 2019; originally announced May 2019.

arXiv:1812.09952 [pdf, other]

Efficient Parametric Model Checking Using Domain Knowledge

Authors: Radu Calinescu, Colin Paterson, Kenneth Johnson

Abstract: We introduce an efficient parametric model checking (ePMC) method for the analysis of reliability, performance and other quality-of-service (QoS) properties of software systems. ePMC speeds up the analysis of parametric Markov chains modelling the behaviour of software by exploiting domain-specific modelling patterns for the software components. To this end, ePMC precomputes closed-form expression… ▽ More We introduce an efficient parametric model checking (ePMC) method for the analysis of reliability, performance and other quality-of-service (QoS) properties of software systems. ePMC speeds up the analysis of parametric Markov chains modelling the behaviour of software by exploiting domain-specific modelling patterns for the software components. To this end, ePMC precomputes closed-form expressions for key QoS properties of such patterns, and uses these expressions in the analysis of whole-system models. To evaluate ePMC, we show that its application to service-based systems and multi-tier software architectures reduces analysis time by several orders of magnitude compared to current parametric model checking methods. △ Less

Submitted 24 December, 2018; originally announced December 2018.

ACM Class: D.2.19.c; D.2.4.e

arXiv:1709.09662 [pdf, other]

Image Space Potential Fields: Constant Size Environment Representation for Vision-based Subsumption Control Architectures

Authors: Jeffrey Kane Johnson

Abstract: This technical report presents an environment representation for use in vision-based navigation. The representation has two useful properties: 1) it has constant size, which can enable strong run-time guarantees to be made for control algorithms using it, and 2) it is structurally similar to a camera image space, which effectively allows control to operate in the sensor space rather than employing… ▽ More This technical report presents an environment representation for use in vision-based navigation. The representation has two useful properties: 1) it has constant size, which can enable strong run-time guarantees to be made for control algorithms using it, and 2) it is structurally similar to a camera image space, which effectively allows control to operate in the sensor space rather than employing difficult, and often inaccurate, projections into a structurally different control space (e.g. Euclidean). The presented representation is intended to form the basis of a vision-based subsumption control architecture. △ Less

Submitted 26 September, 2017; originally announced September 2017.

Comments: Maeve Automation Technical Report. arXiv admin note: text overlap with arXiv:1709.03947

arXiv:1709.03947 [pdf, other]

Constant Space Complexity Environment Representation for Vision-based Navigation

Authors: Jeffrey Kane Johnson

Abstract: This paper presents a preliminary conceptual investigation into an environment representation that has constant space complexity with respect to the camera image space. This type of representation allows the planning algorithms of a mobile agent to bypass what are often complex and noisy transformations between camera image space and Euclidean space. The approach is to compute per-pixel potential… ▽ More This paper presents a preliminary conceptual investigation into an environment representation that has constant space complexity with respect to the camera image space. This type of representation allows the planning algorithms of a mobile agent to bypass what are often complex and noisy transformations between camera image space and Euclidean space. The approach is to compute per-pixel potential values directly from processed camera data, which results in a discrete potential field that has constant space complexity with respect to the image plane. This can enable planning and control algorithms, whose complexity often depends on the size of the environment representation, to be defined with constant run-time. This type of approach can be particularly useful for platforms with strict resource constraints, such as embedded and real-time systems. △ Less

Submitted 12 September, 2017; originally announced September 2017.

Comments: IROS 2017: 9th Workshop on Planning, Perception and Navigation for Intelligent Vehicles

arXiv:1701.07484 [pdf, other]

Monitoring and Intervention: Concepts and Formal Models

Authors: Kenneth Johnson, John V. Tucker, Victoria Wang

Abstract: Our machines, products, utilities, and environments have long been monitored by embedded software systems. Our professional, commercial, social and personal lives are also subject to monitoring as they are mediated by software systems. Data on nearly everything now exists, waiting to be collected and analysed for all sorts of reasons. Given the rising tide of data we pose the questions: What is mo… ▽ More Our machines, products, utilities, and environments have long been monitored by embedded software systems. Our professional, commercial, social and personal lives are also subject to monitoring as they are mediated by software systems. Data on nearly everything now exists, waiting to be collected and analysed for all sorts of reasons. Given the rising tide of data we pose the questions: What is monitoring? Do diverse and disparate monitoring systems have anything in common? We attempt answer these questions by proposing an abstract conceptual framework for studying monitoring. We argue that it captures a structure common to many different monitoring practices, and that from it detailed formal models can be derived, customised to applications. The framework formalises the idea that monitoring is a process that observes the behaviour of people and objects in a context. The entities and their behaviours are represented by abstract data types and the observable attributes by logics. Since monitoring usually has a specific purpose, we extend the framework with protocols for detecting attributes or events that require interventions and, possibly, a change in behaviour. Our theory is illustrated by a case study from criminal justice, that of electronic tagging. △ Less

Submitted 25 January, 2017; originally announced January 2017.

Comments: 29 pages, 1 figure

arXiv:1212.0575 [pdf]

doi 10.1118/1.3700166

Sparse and Optimal Acquisition Design for Diffusion MRI and Beyond

Authors: Cheng Guan Koay, Evren Özarslan, Kevin M Johnson, M. Elizabeth Meyerand

Abstract: The focus of this paper is on the development of a sparse and optimal acquisition (SOA) design for diffusion MRI multiple-shell acquisition and beyond. A novel optimality criterion is proposed for sparse multiple-shell acquisition and quasi multiple-shell designs in diffusion MRI and a novel and effective semi-stochastic and moderately greedy combinatorial search strategy with simulated annealing… ▽ More The focus of this paper is on the development of a sparse and optimal acquisition (SOA) design for diffusion MRI multiple-shell acquisition and beyond. A novel optimality criterion is proposed for sparse multiple-shell acquisition and quasi multiple-shell designs in diffusion MRI and a novel and effective semi-stochastic and moderately greedy combinatorial search strategy with simulated annealing to locate the optimum design or configuration. Even though the number of distinct configurations for a given set of diffusion gradient directions is very large in general---e.g., in the order of 10^{232} for a set of 144 diffusion gradient directions, the proposed search strategy was found to be effective in finding the optimum configuration. It was found that the square design is the most robust (i.e., with stable condition numbers and A-optimal measures under varying experimental conditions) among many other possible designs of the same sample size. Under the same performance evaluation, the square design was found to be more robust than the widely used sampling schemes similar to that of 3D radial MRI and of diffusion spectrum imaging (DSI). △ Less

Submitted 5 December, 2012; v1 submitted 3 December, 2012; originally announced December 2012.

Comments: 41 pages, 2 tables and 9 figures

Journal ref: Med. Phys. 39, 2499 (2012)

arXiv:1105.4665 [pdf, ps, other]

Improved Linear Programming Decoding using Frustrated Cycles

Authors: Shrinivas Kudekar, Jason K. Johnson, Misha Chertkov

Abstract: We consider transmission over a binary-input additive white Gaussian noise channel using low-density parity-check codes. One of the most popular techniques for decoding low-density parity-check codes is the linear programming decoder. In general, the linear programming decoder is suboptimal. I.e., the word error rate is higher than the optimal, maximum a posteriori decoder. In this paper we pres… ▽ More We consider transmission over a binary-input additive white Gaussian noise channel using low-density parity-check codes. One of the most popular techniques for decoding low-density parity-check codes is the linear programming decoder. In general, the linear programming decoder is suboptimal. I.e., the word error rate is higher than the optimal, maximum a posteriori decoder. In this paper we present a systematic approach to enhance the linear program decoder. More precisely, in the cases where the linear program outputs a fractional solution, we give a simple algorithm to identify frustrated cycles which cause the output of the linear program to be fractional. Then adding these cycles, adaptively to the basic linear program, we show improved word error rate performance. △ Less

Submitted 23 May, 2011; originally announced May 2011.

Comments: 5 Pages, Submitted to Information Theory Workshop (ITW) 2011

Report number: LA-UR 11-02962

arXiv:1102.5386 [pdf, ps, other]

Linear Programming based Detectors for Two-Dimensional Intersymbol Interference Channels

Authors: Shrinivas Kudekar, Jason K. Johnson, Michael Chertkov

Abstract: We present and study linear programming based detectors for two-dimensional intersymbol interference channels. Interesting instances of two-dimensional intersymbol interference channels are magnetic storage, optical storage and Wyner's cellular network model. We show that the optimal maximum a posteriori detection in such channels lends itself to a natural linear programming based sub-optimal de… ▽ More We present and study linear programming based detectors for two-dimensional intersymbol interference channels. Interesting instances of two-dimensional intersymbol interference channels are magnetic storage, optical storage and Wyner's cellular network model. We show that the optimal maximum a posteriori detection in such channels lends itself to a natural linear programming based sub-optimal detector. We call this the Pairwise linear program detector. Our experiments show that the Pairwise linear program detector performs poorly. We then propose two methods to strengthen our detector. These detectors are based on systematically enhancing the Pairwise linear program. The first one, the Block linear program detector adds higher order potential functions in an {\em exhaustive} manner, as constraints, to the Pairwise linear program detector. We show by experiments that the Block linear program detector has performance close to the optimal detector. We then develop another detector by {\em adaptively} adding frustrated cycles to the Pairwise linear program detector. Empirically, this detector also has performance close to the optimal one and turns out to be less complex then the Block linear program detector. △ Less

Submitted 25 February, 2011; originally announced February 2011.

Comments: 5 Pages, Submitted to ISIT 2011

Report number: LA-UR 11-01283

arXiv:1011.3494 [pdf, other]

Learning Planar Ising Models

Authors: Jason K. Johnson, Praneeth Netrapalli, Michael Chertkov

Abstract: Inference and learning of graphical models are both well-studied problems in statistics and machine learning that have found many applications in science and engineering. However, exact inference is intractable in general graphical models, which suggests the problem of seeking the best approximation to a collection of random variables within some tractable family of graphical models. In this paper… ▽ More Inference and learning of graphical models are both well-studied problems in statistics and machine learning that have found many applications in science and engineering. However, exact inference is intractable in general graphical models, which suggests the problem of seeking the best approximation to a collection of random variables within some tractable family of graphical models. In this paper, we focus our attention on the class of planar Ising models, for which inference is tractable using techniques of statistical physics [Kac and Ward; Kasteleyn]. Based on these techniques and recent methods for planarity testing and planar embedding [Chrobak and Payne], we propose a simple greedy algorithm for learning the best planar Ising model to approximate an arbitrary collection of binary random variables (possibly from sample data). Given the set of all pairwise correlations among variables, we select a planar graph and optimal planar Ising model defined on this graph to best approximate that set of correlations. We demonstrate our method in some simulations and for the application of modeling senate voting records. △ Less

Submitted 15 November, 2010; originally announced November 2010.

Comments: 11 pages, 4 figures, Submitted to 14th International Conference on Artificial Intelligence and Statistics (AISTATS 2011)

Report number: LANL LA-UR 10-07656

arXiv:1007.2442 [pdf]

Neural Network Based Reconstruction of a 3D Object from a 2D Wireframe

Authors: Kyle Johnson, Clayton Chang, Hod Lipson

Abstract: We propose a new approach for constructing a 3D representation from a 2D wireframe drawing. A drawing is simply a parallel projection of a 3D object onto a 2D surface; humans are able to recreate mental 3D models from 2D representations very easily, yet the process is very difficult to emulate computationally. We hypothesize that our ability to perform this construction relies on the angles in the… ▽ More We propose a new approach for constructing a 3D representation from a 2D wireframe drawing. A drawing is simply a parallel projection of a 3D object onto a 2D surface; humans are able to recreate mental 3D models from 2D representations very easily, yet the process is very difficult to emulate computationally. We hypothesize that our ability to perform this construction relies on the angles in the 2D scene, among other geometric properties. Being able to reproduce this reconstruction process automatically would allow for efficient and robust 3D sketch interfaces. Our research focuses on the relationship between 2D geometry observable in the sketch and 3D geometry derived from a potential 3D construction. We present a fully automated system that constructs 3D representations from 2D wireframes using a neural network in conjunction with a genetic search algorithm. △ Less

Submitted 14 July, 2010; originally announced July 2010.

arXiv:1004.2285 [pdf, other]

doi 10.1109/CDC.2010.5717226

A Majorization-Minimization Approach to Design of Power Transmission Networks

Authors: Jason K. Johnson, Michael Chertkov

Abstract: We propose an optimization approach to design cost-effective electrical power transmission networks. That is, we aim to select both the network structure and the line conductances (line sizes) so as to optimize the trade-off between network efficiency (low power dissipation within the transmission network) and the cost to build the network. We begin with a convex optimization method based on the p… ▽ More We propose an optimization approach to design cost-effective electrical power transmission networks. That is, we aim to select both the network structure and the line conductances (line sizes) so as to optimize the trade-off between network efficiency (low power dissipation within the transmission network) and the cost to build the network. We begin with a convex optimization method based on the paper ``Minimizing Effective Resistance of a Graph'' [Ghosh, Boyd \& Saberi]. We show that this (DC) resistive network method can be adapted to the context of AC power flow. However, that does not address the combinatorial aspect of selecting network structure. We approach this problem as selecting a subgraph within an over-complete network, posed as minimizing the (convex) network power dissipation plus a non-convex cost on line conductances that encourages sparse networks where many line conductances are set to zero. We develop a heuristic approach to solve this non-convex optimization problem using: (1) a continuation method to interpolate from the smooth, convex problem to the (non-smooth, non-convex) combinatorial problem, (2) the majorization-minimization algorithm to perform the necessary intermediate smooth but non-convex optimization steps. Ultimately, this involves solving a sequence of convex optimization problems in which we iteratively reweight a linear cost on line conductances to fit the actual non-convex cost. Several examples are presented which suggest that the overall method is a good heuristic for network design. We also consider how to obtain sparse networks that are still robust against failures of lines and/or generators. △ Less

Submitted 13 September, 2010; v1 submitted 13 April, 2010; originally announced April 2010.

Comments: 8 pages, 3 figures. To appear in Proc. 49th IEEE Conference on Decision and Control (CDC '10)

Report number: LANL LA-UR 10-02039

arXiv:0901.4192 [pdf, ps, other]

doi 10.1109/ISIT.2009.5205777

Fixing Convergence of Gaussian Belief Propagation

Authors: Jason K. Johnson, Danny Bickson, Danny Dolev

Abstract: Gaussian belief propagation (GaBP) is an iterative message-passing algorithm for inference in Gaussian graphical models. It is known that when GaBP converges it converges to the correct MAP estimate of the Gaussian random vector and simple sufficient conditions for its convergence have been established. In this paper we develop a double-loop algorithm for forcing convergence of GaBP. Our method… ▽ More Gaussian belief propagation (GaBP) is an iterative message-passing algorithm for inference in Gaussian graphical models. It is known that when GaBP converges it converges to the correct MAP estimate of the Gaussian random vector and simple sufficient conditions for its convergence have been established. In this paper we develop a double-loop algorithm for forcing convergence of GaBP. Our method computes the correct MAP estimate even in cases where standard GaBP would not have converged. We further extend this construction to compute least-squares solutions of over-constrained linear systems. We believe that our construction has numerous applications, since the GaBP algorithm is linked to solution of linear systems of equations, which is a fundamental problem in computer science and engineering. As a case study, we discuss the linear detection problem. We show that using our new construction, we are able to force convergence of Montanari's linear detection algorithm, in cases where it would originally fail. As a consequence, we are able to increase significantly the number of users that can transmit concurrently. △ Less

Submitted 3 July, 2009; v1 submitted 27 January, 2009; originally announced January 2009.

Comments: In the IEEE International Symposium on Information Theory (ISIT) 2009, Seoul, South Korea, July 2009

arXiv:0710.0013 [pdf, ps, other]

Lagrangian Relaxation for MAP Estimation in Graphical Models

Authors: Jason K. Johnson, Dmitry M. Malioutov, Alan S. Willsky

Abstract: We develop a general framework for MAP estimation in discrete and Gaussian graphical models using Lagrangian relaxation techniques. The key idea is to reformulate an intractable estimation problem as one defined on a more tractable graph, but subject to additional constraints. Relaxing these constraints gives a tractable dual problem, one defined by a thin graph, which is then optimized by an it… ▽ More We develop a general framework for MAP estimation in discrete and Gaussian graphical models using Lagrangian relaxation techniques. The key idea is to reformulate an intractable estimation problem as one defined on a more tractable graph, but subject to additional constraints. Relaxing these constraints gives a tractable dual problem, one defined by a thin graph, which is then optimized by an iterative procedure. When this iterative optimization leads to a consistent estimate, one which also satisfies the constraints, then it corresponds to an optimal MAP estimate of the original model. Otherwise there is a ``duality gap'', and we obtain a bound on the optimal solution. Thus, our approach combines convex optimization with dynamic programming techniques applicable for thin graphs. The popular tree-reweighted max-product (TRMP) method may be seen as solving a particular class of such relaxations, where the intractable graph is relaxed to a set of spanning trees. We also consider relaxations to a set of small induced subgraphs, thin subgraphs (e.g. loops), and a connected tree obtained by ``unwinding'' cycles. In addition, we propose a new class of multiscale relaxations that introduce ``summary'' variables. The potential benefits of such generalizations include: reducing or eliminating the ``duality gap'' in hard problems, reducing the number or Lagrange multipliers in the dual problem, and accelerating convergence of the iterative optimization procedure. △ Less

Submitted 28 September, 2007; originally announced October 2007.

Comments: 10 pages, presented at 45th Allerton conference on communication, control and computing, to appear in proceedings

arXiv:cs/9909001 [pdf, ps, other]

Emerging Challenges in Computational Topology

Authors: Marshall Bern, David Eppstein, Pankaj K. Agarwal, Nina Amenta, Paul Chew, Tamal Dey, David P. Dobkin, Herbert Edelsbrunner, Cindy Grimm, Leonidas J. Guibas, John Harer, Joel Hass, Andrew Hicks, Carroll K. Johnson, Gilad Lerman, David Letscher, Paul Plassmann, Eric Sedgwick, Jack Snoeyink, Jeff Weeks, Chee Yap, Denis Zorin

Abstract: Here we present the results of the NSF-funded Workshop on Computational Topology, which met on June 11 and 12 in Miami Beach, Florida. This report identifies important problems involving both computation and topology. Here we present the results of the NSF-funded Workshop on Computational Topology, which met on June 11 and 12 in Miami Beach, Florida. This report identifies important problems involving both computation and topology. △ Less

Submitted 1 September, 1999; originally announced September 1999.

Comments: 20 pages

ACM Class: F.2.2; I.2.9; I.2.10; I.3.5; J.2

Showing 1–42 of 42 results for author: Johnson, K