Skip to main content

Showing 1–40 of 40 results for author: Simon, B

  1. arXiv:2406.05285  [pdf, other

    cs.CV

    VISTA3D: Versatile Imaging SegmenTation and Annotation model for 3D Computed Tomography

    Authors: Yufan He, Pengfei Guo, Yucheng Tang, Andriy Myronenko, Vishwesh Nath, Ziyue Xu, Dong Yang, Can Zhao, Benjamin Simon, Mason Belue, Stephanie Harmon, Baris Turkbey, Daguang Xu, Wenqi Li

    Abstract: Segmentation foundation models have attracted great interest, however, none of them are adequate enough for the use cases in 3D computed tomography scans (CT) images. Existing works finetune on medical images with 2D foundation models trained on natural images, but interactive segmentation, especially in 2D, is too time-consuming for 3D scans and less useful for large cohort analysis. Models that… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  2. arXiv:2404.17544  [pdf, ps, other

    cs.DS

    Root-to-Leaf Scheduling in Write-Optimized Trees

    Authors: Christopher Chung, William Jannen, Samuel McCauley, Bertrand Simon

    Abstract: Write-optimized dictionaries are a class of cache-efficient data structures that buffer updates and apply them in batches to optimize the amortized cache misses per update. For example, a B^epsilon tree inserts updates as messages at the root. B^epsilon trees only move ("flush") messages when they have total size close to a cache line, optimizing the amount of work done per cache line written. Thu… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

  3. arXiv:2404.12485  [pdf, other

    cs.DS cs.AI cs.LG

    Contract Scheduling with Distributional and Multiple Advice

    Authors: Spyros Angelopoulos, Marcin Bienkowski, Christoph Dürr, Bertrand Simon

    Abstract: Contract scheduling is a widely studied framework for designing real-time systems with interruptible capabilities. Previous work has showed that a prediction on the interruption time can help improve the performance of contract-based systems, however it has relied on a single prediction that is provided by a deterministic oracle. In this work, we introduce and study more general and realistic lear… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: To appear in Proceedings of IJCAI 2024

  4. arXiv:2402.05764  [pdf

    cs.CY

    Datastringer: easy dataset monitoring for journalists

    Authors: Matt Shearer, Basile Simon, Clément Geiger

    Abstract: We created a software enabling journalists to define a set of criteria they would like to see applied regularly to a constantly-updated dataset, sending them an alert when these criteria are met, thus signaling them that there may be a story to write. The main challenges were to keep the product scalable and powerful, while making sure that it could be used by journalists who would not possess all… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  5. arXiv:2311.14646  [pdf, other

    cs.LG stat.ML

    More is Better in Modern Machine Learning: when Infinite Overparameterization is Optimal and Overfitting is Obligatory

    Authors: James B. Simon, Dhruva Karkada, Nikhil Ghosh, Mikhail Belkin

    Abstract: In our era of enormous neural networks, empirical progress has been driven by the philosophy that more is better. Recent deep learning practice has found repeatedly that larger model size, more data, and more computation (resulting in lower training loss) improves performance. In this paper, we give theoretical backing to these empirical observations by showing that these three properties hold in… ▽ More

    Submitted 15 May, 2024; v1 submitted 24 November, 2023; originally announced November 2023.

    Comments: Appeared in ICLR 2024

  6. arXiv:2310.17813  [pdf, other

    cs.LG

    A Spectral Condition for Feature Learning

    Authors: Greg Yang, James B. Simon, Jeremy Bernstein

    Abstract: The push to train ever larger neural networks has motivated the study of initialization and training at large network width. A key challenge is to scale training so that a network's internal representations evolve nontrivially at all widths, a process known as feature learning. Here, we show that feature learning is achieved by scaling the spectral norm of weight matrices and their updates like… ▽ More

    Submitted 13 May, 2024; v1 submitted 26 October, 2023; originally announced October 2023.

  7. arXiv:2309.10594  [pdf, other

    cs.SI cs.AI cs.LG

    Decentralized Online Learning in Task Assignment Games for Mobile Crowdsensing

    Authors: Bernd Simon, Andrea Ortiz, Walid Saad, Anja Klein

    Abstract: The problem of coordinated data collection is studied for a mobile crowdsensing (MCS) system. A mobile crowdsensing platform (MCSP) sequentially publishes sensing tasks to the available mobile units (MUs) that signal their willingness to participate in a task by sending sensing offers back to the MCSP. From the received offers, the MCSP decides the task assignment. A stable task assignment must ad… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

  8. arXiv:2309.01592  [pdf, other

    stat.ML cs.AI cs.LG hep-th math.PR

    Les Houches Lectures on Deep Learning at Large & Infinite Width

    Authors: Yasaman Bahri, Boris Hanin, Antonin Brossollet, Vittorio Erba, Christian Keup, Rosalba Pacelli, James B. Simon

    Abstract: These lectures, presented at the 2022 Les Houches Summer School on Statistical Physics and Machine Learning, focus on the infinite-width limit and large-width regime of deep neural networks. Topics covered include various statistical and dynamical properties of these networks. In particular, the lecturers discuss properties of random deep neural networks; connections between trained deep neural ne… ▽ More

    Submitted 12 February, 2024; v1 submitted 4 September, 2023; originally announced September 2023.

    Comments: These are notes from lectures delivered by Yasaman Bahri and Boris Hanin at the 2022 Les Houches Summer School on Statistics Physics and Machine Learning and a first version of them were transcribed by Antonin Brossollet, Vittorio Erba, Christian Keup, Rosalba Pacelli, James B. Simon

  9. arXiv:2306.13185  [pdf, ps, other

    stat.ML cs.LG

    An Agnostic View on the Cost of Overfitting in (Kernel) Ridge Regression

    Authors: Lijia Zhou, James B. Simon, Gal Vardi, Nathan Srebro

    Abstract: We study the cost of overfitting in noisy kernel ridge regression (KRR), which we define as the ratio between the test error of the interpolating ridgeless model and the test error of the optimally-tuned model. We take an "agnostic" view in the following sense: we consider the cost as a function of sample size for any target function, even if the sample size is not large enough for consistency or… ▽ More

    Submitted 22 March, 2024; v1 submitted 22 June, 2023; originally announced June 2023.

    Comments: This is the ICLR CR version

  10. arXiv:2306.08055  [pdf, other

    cs.LG cs.AI

    Tune As You Scale: Hyperparameter Optimization For Compute Efficient Training

    Authors: Abraham J. Fetterman, Ellie Kitanidis, Joshua Albrecht, Zachary Polizzi, Bryden Fogelman, Maksis Knutins, Bartosz Wróblewski, James B. Simon, Kanjun Qiu

    Abstract: Hyperparameter tuning of deep learning models can lead to order-of-magnitude performance gains for the same amount of compute. Despite this, systematic tuning is uncommon, particularly for large models, which are expensive to evaluate and tend to have many hyperparameters, necessitating difficult judgment calls about tradeoffs, budgets, and search bounds. To address these issues and propose a prac… ▽ More

    Submitted 13 June, 2023; originally announced June 2023.

  11. arXiv:2305.12682  [pdf, other

    cs.NI quant-ph

    Matching Game for Optimized Association in Quantum Communication Networks

    Authors: Mahdi Chehimi, Bernd Simon, Walid Saad, Anja Klein, Don Towsley, Mérouane Debbah

    Abstract: Enabling quantum switches (QSs) to serve requests submitted by quantum end nodes in quantum communication networks (QCNs) is a challenging problem due to the heterogeneous fidelity requirements of the submitted requests and the limited resources of the QCN. Effectively determining which requests are served by a given QS is fundamental to foster developments in practical QCN applications, like quan… ▽ More

    Submitted 21 May, 2023; originally announced May 2023.

    Comments: 6 pages, 4 figures

  12. arXiv:2304.01781  [pdf, ps, other

    cs.LG cs.DS

    Mixing predictions for online metric algorithms

    Authors: Antonios Antoniadis, Christian Coester, Marek Eliáš, Adam Polak, Bertrand Simon

    Abstract: A major technique in learning-augmented online algorithms is combining multiple algorithms or predictors. Since the performance of each predictor may vary over time, it is desirable to use not the single best predictor as a benchmark, but rather a dynamic combination which follows different predictors at different times. We design algorithms that combine predictions and are competitive against suc… ▽ More

    Submitted 15 December, 2023; v1 submitted 4 April, 2023; originally announced April 2023.

  13. arXiv:2303.15438  [pdf, other

    cs.LG

    On the Stepwise Nature of Self-Supervised Learning

    Authors: James B. Simon, Maksis Knutins, Liu Ziyin, Daniel Geisz, Abraham J. Fetterman, Joshua Albrecht

    Abstract: We present a simple picture of the training process of joint embedding self-supervised learning methods. We find that these methods learn their high-dimensional embeddings one dimension at a time in a sequence of discrete, well-separated steps. We arrive at this conclusion via the study of a linearized model of Barlow Twins applicable to the case in which the trained network is infinitely wide. We… ▽ More

    Submitted 30 May, 2023; v1 submitted 27 March, 2023; originally announced March 2023.

    Comments: 9 pages (main text) + 14 pages (refs + appendices). ICML '23

  14. arXiv:2210.13417  [pdf, other

    cs.AI cs.LG

    Avalon: A Benchmark for RL Generalization Using Procedurally Generated Worlds

    Authors: Joshua Albrecht, Abraham J. Fetterman, Bryden Fogelman, Ellie Kitanidis, Bartosz Wróblewski, Nicole Seo, Michael Rosenthal, Maksis Knutins, Zachary Polizzi, James B. Simon, Kanjun Qiu

    Abstract: Despite impressive successes, deep reinforcement learning (RL) systems still fall short of human performance on generalization to new tasks and environments that differ from their training. As a benchmark tailored for studying RL generalization, we introduce Avalon, a set of tasks in which embodied agents in highly diverse procedural 3D worlds must survive by navigating terrain, hunting or gatheri… ▽ More

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: Accepted to NeurIPS Datasets and Benchmarks 2022. Video and links to all code, data, etc can be found at https://generallyintelligent.com/avalon/

  15. arXiv:2210.02775  [pdf, ps, other

    cs.LG cs.DS cs.OS

    Paging with Succinct Predictions

    Authors: Antonios Antoniadis, Joan Boyar, Marek Eliáš, Lene M. Favrholdt, Ruben Hoeksma, Kim S. Larsen, Adam Polak, Bertrand Simon

    Abstract: Paging is a prototypical problem in the area of online algorithms. It has also played a central role in the development of learning-augmented algorithms -- a recent line of research that aims to ameliorate the shortcomings of classical worst-case analysis by giving algorithms access to predictions. Such predictions can typically be generated using a machine learning approach, but they are inherent… ▽ More

    Submitted 6 October, 2022; originally announced October 2022.

  16. arXiv:2209.01691  [pdf, other

    cs.LG stat.ML

    On Kernel Regression with Data-Dependent Kernels

    Authors: James B. Simon

    Abstract: The primary hyperparameter in kernel regression (KR) is the choice of kernel. In most theoretical studies of KR, one assumes the kernel is fixed before seeing the training data. Under this assumption, it is known that the optimal kernel is equal to the prior covariance of the target function. In this note, we consider KR in which the kernel may be updated after seeing the training data. We point o… ▽ More

    Submitted 26 September, 2022; v1 submitted 4 September, 2022; originally announced September 2022.

    Comments: 7 pages, 1 figure

  17. arXiv:2207.06569  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Benign, Tempered, or Catastrophic: A Taxonomy of Overfitting

    Authors: Neil Mallinar, James B. Simon, Amirhesam Abedsoltan, Parthe Pandit, Mikhail Belkin, Preetum Nakkiran

    Abstract: The practical success of overparameterized neural networks has motivated the recent scientific study of interpolating methods, which perfectly fit their training data. Certain interpolating methods, including neural networks, can fit noisy training data without catastrophically bad test performance, in defiance of standard intuitions from statistical learning theory. Aiming to explain this, a body… ▽ More

    Submitted 15 July, 2024; v1 submitted 13 July, 2022; originally announced July 2022.

    Comments: NM and JS co-first authors

  18. arXiv:2206.04615  [pdf, other

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur… ▽ More

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  19. arXiv:2203.17019  [pdf, other

    eess.AS cs.LG cs.SD

    DeepFry: Identifying Vocal Fry Using Deep Neural Networks

    Authors: Bronya R. Chernyak, Talia Ben Simon, Yael Segal, Jeremy Steffman, Eleanor Chodroff, Jennifer S. Cole, Joseph Keshet

    Abstract: Vocal fry or creaky voice refers to a voice quality characterized by irregular glottal opening and low pitch. It occurs in diverse languages and is prevalent in American English, where it is used not only to mark phrase finality, but also sociolinguistic factors and affect. Due to its irregular periodicity, creaky voice challenges automatic speech processing and recognition systems, particularly f… ▽ More

    Submitted 26 June, 2022; v1 submitted 31 March, 2022; originally announced March 2022.

    Comments: Accepted to Interspeech 2022

  20. arXiv:2202.11730  [pdf, other

    astro-ph.EP cs.LG

    Using Bayesian Deep Learning to infer Planet Mass from Gaps in Protoplanetary Disks

    Authors: Sayantan Auddy, Ramit Dey, Min-Kai Lin, Daniel Carrera, Jacob B. Simon

    Abstract: Planet induced sub-structures, like annular gaps, observed in dust emission from protoplanetary disks provide a unique probe to characterize unseen young planets. While deep learning based model has an edge in characterizing the planet's properties over traditional methods, like customized simulations and empirical relations, it lacks in its ability to quantify the uncertainty associated with its… ▽ More

    Submitted 23 February, 2022; originally announced February 2022.

    Comments: 14 pages, 6 figures, submitted to ApJ

  21. arXiv:2112.09384  [pdf, ps, other

    cs.DC cs.DS

    An Exact Algorithm for the Linear Tape Scheduling Problem

    Authors: Valentin Honoré, Bertrand Simon, Frédéric Suter

    Abstract: Magnetic tapes are often considered as an outdated storage technology, yet they are still used to store huge amounts of data. Their main interests are a large capacity and a low price per gigabyte, which come at the cost of a much larger file access time than on disks. With tapes, finding the right ordering of multiple file accesses is thus key to performance. Moving the reading head back and fort… ▽ More

    Submitted 4 May, 2022; v1 submitted 17 December, 2021; originally announced December 2021.

  22. arXiv:2110.13116  [pdf, other

    cs.DS cs.LG

    Learning-Augmented Dynamic Power Management with Multiple States via New Ski Rental Bounds

    Authors: Antonios Antoniadis, Christian Coester, Marek Eliáš, Adam Polak, Bertrand Simon

    Abstract: We study the online problem of minimizing power consumption in systems with multiple power-saving states. During idle periods of unknown lengths, an algorithm has to choose between power-saving states of different energy consumption and wake-up costs. We develop a learning-augmented online algorithm that makes decisions based on (potentially inaccurate) predicted lengths of the idle periods. The a… ▽ More

    Submitted 25 October, 2021; originally announced October 2021.

  23. arXiv:2110.03922  [pdf, other

    cs.LG stat.ML

    The Eigenlearning Framework: A Conservation Law Perspective on Kernel Regression and Wide Neural Networks

    Authors: James B. Simon, Madeline Dickens, Dhruva Karkada, Michael R. DeWeese

    Abstract: We derive simple closed-form estimates for the test risk and other generalization metrics of kernel ridge regression (KRR). Relative to prior work, our derivations are greatly simplified and our final expressions are more readily interpreted. These improvements are enabled by our identification of a sharp conservation law which limits the ability of KRR to learn any orthonormal basis of functions.… ▽ More

    Submitted 26 October, 2023; v1 submitted 8 October, 2021; originally announced October 2021.

    Comments: 12 pages (main text) + 25 pages (refs + appendices). A previous version of this manuscript was entitled "Neural Tangent Kernel Eigenvalues Accurately Predict Generalization."

  24. arXiv:2107.11774  [pdf, other

    cs.LG math.OC stat.ML

    SGD with a Constant Large Learning Rate Can Converge to Local Maxima

    Authors: Liu Ziyin, Botao Li, James B. Simon, Masahito Ueda

    Abstract: Previous works on stochastic gradient descent (SGD) often focus on its success. In this work, we construct worst-case optimization problems illustrating that, when not in the regimes that the previous works often assume, SGD can exhibit many strange and potentially undesirable behaviors. Specifically, we construct landscapes and data distributions such that (1) SGD converges to local maxima, (2) S… ▽ More

    Submitted 27 May, 2023; v1 submitted 25 July, 2021; originally announced July 2021.

    Comments: Fixed typos

  25. arXiv:2106.03186  [pdf, other

    cs.LG

    Reverse Engineering the Neural Tangent Kernel

    Authors: James B. Simon, Sajant Anand, Michael R. DeWeese

    Abstract: The development of methods to guide the design of neural networks is an important open challenge for deep learning theory. As a paradigm for principled neural architecture design, we propose the translation of high-performing kernels, which are better-understood and amenable to first-principles design, into equivalent network architectures, which have superior efficiency, flexibility, and feature… ▽ More

    Submitted 13 August, 2022; v1 submitted 6 June, 2021; originally announced June 2021.

    Comments: 15 pages, 5 figures

  26. arXiv:2103.01640  [pdf, other

    cs.LG cs.DS

    Double Coverage with Machine-Learned Advice

    Authors: Alexander Lindermayr, Nicole Megow, Bertrand Simon

    Abstract: We study the fundamental online k-server problem in a learning-augmented setting. While in the traditional online model, an algorithm has no information about the request sequence, we assume that there is given some advice (e.g. machine-learned predictions) on an algorithm's decision. There is, however, no guarantee on the quality of the prediction and it might be far from being correct. Our mai… ▽ More

    Submitted 16 November, 2021; v1 submitted 2 March, 2021; originally announced March 2021.

    Comments: Accepted at ITCS 2022

  27. arXiv:2011.05181  [pdf, other

    cs.DS

    Speed-Robust Scheduling -- Sand, Bricks, and Rocks

    Authors: Franziska Eberle, Ruben Hoeksma, Nicole Megow, Lukas Nölke, Kevin Schewior, Bertrand Simon

    Abstract: The speed-robust scheduling problem is a two-stage problem where given $m$ machines, jobs must be grouped into at most $m$ bags while the processing speeds of the given $m$ machines are unknown. After the speeds are revealed, the grouped jobs must be assigned to the machines without being separated. To evaluate the performance of algorithms, we determine upper bounds on the worst-case ratio of the… ▽ More

    Submitted 31 May, 2022; v1 submitted 10 November, 2020; originally announced November 2020.

  28. arXiv:2007.08415  [pdf, other

    cs.DS

    Fully Dynamic Algorithms for Knapsack Problems with Polylogarithmic Update Time

    Authors: Franziska Eberle, Nicole Megow, Lukas Nölke, Bertrand Simon, Andreas Wiese

    Abstract: Knapsack problems are among the most fundamental problems in optimization. In the Multiple Knapsack problem, we are given multiple knapsacks with different capacities and items with values and sizes. The task is to find a subset of items of maximum total value that can be packed into the knapsacks without exceeding the capacities. We investigate this problem and special cases thereof in the contex… ▽ More

    Submitted 4 October, 2021; v1 submitted 16 July, 2020; originally announced July 2020.

    Comments: Accepted for publication at FSTTCS 2021

  29. On Hop-Constrained Steiner Trees in Tree-Like Metrics

    Authors: Martin Böhm, Ruben Hoeksma, Nicole Megow, Lukas Nölke, Bertrand Simon

    Abstract: We consider the problem of computing a Steiner tree of minimum cost under a hop constraint which requires the depth of the tree to be at most $k$. Our main result is an exact algorithm for metrics induced by graphs with bounded treewidth that runs in time $n^{O(k)}$. For the special case of a path, we give a simple algorithm that solves the problem in polynomial time, even if $k$ is part of the in… ▽ More

    Submitted 11 October, 2022; v1 submitted 12 March, 2020; originally announced March 2020.

    Journal ref: SIAM Journal on Discrete Mathematics, Vol. 36, Iss. 2 (2022)

  30. arXiv:2003.02144  [pdf, ps, other

    cs.DS

    Online metric algorithms with untrusted predictions

    Authors: Antonios Antoniadis, Christian Coester, Marek Elias, Adam Polak, Bertrand Simon

    Abstract: Machine-learned predictors, although achieving very good results for inputs resembling training data, cannot possibly provide perfect predictions in all situations. Still, decision-making systems that are based on such predictors need not only to benefit from good predictions but also to achieve a decent performance when the predictions are inadequate. In this paper, we propose a prediction setup… ▽ More

    Submitted 6 April, 2023; v1 submitted 4 March, 2020; originally announced March 2020.

  31. Discovering and Certifying Lower Bounds for the Online Bin Stretching Problem

    Authors: Martin Böhm, Bertrand Simon

    Abstract: There are several problems in the theory of online computation where tight lower bounds on the competitive ratio are unknown and expected to be difficult to describe in a short form. A good example is the Online Bin Stretching problem, in which the task is to pack the incoming items online into bins while minimizing the load of the largest bin. Additionally, the optimal load of the entire instance… ▽ More

    Submitted 14 October, 2022; v1 submitted 4 January, 2020; originally announced January 2020.

  32. arXiv:1912.09170  [pdf, ps, other

    cs.DS cs.DC

    Energy Minimization in DAG Scheduling on MPSoCs at Run-Time: Theory and Practice

    Authors: Bertrand Simon, Joachim Falk, Nicole Megow, Jürgen Teich

    Abstract: Static (offline) techniques for mapping applications given by task graphs to MPSoC systems often deliver overly pessimistic and thus suboptimal results w.r.t. exploiting time slack in order to minimize the energy consumption. This holds true in particular in case computation times of tasks may be workload-dependent and becoming known only at runtime or in case of conditionally executed tasks or sc… ▽ More

    Submitted 19 December, 2019; originally announced December 2019.

  33. arXiv:1912.03088  [pdf, ps, other

    cs.DS

    Scheduling on Hybrid Platforms: Improved Approximability Window

    Authors: Vincent Fagnon, Imed Kacem, Giorgio Lucarelli, Bertrand Simon

    Abstract: Modern platforms are using accelerators in conjunction with standard processing units in order to reduce the running time of specific operations, such as matrix operations, and improve their performance. Scheduling on such hybrid platforms is a challenging problem since the algorithms used for the case of homogeneous resources do not adapt well. In this paper we consider the problem of scheduling… ▽ More

    Submitted 9 February, 2020; v1 submitted 6 December, 2019; originally announced December 2019.

  34. Scheduling on Two Types of Resources: a Survey

    Authors: Olivier Beaumont, Louis-claude Canon, Lionel Eyraud-Dubois, Giorgio Lucarelli, Loris Marchal, Clément Mommessin, Bertrand Simon, Denis Trystram

    Abstract: The evolution in the design of modern parallel platforms leads to revisit the scheduling jobs on distributed heterogeneous resources. The goal of this survey is to present the main existing algorithms, to classify them based on their underlying principles and to propose unified implementations to enable their fair comparison, both in terms of running time and quality of schedules, on a large set o… ▽ More

    Submitted 30 July, 2020; v1 submitted 25 September, 2019; originally announced September 2019.

    Journal ref: ACM Computing Survey, Vol. 53, No. 3, 2020

  35. Data Driven Vulnerability Exploration for Design Phase System Analysis

    Authors: Georgios Bakirtzis, Brandon J. Simon, Aidan G. Collins, Cody H. Fleming, Carl R. Elks

    Abstract: Applying security as a lifecycle practice is becoming increasingly important to combat targeted attacks in safety-critical systems. Among others there are two significant challenges in this area: (1) the need for models that can characterize a realistic system in the absence of an implementation and (2) an automated way to associate attack vector information; that is, historical data, to such syst… ▽ More

    Submitted 6 September, 2019; originally announced September 2019.

  36. Looking for a Black Cat in a Dark Room: Security Visualization for Cyber-Physical System Design and Analysis

    Authors: Georgios Bakirtzis, Brandon J. Simon, Cody H. Fleming, Carl R. Elks

    Abstract: Today, there is a plethora of software security tools employing visualizations that enable the creation of useful and effective interactive security analyst dashboards. Such dashboards can assist the analyst to understand the data at hand and, consequently, to conceive more targeted preemption and mitigation security strategies. Despite the recent advances, model-based security analysis is lacking… ▽ More

    Submitted 23 October, 2018; v1 submitted 24 August, 2018; originally announced August 2018.

  37. arXiv:1711.10693  [pdf

    cs.CV

    Small Drone Field Experiment: Data Collection & Processing

    Authors: Dalton Rosario, Christoph Borel, Damon Conover, Ryan McAlinden, Anthony Ortiz, Sarah Shiver, Blair Simon

    Abstract: Following an initiative formalized in April 2016 formally known as ARL West between the U.S. Army Research Laboratory (ARL) and University of Southern California's Institute for Creative Technologies (USC ICT), a field experiment was coordinated and executed in the summer of 2016 by ARL, USC ICT, and Headwall Photonics. The purpose was to image part of the USC main campus in Los Angeles, USA, usin… ▽ More

    Submitted 29 November, 2017; originally announced November 2017.

  38. An Efficient and Robust Social Network De-anonymization Attack

    Authors: Gábor György Gulyás, Benedek Simon, Sándor Imre

    Abstract: Releasing connection data from social networking services can pose a significant threat to user privacy. In our work, we consider structural social network de-anonymization attacks, which are used when a malicious party uses connections in a public or other identified network to re-identify users in an anonymized social network release that he obtained previously. In this paper we design and eva… ▽ More

    Submitted 13 October, 2016; originally announced October 2016.

  39. arXiv:1604.03446  [pdf

    cs.CY

    The Importance of Computing Education Research

    Authors: Steve Cooper, Jeff Forbes, Armando Fox, Susanne Hambrusch, Andrew Ko, Beth Simon

    Abstract: Interest in computer science is growing. As a result, computer science (CS) and related departments are experiencing an explosive increase in undergraduate enrollments and unprecedented demand from other disciplines for learning computing. According to the 2014 CRA Taulbee Survey, the number of undergraduates declaring a computing major at Ph.D. granting departments in the US has increased 60% fro… ▽ More

    Submitted 12 April, 2016; originally announced April 2016.

    Comments: A Computing Community Consortium (CCC) white paper, 12 pages

  40. arXiv:1410.7249  [pdf, other

    cs.DC

    Scheduling Trees of Malleable Tasks for Sparse Linear Algebra

    Authors: Abdou Guermouche, Loris Marchal, Bertrand Simon, Frédéric Vivien

    Abstract: Scientific workloads are often described as directed acyclic task graphs. In this paper, we focus on the multifrontal factorization of sparse matrices, whose task graph is structured as a tree of parallel tasks. Among the existing models for parallel tasks, the concept of malleable tasks is especially powerful as it allows each task to be processed on a time-varying number of processors. Following… ▽ More

    Submitted 4 June, 2015; v1 submitted 27 October, 2014; originally announced October 2014.

    Comments: Paper accepted for publication at EuroPar 2015