Showing 1–42 of 42 results for author: Akella, A

  1. arXiv:2407.06325  [pdf, other]

    cs.LG cs.DC math.OC

    CONGO: Compressive Online Gradient Optimization with Application to Microservices Management

    Authors: Jeremy Carleton, Prathik Vijaykumar, Divyanshu Saxena, Dheeraj Narasimha, Srinivas Shakkottai, Aditya Akella

    Abstract: We address the challenge of online convex optimization where the objective function's gradient exhibits sparsity, indicating that only a small number of dimensions possess non-zero gradients. Our aim is to leverage this sparsity to obtain useful estimates of the objective function's gradient even when the only information available is a limited number of function samples. Our motivation stems from… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 28 pages, 7 figures

  2. arXiv:2405.19213  [pdf, other]

    eess.SY cs.AI cs.LG cs.NI

    HawkVision: Low-Latency Modeless Edge AI Serving

    Authors: ChonLam Lao, Jiaqi Gao, Ganesh Ananthanarayanan, Aditya Akella, Minlan Yu

    Abstract: The trend of modeless ML inference is increasingly growing in popularity as it hides the complexity of model inference from users and caters to diverse user and application accuracy requirements. Previous work mostly focuses on modeless inference in data centers. To provide low-latency inference, in this paper, we promote modeless inference at the edge. The edge environment introduces additional c… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  3. arXiv:2405.05618  [pdf, other]

    cs.LG cs.CL

    An Automatic Prompt Generation System for Tabular Data Tasks

    Authors: Ashlesha Akella, Abhijit Manatkar, Brij Chavda, Hima Patel

    Abstract: Efficient processing of tabular data is important in various industries, especially when working with datasets containing a large number of columns. Large language models (LLMs) have demonstrated their ability on several tasks through carefully crafted prompts. However, creating effective prompts for tabular datasets is challenging due to the structured nature of the data and the need to manage nu… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: Accepted to NAACL 2024 Industry Track

  4. arXiv:2404.19093  [pdf, other]

    cs.IR cs.AI cs.HC

    Large Language Models as Conversational Movie Recommenders: A User Study

    Authors: Ruixuan Sun, Xinyi Li, Avinash Akella, Joseph A. Konstan

    Abstract: This paper explores the effectiveness of using large language models (LLMs) for personalized movie recommendations from users' perspectives in an online field experiment. Our study involves a combination of between-subject prompt and historic consumption assessments, along with within-subject recommendation scenario evaluations. By examining conversation and survey response data from 160 active us… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  5. arXiv:2404.18322  [pdf, other]

    cs.DC

    BlockLLM: Multi-tenant Finer-grained Serving for Large Language Models

    Authors: Jiamin Li, Le Xu, Hong Xu, Aditya Akella

    Abstract: The growing demand for Large Language Models (LLMs) across diverse applications has prompted a paradigm shift in the design of deep learning serving systems. Deploying LLMs, especially in multi-tenant environments, presents considerable challenges due to their high computational and memory demands. We present BlockLLM, a serving system that exploits the potential of sharing components among fine-t… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

  6. arXiv:2404.03865  [pdf, other]

    cs.CL cs.LG

    FFN-SkipLLM: A Hidden Gem for Autoregressive Decoding with Adaptive Feed Forward Skipping

    Authors: Ajay Jaiswal, Bodun Hu, Lu Yin, Yeonju Ro, Shiwei Liu, Tianlong Chen, Aditya Akella

    Abstract: Autoregressive Large Language Models (e.g., LLaMa, GPTs) are omnipresent, achieving remarkable success in language understanding and generation. However, such impressive capability typically comes with a substantial model size, which presents significant challenges for autoregressive token-by-token generation. To mitigate computation overload incurred during generation, several early-exit and layer… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: arXiv admin note: text overlap with arXiv:2310.01382

  7. arXiv:2402.07529  [pdf, other]

    cs.DC cs.DS cs.LG cs.NI

    Accelerating Distributed Deep Learning using Lossless Homomorphic Compression

    Authors: Haoyu Li, Yuchen Xu, Jiayi Chen, Rohit Dwivedula, Wenfei Wu, Keqiang He, Aditya Akella, Daehyeok Kim

    Abstract: As deep neural networks (DNNs) grow in complexity and size, the resultant increase in communication overhead during distributed training has become a significant bottleneck, challenging the scalability of distributed training systems. Existing solutions, while aiming to mitigate this bottleneck through worker-level compression and in-network aggregation, fall short due to their inability to effici… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

  8. arXiv:2401.11632  [pdf, other]

    cs.IR cs.HC cs.LG

    What Are We Optimizing For? A Human-centric Evaluation of Deep Learning-based Movie Recommenders

    Authors: Ruixuan Sun, Xinyi Wu, Avinash Akella, Ruoyan Kong, Bart Knijnenburg, Joseph A. Konstan

    Abstract: In the past decade, deep learning (DL) models have gained prominence for their exceptional accuracy on benchmark datasets in recommender systems (RecSys). However, their evaluation has primarily relied on offline metrics, overlooking direct user perception and experience. To address this gap, we conduct a human-centric evaluation case study of four leading DL-RecSys models in the movie domain. We… ▽ More

    Submitted 1 May, 2024; v1 submitted 21 January, 2024; originally announced January 2024.

  9. arXiv:2312.07813  [pdf, other]

    cs.OS cs.LG

    On a Foundation Model for Operating Systems

    Authors: Divyanshu Saxena, Nihal Sharma, Donghyun Kim, Rohit Dwivedula, Jiayi Chen, Chenxi Yang, Sriram Ravula, Zichao Hu, Aditya Akella, Sebastian Angel, Joydeep Biswas, Swarat Chaudhuri, Isil Dillig, Alex Dimakis, P. Brighten Godfrey, Daehyeok Kim, Chris Rossbach, Gang Wang

    Abstract: This paper lays down the research agenda for a domain-specific foundation model for operating systems (OSes). Our case for a foundation model revolves around the observations that several OS components such as CPU, memory, and network subsystems are interrelated and that OS traces offer the ideal dataset for a foundation model to grasp the intricacies of diverse OS components and their behavior in… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: Machine Learning for Systems Workshop at 37th NeurIPS Conference, 2023, New Orleans, LA, USA

  10. arXiv:2310.18481  [pdf, other]

    cs.LG cs.AI cs.OS

    MOSEL: Inference Serving Using Dynamic Modality Selection

    Authors: Bodun Hu, Le Xu, Jeongyoon Moon, Neeraja J. Yadwadkar, Aditya Akella

    Abstract: Rapid advancements over the years have helped machine learning models reach previously hard-to-achieve goals, sometimes even exceeding human capabilities. However, to attain the desired accuracy, the model sizes and in turn their computational requirements have increased drastically. Thus, serving predictions from these models to meet any target latency and cost requirements of applications remain… ▽ More

    Submitted 27 October, 2023; originally announced October 2023.

  11. Interactive Content Diversity and User Exploration in Online Movie Recommenders: A Field Experiment

    Authors: Ruixuan Sun, Avinash Akella, Ruoyan Kong, Moyan Zhou, Joseph A. Konstan

    Abstract: Recommender systems often struggle to strike a balance between matching users' tastes and providing unexpected recommendations. When recommendations are too narrow and fail to cover the full range of users' preferences, the system is perceived as useless. Conversely, when the system suggests too many items that users don't like, it is considered impersonal or ineffective. To better understand user… ▽ More

    Submitted 23 September, 2023; originally announced September 2023.

    Comments: International Journal of Human Computer Interaction

  12. arXiv:2308.13632  [pdf, other]

    cs.DB cs.DS cs.IT

    ChainedFilter: Combining Membership Filters by Chain Rule

    Authors: Haoyu Li, Liuhui Wang, Qizhi Chen, Jianan Ji, Yuhan Wu, Yikai Zhao, Tong Yang, Aditya Akella

    Abstract: Membership (membership query / membership testing) is a fundamental problem across databases, networks and security. However, previous research has primarily focused on either approximate solutions, such as Bloom Filters, or exact methods, like perfect hashing and dictionaries, without attempting to develop an integral theory. In this paper, we propose a unified and complete theory, namely chain… ▽ More

    Submitted 25 August, 2023; originally announced August 2023.

    Comments: SIGMOD 2024

  13. arXiv:2308.12580  [pdf, other]

    cs.DL cs.IR

    Laying foundations to quantify the "Effort of Reproducibility"

    Authors: Akhil Pandey Akella, David Koop, Hamed Alhoori

    Abstract: Why are some research studies easy to reproduce while others are difficult? Casting doubt on the accuracy of scientific work is not fruitful, especially when an individual researcher cannot reproduce the claims made in the paper. There could be many subjective reasons behind the inability to reproduce a scientific paper. The field of Machine Learning (ML) faces a reproducibility crisis, and survey… ▽ More

    Submitted 24 August, 2023; originally announced August 2023.

    Comments: Accepted at the ACM/IEEE conference JCDL 2023. Refer to https://2023.jcdl.org/program/schedule-printable/ for confirmation

  14. arXiv:2308.03615  [pdf, other]

    cs.DC cs.DB

    Dirigo: Self-scaling Stateful Actors For Serverless Real-time Data Processing

    Authors: Le Xu, Divyanshu Saxena, Neeraja J. Yadwadkar, Aditya Akella, Indranil Gupta

    Abstract: We propose Dirigo, a distributed stream processing service built atop virtual actors. Dirigo achieves both a high level of resource efficiency and performance isolation driven by user intent (SLO). To improve resource efficiency, Dirigo adopts a serverless architecture that enables time-sharing of compute resources among streaming operators, both within and across applications. Meanwhile, Dirigo i… ▽ More

    Submitted 7 August, 2023; originally announced August 2023.

  15. arXiv:2308.00852  [pdf, other]

    cs.NI cs.DC cs.LG

    CASSINI: Network-Aware Job Scheduling in Machine Learning Clusters

    Authors: Sudarsanan Rajasekaran, Manya Ghobadi, Aditya Akella

    Abstract: We present CASSINI, a network-aware job scheduler for machine learning (ML) clusters. CASSINI introduces a novel geometric abstraction to consider the communication pattern of different jobs while placing them on network links. To do so, CASSINI uses an affinity graph that finds a series of time-shift values to adjust the communication phases of a subset of jobs, such that the communication patter… ▽ More

    Submitted 1 August, 2023; originally announced August 2023.

    ACM Class: C.2.4

  16. arXiv:2305.00538  [pdf, other]

    cs.NI

    SFC: Near-Source Congestion Signaling and Flow Control

    Authors: Yanfang Le, Jeongkeun Lee, Jeremias Blendin, Jiayi Chen, Georgios Nikolaidis, Rong Pan, Robert Soule, Aditya Akella, Pedro Yebenes Segura, Arjun Singhvi, Yuliang Li, Qingkai Meng, Changhoon Kim, Serhat Arslan

    Abstract: State-of-the-art congestion control algorithms for data centers alone do not cope well with transient congestion and high traffic bursts. To help with these, we revisit the concept of direct backward feedback from switches and propose Back-to-Sender (BTS) signaling to many concurrent incast senders. Combining it with our novel approach to in-network caching, we achieve near-source sub-RTT c… ▽ More

    Submitted 30 April, 2023; originally announced May 2023.

  17. arXiv:2301.04369  [pdf, other]

    cs.DL stat.AP

    Reproducibility Signals in Science: A preliminary analysis

    Authors: Akhil Pandey Akella, Hamed Alhoori, David Koop

    Abstract: Reproducibility is an important feature of science; experiments are retested, and analyses are repeated. Trust in the findings increases when consistent results are achieved. Despite the importance of reproducibility, significant work is often involved in these efforts, and some published findings may not be reproducible due to oversights or errors. In this paper, we examine a myriad of features i… ▽ More

    Submitted 11 January, 2023; originally announced January 2023.

    Comments: Accepted as a Workshop paper for WIESP-22 (https://ui.adsabs.harvard.edu/WIESP/2022/Schedule)

  18. arXiv:2301.04205  [pdf, other]

    cs.LO

    A Performance Verification Methodology for Resource Allocation Heuristics

    Authors: Saksham Goel, Benjamin Mikek, Jehad Aly, Venkat Arun, Ahmed Saeed, Aditya Akella

    Abstract: Performance verification is a nascent but promising tool for understanding the performance and limitations of heuristics under realistic assumptions. Bespoke performance verification tools have already demonstrated their value in settings like congestion control and packet scheduling. In this paper, we aim to emphasize the broad applicability and utility of performance verification. To that end, w… ▽ More

    Submitted 28 February, 2024; v1 submitted 10 January, 2023; originally announced January 2023.

    Comments: 12 pages, 11 figures

  19. arXiv:2211.05594  [pdf, other]

    cs.LG

    A Brief Survey on Representation Learning based Graph Dimensionality Reduction Techniques

    Authors: Akhil Pandey Akella

    Abstract: Dimensionality reduction techniques map data represented on higher dimensions onto lower dimensions with varying degrees of information loss. Graph dimensionality reduction techniques adopt the same principle of providing latent representations of the graph structure with minor adaptations to the output representations along with the input data. There exist several cutting edge techniques that are… ▽ More

    Submitted 13 October, 2022; originally announced November 2022.

  20. Auxo: Efficient Federated Learning via Scalable Client Clustering

    Authors: Jiachen Liu, Fan Lai, Yinwei Dai, Aditya Akella, Harsha Madhyastha, Mosharaf Chowdhury

    Abstract: Federated learning (FL) is an emerging machine learning (ML) paradigm that enables heterogeneous edge devices to collaboratively train ML models without revealing their raw data to a logically centralized server. However, beyond the heterogeneous device capacity, FL participants often exhibit differences in their data distributions, which are not independent and identically distributed (Non-IID).… ▽ More

    Submitted 30 September, 2023; v1 submitted 29 October, 2022; originally announced October 2022.

    Comments: 18 pages

  21. arXiv:2210.00093  [pdf, other]

    cs.DC

    Shockwave: Fair and Efficient Cluster Scheduling for Dynamic Adaptation in Machine Learning

    Authors: Pengfei Zheng, Rui Pan, Tarannum Khan, Shivaram Venkataraman, Aditya Akella

    Abstract: Dynamic adaptation has become an essential technique in accelerating distributed machine learning (ML) training. Recent studies have shown that dynamically adjusting model structure (e.g., lottery ticket hypothesis) or hyperparameters (e.g., batch size) can significantly accelerate training without sacrificing accuracy. However, existing ML cluster schedulers are not designed to handle dynamic ada… ▽ More

    Submitted 30 September, 2022; originally announced October 2022.

    Comments: Accepted at the 20th USENIX Symposium on Networked Systems Design and Implementation (NSDI '23)

  22. arXiv:2207.10898  [pdf, other]

    cs.NI cs.AI

    Impact of RoCE Congestion Control Policies on Distributed Training of DNNs

    Authors: Tarannum Khan, Saeed Rashidi, Srinivas Sridharan, Pallavi Shurpali, Aditya Akella, Tushar Krishna

    Abstract: RDMA over Converged Ethernet (RoCE) has gained significant traction for datacenter networks due to its compatibility with conventional Ethernet-based fabric. However, the RDMA protocol is efficient only on (nearly) lossless networks, emphasizing the vital role of congestion control on RoCE networks. Unfortunately, the native RoCE congestion control scheme, based on Priority Flow Control (PFC), s… ▽ More

    Submitted 22 July, 2022; originally announced July 2022.

  23. arXiv:2205.14323  [pdf, other]

    cs.DB cs.LG

    Multi-agent Databases via Independent Learning

    Authors: Chi Zhang, Olga Papaemmanouil, Josiah P. Hanna, Aditya Akella

    Abstract: Machine learning is rapidly being used in database research to improve the effectiveness of numerous tasks, including but not limited to query optimization, workload scheduling, physical design, etc. Currently, the research focus has been on replacing a single database component responsible for one task by its learning-based counterpart. However, query performance is not simply determined by the per… ▽ More

    Submitted 5 August, 2022; v1 submitted 27 May, 2022; originally announced May 2022.

    Journal ref: AIDB@VLDB 2022 Proceedings of 4th International Workshop on Applied AI for Database Systems and Applications

  24. arXiv:2204.03211  [pdf, other]

    cs.DC

    Elastic Model Aggregation with Parameter Service

    Authors: Juncheng Gu, Mosharaf Chowdhury, Kang G. Shin, Aditya Akella

    Abstract: Model aggregation, the process that updates model parameters, is an important step for model convergence in distributed deep learning (DDL). However, the parameter server (PS), a popular paradigm of performing model aggregation, causes CPU underutilization in deep learning (DL) clusters, due to the bursty nature of aggregation and static resource allocation. To remedy this problem, we propose Para… ▽ More

    Submitted 7 April, 2022; originally announced April 2022.

  25. arXiv:2111.10672  [pdf, other]

    cs.DC cs.LG

    Doing More by Doing Less: How Structured Partial Backpropagation Improves Deep Learning Clusters

    Authors: Adarsh Kumar, Kausik Subramanian, Shivaram Venkataraman, Aditya Akella

    Abstract: Many organizations employ compute clusters equipped with accelerators such as GPUs and TPUs for training deep learning models in a distributed fashion. Training is resource-intensive, consuming significant compute, memory, and network resources. Many prior works explore how to reduce training resource footprint without impacting quality, but their focus on a subset of the bottlenecks (typically on… ▽ More

    Submitted 20 November, 2021; originally announced November 2021.

    Comments: Accepted at DistributedML-2021

  26. arXiv:2109.06449  [pdf, other]

    cs.AI cs.CR cs.LG

    Deep hierarchical reinforcement agents for automated penetration testing

    Authors: Khuong Tran, Ashlesha Akella, Maxwell Standen, Junae Kim, David Bowman, Toby Richer, Chin-Teng Lin

    Abstract: Penetration testing, the organised attack of a computer system in order to test existing defences, has been used extensively to evaluate network security. This is a time-consuming process and requires in-depth knowledge for the establishment of a strategy that resembles a real cyber-attack. This paper presents a novel deep reinforcement learning architecture with hierarchically structured agents cal… ▽ More

    Submitted 14 September, 2021; originally announced September 2021.

    Comments: Presented at 1st International Workshop on Adaptive Cyber Defense, 2021 (arXiv:2108.08476)

    Report number: IJCAI-ACD/2021/114

  27. arXiv:2101.07344  [pdf, other]

    cs.LG cs.DC cs.PF

    Accelerating Deep Learning Inference via Learned Caches

    Authors: Arjun Balasubramanian, Adarsh Kumar, Yuhan Liu, Han Cao, Shivaram Venkataraman, Aditya Akella

    Abstract: Deep Neural Networks (DNNs) are witnessing increased adoption in multiple domains owing to their high accuracy in solving real-world problems. However, this high accuracy has been achieved by building deeper networks, posing a fundamental challenge to the low latency inference desired by user-facing applications. Current low latency solutions trade-off on accuracy or fail to exploit the inherent t… ▽ More

    Submitted 18 January, 2021; originally announced January 2021.

  28. arXiv:2101.06537  [pdf, other]

    cs.NI

    PL2: Towards Predictable Low Latency in Rack-Scale Networks

    Authors: Yanfang Le, Radhika Niranjan Mysore, Lalith Suresh, Gerd Zellweger, Sujata Banerjee, Aditya Akella, Michael Swift

    Abstract: High performance rack-scale offerings package disaggregated pools of compute, memory and storage hardware in a single rack to run diverse workloads with varying requirements, including applications that need low and predictable latency. The intra-rack network is typically high speed Ethernet, which can suffer from congestion leading to packet drops and may not satisfy the stringent tail latency re… ▽ More

    Submitted 22 January, 2021; v1 submitted 16 January, 2021; originally announced January 2021.

    Comments: 13 pages, 23 pdf figures, use acmart.cls

  29. arXiv:2012.13599  [pdf]

    cs.DL cs.CY cs.LG cs.SI

    Early Indicators of Scientific Impact: Predicting Citations with Altmetrics

    Authors: Akhil Pandey Akella, Hamed Alhoori, Pavan Ravikanth Kondamudi, Cole Freeman, Haiming Zhou

    Abstract: Identifying important scholarly literature at an early stage is vital to the academic research community and other stakeholders such as technology companies and government bodies. Due to the sheer amount of research published and the growth of ever-changing interdisciplinary areas, researchers need an efficient way to identify important scholarly work. The number of citations a given research publ… ▽ More

    Submitted 25 December, 2020; originally announced December 2020.

  30. arXiv:2002.02645  [pdf, other]

    cs.LG stat.ML

    Accelerating Deep Learning Inference via Freezing

    Authors: Adarsh Kumar, Arjun Balasubramanian, Shivaram Venkataraman, Aditya Akella

    Abstract: Over the last few years, Deep Neural Networks (DNNs) have become ubiquitous owing to their high accuracy on real-world tasks. However, this increase in accuracy comes at the cost of computationally expensive models leading to higher prediction latencies. Prior efforts to reduce this latency such as quantization, model distillation, and any-time prediction models typically trade-off accuracy for pe… ▽ More

    Submitted 7 February, 2020; originally announced February 2020.

    Comments: 11th USENIX Workshop on Hot Topics in Cloud Computing, HotCloud 2019

  31. arXiv:1912.02402  [pdf, other]

    cs.NI

    D2R: Dataplane-Only Policy-Compliant Routing Under Failures

    Authors: Kausik Subramanian, Anubhavnidhi Abhashkumar, Loris D'Antoni, Aditya Akella

    Abstract: In networks today, the data plane handles forwarding (sending a packet to the next device in the path) and the control plane handles routing (deciding the path of the packet in the network). This architecture has limitations. First, when link failures occur, the data plane has to wait for the control plane to install new routes, and packet losses can occur due to delayed routing convergence or c… ▽ More

    Submitted 5 December, 2019; originally announced December 2019.

  32. arXiv:1911.09849  [pdf, other]

    cs.DC

    Archipelago: A Scalable Low-Latency Serverless Platform

    Authors: Arjun Singhvi, Kevin Houck, Arjun Balasubramanian, Mohammed Danish Shaikh, Shivaram Venkataraman, Aditya Akella

    Abstract: The increased use of micro-services to build web applications has spurred the rapid growth of Function-as-a-Service (FaaS) or serverless computing platforms. While FaaS simplifies provisioning and scaling for application developers, it introduces new challenges in resource management that need to be handled by the cloud provider. Our analysis of popular serverless workloads indicates that schedule… ▽ More

    Submitted 21 November, 2019; originally announced November 2019.

    Comments: 14 pages

  33. arXiv:1910.07700  [pdf, other]

    cs.DC

    SNF: Serverless Network Functions

    Authors: Arjun Singhvi, Junaid Khalid, Aditya Akella, Sujata Banerjee

    Abstract: It is increasingly common to outsource network functions (NFs) to the cloud. However, no cloud providers offer NFs-as-a-Service (NFaaS) that allows users to run custom NFs. Our work addresses how a cloud provider can offer NFaaS. We use the emerging serverless computing paradigm as it has the right building blocks - usage-based billing, convenient event-driven programming model and automatic compu… ▽ More

    Submitted 16 October, 2019; originally announced October 2019.

    Comments: 14 pages

  34. arXiv:1907.01484  [pdf, other]

    cs.DC

    Themis: Fair and Efficient GPU Cluster Scheduling

    Authors: Kshiteej Mahajan, Arjun Balasubramanian, Arjun Singhvi, Shivaram Venkataraman, Aditya Akella, Amar Phanishayee, Shuchi Chawla

    Abstract: Modern distributed machine learning (ML) training workloads benefit significantly from leveraging GPUs. However, significant contention ensues when multiple such workloads are run atop a shared cluster of GPUs. A key question is how to fairly apportion GPUs across workloads. We find that established cluster scheduling disciplines are a poor fit because of ML workloads' unique attributes: ML jobs h… ▽ More

    Submitted 29 October, 2019; v1 submitted 2 July, 2019; originally announced July 2019.

  35. arXiv:1907.00434  [pdf, other]

    cs.DC

    Network-accelerated Distributed Machine Learning Using MLFabric

    Authors: Raajay Viswanathan, Aditya Akella

    Abstract: Existing distributed machine learning (DML) systems focus on improving the computational efficiency of distributed learning, whereas communication aspects have received less attention. Many DML systems treat the network as a blackbox. Thus, DML algorithms' performance is impeded by network bottlenecks, and DML systems end up sacrificing important algorithmic and system-level benefits. We present M… ▽ More

    Submitted 30 June, 2019; originally announced July 2019.

  36. arXiv:1906.02043  [pdf, other]

    cs.NI

    Tiramisu: Fast and General Network Verification

    Authors: Anubhavnidhi Abhashkumar, Aaron Gember-Jacobson, Aditya Akella

    Abstract: Today's distributed network control planes support multiple routing protocols, filtering mechanisms, and route selection policies. These protocols operate at different layers, e.g. BGP operates at the EGP layer, OSPF at the IGP layer, and VLANs at layer 2. The behavior of a network's control plane depends on how these protocols interact with each other. This makes network configurations highly com… ▽ More

    Submitted 5 June, 2019; originally announced June 2019.

    Comments: 14 pages + Appendices

  37. arXiv:1808.08309  [pdf, other]

    eess.SY cs.RO

    Trajectory Tracking Control of a Flexible Spine Robot, With and Without a Reference Input

    Authors: Andrew P. Sabelhaus, Shirley Huajing Zhao, Mallory C. Daly, Ellande Tang, Edward Zhu, Abishek K. Akella, Zeerek A. Ahmad, Vytas SunSpiral, Alice M. Agogino

    Abstract: The Underactuated Lightweight Tensegrity Robotic Assistive Spine (ULTRA Spine) project is an ongoing effort to develop a flexible, actuated backbone for quadruped robots. In this work, model-predictive control is used to track a trajectory in the robot's state space, in simulation. The state trajectory used here corresponds to a bending motion of the spine, with translations and rotations of the m… ▽ More

    Submitted 24 August, 2018; originally announced August 2018.

    Journal ref: 2017 NASA/ESA Conference on Adaptive Hardware and Systems - Workshop on Structurally Adaptive Tensegrity Robots

  38. arXiv:1703.10272  [pdf, other]

    cs.DC

    Whiz: A Fast and Flexible Data Analytics System

    Authors: Robert Grandl, Arjun Singhvi, Raajay Viswanathan, Aditya Akella

    Abstract: Today's data analytics frameworks are compute-centric, with analytics execution almost entirely dependent on the pre-determined physical structure of the high-level computation. Relegating intermediate data to a second class entity in this manner hurts flexibility, performance, and efficiency. We present Whiz, a new analytics framework that cleanly separates computation from intermediate data. It… ▽ More

    Submitted 21 June, 2019; v1 submitted 29 March, 2017; originally announced March 2017.

    Comments: 16 pages

  39. arXiv:1612.01497  [pdf, other]

    cs.NI cs.DC

    Correctness and Performance for Stateful Chained Network Functions

    Authors: Junaid Khalid, Aditya Akella

    Abstract: Network functions virtualization (NFV) allows operators to employ NF chains to realize custom policies, and dynamically add instances to meet demand or for failover. NFs maintain detailed per- and cross-flow state which needs careful management, especially during dynamic actions. Crucially, state management must: (1) ensure NF chain-wide correctness and (2) have good performance. To this end, we b… ▽ More

    Submitted 16 October, 2018; v1 submitted 5 December, 2016; originally announced December 2016.

  40. arXiv:1604.07371  [pdf, other]

    cs.DC cs.DB cs.OS cs.PF eess.SY

    Do the Hard Stuff First: Scheduling Dependent Computations in Data-Analytics Clusters

    Authors: Robert Grandl, Srikanth Kandula, Sriram Rao, Aditya Akella, Janardhan Kulkarni

    Abstract: We present a scheduler that improves cluster utilization and job completion times by packing tasks having multi-resource requirements and inter-dependencies. While the problem is algorithmically very hard, we achieve near-optimality on the job DAGs that appear in production clusters at a large enterprise and in benchmarks such as TPC-DS. A key insight is that carefully handling the long-running ta… ▽ More

    Submitted 25 April, 2016; originally announced April 2016.

    Report number: MSR-TR-2016-19

  41. arXiv:1403.7115  [pdf, ps, other]

    cs.NI

    Active Switching: Packet Steering Flow Annotations

    Authors: Saul St. John, Aditya Akella

    Abstract: Our previous experience building systems for middlebox chain composition and scaling in software-defined networks has revealed that existing mechanisms of flow annotation commonly do not survive middlebox-traversals, or suffer from extreme identifier domain limitations resulting in excessive flow table size. In this paper, we analyze the structural artifacts resulting in these challenges, and offe… ▽ More

    Submitted 27 March, 2014; originally announced March 2014.

    MSC Class: 68M10 ACM Class: C.2.1

  42. arXiv:1305.0209  [pdf, ps, other]

    cs.NI

    Stratos: A Network-Aware Orchestration Layer for Virtual Middleboxes in Clouds

    Authors: Aaron Gember, Anand Krishnamurthy, Saul St. John, Robert Grandl, Xiaoyang Gao, Ashok Anand, Theophilus Benson, Vyas Sekar, Aditya Akella

    Abstract: Enterprises want their in-cloud services to leverage the performance and security benefits that middleboxes offer in traditional deployments. Such virtualized deployments create new opportunities (e.g., flexible scaling) as well as new challenges (e.g., dynamics, multiplexing) for middlebox management tasks such as service composition and provisioning. Unfortunately, enterprises lack systematic to… ▽ More

    Submitted 11 March, 2014; v1 submitted 1 May, 2013; originally announced May 2013.