Skip to main content

Showing 1–9 of 9 results for author: Kaffes, K

.
  1. arXiv:2502.09281  [pdf, other

    cs.NI cs.DC cs.OS

    Fast Userspace Networking for the Rest of Us

    Authors: Alireza Sanaee, Vahab Jabrayilov, Ilias Marinos, Anuj Kalia, Divyanshu Saxena, Prateesh Goyal, Kostis Kaffes, Gianni Antichi

    Abstract: After a decade of research in userspace network stacks, why do new solutions remain inaccessible to most developers? We argue that this is because they ignored (1) the hardware constraints of public cloud NICs (vNICs) and (2) the flexibility required by applications. Concerning the former, state-of-the-art proposals rely on specific NIC features (e.g., flow steering, deep buffers) that are not bro… ▽ More

    Submitted 13 February, 2025; originally announced February 2025.

  2. arXiv:2408.17351  [pdf, other

    cs.OS

    Tide: A Split OS Architecture for Control Plane Offloading

    Authors: Jack Tigar Humphries, Neel Natu, Kostis Kaffes, Stanko Novaković, Paul Turner, Hank Levy, David Culler, Christos Kozyrakis

    Abstract: The end of Moore's Law is driving cloud providers to offload virtualization and the network data plane to SmartNICs to improve compute efficiency. Even though individual OS control plane tasks consume up to 5% of cycles across the fleet, they remain on the host CPU because they are tightly intertwined with OS mechanisms. Moreover, offloading puts the slow PCIe interconnect in the critical path of… ▽ More

    Submitted 20 October, 2024; v1 submitted 30 August, 2024; originally announced August 2024.

    Comments: About 11 pages

  3. arXiv:2401.08859  [pdf, other

    cs.DC cs.LG

    Shabari: Delayed Decision-Making for Faster and Efficient Serverless Functions

    Authors: Prasoon Sinha, Kostis Kaffes, Neeraja J. Yadwadkar

    Abstract: Serverless computing relieves developers from the burden of resource management, thus providing ease-of-use to the users and the opportunity to optimize resource utilization for the providers. However, today's serverless systems lack performance guarantees for function invocations, thus limiting support for performance-critical applications: we observed severe performance variability (up to 6x). P… ▽ More

    Submitted 25 January, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

    Comments: 17 pages, 14 figures, update typo in manually entered arxiv title

  4. arXiv:2312.06808  [pdf, other

    cs.OS

    BPF-oF: Storage Function Pushdown Over the Network

    Authors: Ioannis Zarkadas, Tal Zussman, Jeremy Carin, Sheng Jiang, Yuhong Zhong, Jonas Pfefferle, Hubertus Franke, Junfeng Yang, Kostis Kaffes, Ryan Stutsman, Asaf Cidon

    Abstract: Storage disaggregation, wherein storage is accessed over the network, is popular because it allows applications to independently scale storage capacity and bandwidth based on dynamic application demand. However, the added network processing introduced by disaggregation can consume significant CPU resources. In many storage systems, logical storage operations (e.g., lookups, aggregations) involve a… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

  5. arXiv:2308.02896  [pdf, other

    cs.DC cs.AR cs.NI cs.OS

    Towards Fast, Adaptive, and Hardware-Assisted User-Space Scheduling

    Authors: Lisa, Li, Nikita Lazarev, David Koufaty, Yijun Yin, Andy Anderson, Zhiru Zhang, Edward Suh, Kostis Kaffes, Christina Delimitrou

    Abstract: Modern datacenter applications are prone to high tail latencies since their requests typically follow highly-dispersive distributions. Delivering fast interrupts is essential to reducing tail latency. Prior work has proposed both OS- and system-level solutions to reduce tail latencies for microsecond-scale workloads through better scheduling. Unfortunately, existing approaches like customized data… ▽ More

    Submitted 11 November, 2023; v1 submitted 5 August, 2023; originally announced August 2023.

    Comments: Accepted by HPCA2024

  6. arXiv:2208.13068  [pdf, other

    cs.DB cs.DC

    Apiary: A DBMS-Integrated Transactional Function-as-a-Service Framework

    Authors: Peter Kraft, Qian Li, Kostis Kaffes, Athinagoras Skiadopoulos, Deeptaanshu Kumar, Danny Cho, Jason Li, Robert Redmond, Nathan Weckwerth, Brian Xia, Peter Bailis, Michael Cafarella, Goetz Graefe, Jeremy Kepner, Christos Kozyrakis, Michael Stonebraker, Lalith Suresh, Xiangyao Yu, Matei Zaharia

    Abstract: Developers increasingly use function-as-a-service (FaaS) platforms for data-centric applications that perform low-latency and transactional operations on data, such as for microservices or web serving. Unfortunately, existing FaaS platforms support these applications poorly because they physically and logically separate application logic, executed in cloud functions, from data management, done in… ▽ More

    Submitted 30 June, 2023; v1 submitted 27 August, 2022; originally announced August 2022.

    Comments: 14 pages, 13 figures, 3 tables. Preprint

  7. arXiv:2111.07226  [pdf, other

    cs.DC

    Practical Scheduling for Real-World Serverless Computing

    Authors: Kostis Kaffes, Neeraja J. Yadwadkar, Christos Kozyrakis

    Abstract: Serverless computing has seen rapid growth due to the ease-of-use and cost-efficiency it provides. However, function scheduling, a critical component of serverless systems, has been overlooked. In this paper, we take a first-principles approach toward designing a scheduler that caters to the unique characteristics of serverless functions as seen in real-world deployments. We first create a taxonom… ▽ More

    Submitted 13 November, 2021; originally announced November 2021.

  8. arXiv:2010.05969  [pdf, other

    cs.DC

    RackSched: A Microsecond-Scale Scheduler for Rack-Scale Computers (Technical Report)

    Authors: Hang Zhu, Kostis Kaffes, Zixu Chen, Zhenming Liu, Christos Kozyrakis, Ion Stoica, Xin Jin

    Abstract: Low-latency online services have strict Service Level Objectives (SLOs) that require datacenter systems to support high throughput at microsecond-scale tail latency. Dataplane operating systems have been designed to scale up multi-core servers with minimal overhead for such SLOs. However, as application demands continue to increase, scaling up is not enough, and serving larger demands requires the… ▽ More

    Submitted 15 October, 2020; v1 submitted 12 October, 2020; originally announced October 2020.

  9. arXiv:1601.07400  [pdf, other

    cs.DC

    Improving virtual host efficiency through resource and interference aware scheduling

    Authors: Evangelos Angelou, Konstantinos Kaffes, Athanasia Asiki, Georgios Goumas, Nectarios Koziris

    Abstract: Modern Infrastructure-as-a-Service Clouds operate in a competitive environment that caters to any user's requirements for computing resources. The sharing of the various types of resources by diverse applications poses a series of challenges in order to optimize resource utilization while avoiding performance degradation caused by application interference. In this paper, we present two scheduling… ▽ More

    Submitted 27 January, 2016; originally announced January 2016.

    Comments: 2nd International Workshop on Dynamic Resource Allocation and Management in Embedded, High Performance and Cloud Computing DREAMCloud 2016 (arXiv:cs/1601.04675)

    Report number: DREAMCloud/2016/02