Current journal: arXiv - CS - Operating Systems
  • TardiS: Migrating Containers with RDMA Networks
    arXiv.cs.OS Pub Date : 2020-09-15
    Maksym Planeta (TU Dresden); Jan Bierbaum (TU Dresden); Leo Sahaya Daphne Antony (AMOLF); Torsten Hoefler (ETH Zurich); Hermann Härtig (TU Dresden)

    Major data centre providers are introducing RDMA-based networks for their tenants, as well as for operating the underlying infrastructure. In comparison to traditional socket-based network stacks, RDMA-based networks offer higher throughput, lower latency and reduced CPU overhead. However, transparent checkpoint and migration operations become much more difficult. The key reason is that the OS is removed

  • Secure Memory Management on Modern Hardware
    arXiv.cs.OS Pub Date : 2020-09-06
    Reto Achermann; Nora Hossle; Lukas Humbel; Daniel Schwyn; David Cock; Timothy Roscoe

    Almost all modern hardware, from phone SoCs to high-end servers with accelerators, contains memory translation and protection hardware like IOMMUs, firewalls, and lookup tables, which make it impossible to reason about and enforce protection and isolation based solely on the processor's MMUs. This has led to numerous bugs and security vulnerabilities in today's system software. In this paper we regain

  • Sirius: Enabling System-Wide Isolation for Trusted Execution Environments
    arXiv.cs.OS Pub Date : 2020-09-03
    Zahra Tarkhani; Anil Madhavapeddy

    Hardware-assisted trusted execution environments (TEEs) are critical building blocks of many modern applications. However, the one-way isolation model introduces a semantic gap between TEE and its outside world, including conventional OSs and applications. This causes the most practical and ever-increasing set of attacks on TEE-enabled applications by exploiting various insecure interactions with the

  • Quantifying the Latency and Possible Throughput of External Interrupts on Cyber-Physical Systems
    arXiv.cs.OS Pub Date : 2020-09-01
    Oliver Horst; Johannes Wiesböck; Raphael Wild; Uwe Baumgarten

    An important characteristic of cyber-physical systems is their capability to respond, in-time, to events from their physical environment. However, to the best of our knowledge there exists no benchmark for assessing and comparing the interrupt handling performance of different software stacks. Hence, we present a flexible evaluation method for measuring the interrupt latency and throughput on ARMv8-A

  • toki: A Build- and Test-Platform for Prototyping and Evaluating Operating System Concepts in Real-Time Environments
    arXiv.cs.OS Pub Date : 2020-09-01
    Oliver Horst; Uwe Baumgarten

    Typically, even low-level operating system concepts, such as resource sharing strategies and predictability measures, are evaluated with Linux on PC hardware. This leaves a large gap to real industrial applications. Hence, the direct transfer of the results might be difficult. As a solution, we present toki, a prototyping and evaluation platform based on FreeRTOS and several open-source libraries.

  • Analysis of Interference between RDMA and Local Access on Hybrid Memory System
    arXiv.cs.OS Pub Date : 2020-08-28
    Kazuichi Oe

    We can use a hybrid memory system consisting of DRAM and Intel Optane DC Persistent Memory (called DCPM in this paper), as DCPM has been commercially available since April 2019. Even if the latency for DCPM is several times higher than that for DRAM, the capacity for DCPM is several times higher than that for DRAM and the cost of DCPM is also several times lower than that for DRAM. In addition, DCPM

  • BumbleBee: Application-aware adaptation for container orchestration
    arXiv.cs.OS Pub Date : 2020-08-27
    HyunJong Lee; Shadi Noghabi; Brian Noble; Matthew Furlong; Landon P. Cox

    Application-aware adaptation is the key to maintaining acceptable quality when resources become scarce. Application-oblivious responses to resource scarcity, such as TCP congestion control, may fairly reallocate a diminishing resource pool, but only the application knows how to adjust its fidelity under resource scarcity. Unfortunately, modern container-orchestration platforms like Kubernetes do not

  • Making Distributed Mobile Applications SAFE: Enforcing User Privacy Policies on Untrusted Applications with Secure Application Flow Enforcement
    arXiv.cs.OS Pub Date : 2020-08-14
    Adriana Szekeres; Irene Zhang; Katelin Bailey; Isaac Ackerman; Haichen Shen; Franziska Roesner; Dan R. K. Ports; Arvind Krishnamurthy; Henry M. Levy

    Today's mobile devices sense, collect, and store huge amounts of personal information, which users share with family and friends through a wide range of applications. Once users give applications access to their data, they must implicitly trust that the apps correctly maintain data privacy. As we know from both experience and all-too-frequent press articles, that trust is often misplaced. While users

  • Consideration for effectively handling parallel workloads on public cloud system
    arXiv.cs.OS Pub Date : 2020-08-14
    Kazuichi Oe

    We retrieved and analyzed parallel storage workloads of the FUJITSU K5 cloud service to clarify how to build cost-effective hybrid storage systems. A hybrid storage system consists of a fast but low-capacity tier (first tier) and a slow but high-capacity tier (second tier). It typically consists of either SSDs and HDDs or NVMs and SSDs. As a result, we found that 1) regions for first tier should be

  • eXpOS: A Simple Pedagogical Operating System for Undergraduate Instruction
    arXiv.cs.OS Pub Date : 2020-08-08
    K. Murali Krishnan

    An operating system project suitable for undergraduate computing/electrical sciences students is presented. The project can be used as a course project in a one semester course, or as a self-study project for motivated students. The course is organized such that a student with a basic background in programming and computer organization can follow the implementation road map available online, and build

  • Interprocess Communication in FreeBSD 11: Performance Analysis
    arXiv.cs.OS Pub Date : 2020-08-05
    A. H. Bell-Thomas

    Interprocess communication (IPC) is one of the most fundamental functions of a modern operating system, playing an essential role in the fabric of contemporary applications. This report conducts an investigation in FreeBSD of the real-world performance considerations behind two of the most common IPC mechanisms: pipes and sockets. A simple benchmark provides a fair sense of effective bandwidth for

  • HeRTA: Heaviside Real-Time Analysis
    arXiv.cs.OS Pub Date : 2020-07-23
    Frank Slomka; Mohammadreza Sadeghi

    We investigate the mathematical properties of event bound functions as they are used in the worst-case response time analysis and utilization tests. We figure out the differences and similarities between the two approaches. Based on this analysis, we derive a more general form to describe events and event bounds. This new unified approach gives clear new insights in the investigation of real-time systems

  • DBOS: A Proposal for a Data-Centric Operating System
    arXiv.cs.OS Pub Date : 2020-07-21
    Michael Cafarella; David DeWitt; Vijay Gadepally; Jeremy Kepner; Christos Kozyrakis; Tim Kraska; Michael Stonebraker; Matei Zaharia

    Current operating systems are complex systems that were designed before today's computing environments. This makes it difficult for them to meet the scalability, heterogeneity, availability, and security challenges in current cloud and parallel computing environments. To address these problems, we propose a radically new OS design based on data-centric architecture: all operating system state should

  • Scheduling of Real-Time Tasks with Multiple Critical Sections in Multiprocessor Systems
    arXiv.cs.OS Pub Date : 2020-07-16
    Jian-Jia Chen; Junjie Shi; Georg von der Brüggen; Niklas Ueter

    The performance of multiprocessor synchronization and locking protocols is a key factor to utilize the computation power of multiprocessor systems under real-time constraints. While multiple protocols have been developed in the past decades, their performance highly depends on the task partition and prioritization. The recently proposed Dependency Graph Approach showed its advantages and attracted

  • Analyzing and Mitigating Data Stalls in DNN Training
    arXiv.cs.OS Pub Date : 2020-07-14
    Jayashree Mohan; Amar Phanishayee; Ashish Raniwala; Vijay Chidambaram

    We present the first comprehensive analysis of how the data pipeline affects the training of the widely used Deep Neural Networks (DNNs). We analyze nine models and four datasets while varying factors such as the amount of memory, number of CPU threads, etc. We find that in many cases, DNN training time is dominated by data stall time: time spent waiting for data to be fetched from storage and pre-processed

  • LINTS^RT: A Learning-driven Testbed for Intelligent Scheduling in Embedded Systems
    arXiv.cs.OS Pub Date : 2020-07-10
    Zelun Kong; Yaswanth Yadlapalli; Soroush Bateni; Junfeng Guo; Cong Liu

    Due to the increasing complexity seen in both workloads and hardware resources in state-of-the-art embedded systems, developing efficient real-time schedulers and the corresponding schedulability tests becomes rather challenging. Although close to optimal schedulability performance can be achieved for supporting simple system models in practice, adding any small complexity element into the problem

  • IOCA: High-Speed I/O-Aware LLC Management for Network-Centric Multi-Tenant Platform
    arXiv.cs.OS Pub Date : 2020-07-09
    Yifan Yuan; Mohammad Alian; Yipeng Wang; Ilia Kurakin; Ren Wang; Charlie Tai; Nam Sung Kim

    In modern server CPUs, last-level cache (LLC) is a critical hardware resource that exerts significant influence on the performance of the workloads, and how to manage LLC is a key to the performance isolation and QoS in the cloud with multi-tenancy. In this paper, we argue that besides CPU cores, high-speed network I/O is also important for LLC management. This is because of an Intel architectural

  • DPCP-p: A Distributed Locking Protocol for Parallel Real-Time Tasks
    arXiv.cs.OS Pub Date : 2020-07-01
    Maolin Yang; Zewei Chen; Xu Jiang; Nan Guan; Hang Lei

    Real-time scheduling and locking protocols are fundamental facilities to construct time-critical systems. For parallel real-time tasks, predictable locking protocols are required when concurrent sub-jobs need mutually exclusive access to shared resources. This paper for the first time studies the distributed synchronization framework of parallel real-time tasks, where both tasks and global resources are

  • Scalable Range Locks for Scalable Address Spaces and Beyond
    arXiv.cs.OS Pub Date : 2020-06-22
    Alex Kogan; Dave Dice; Shady Issa

    Range locks are a synchronization construct designed to provide concurrent access to multiple threads (or processes) to disjoint parts of a shared resource. Originally conceived in the file system context, range locks are gaining increasing interest in the Linux kernel community seeking to alleviate bottlenecks in the virtual memory management subsystem. The existing implementation of range locks in

  • Optimizing Placement of Heap Memory Objects in Energy-Constrained Hybrid Memory Systems
    arXiv.cs.OS Pub Date : 2020-06-22
    Taeuk Kim; Safdar Jamil; Joongeon Park; Youngjae Kim

    Main memory (DRAM) significantly impacts the power and energy utilization of the overall server system. Non-Volatile Memory (NVM) devices, such as Phase Change Memory and Spin-Transfer Torque RAM, are suitable candidates for main memory to reduce energy consumption. But NVM access latencies are higher than those of DRAM, and NVM writes are more energy sensitive than DRAM write operations. Thus

  • FastDrain: Removing Page Victimization Overheads in NVMe Storage Stack
    arXiv.cs.OS Pub Date : 2020-06-16
    Jie Zhang; Miryeong Kwon; Sanghyun Han; Nam Sung Kim; Mahmut Kandemir; Myoungsoo Jung

    Host-side page victimizations can easily overflow the SSD internal buffer, which interferes with I/O services of diverse user applications, thereby degrading user-level experiences. To address this, we propose FastDrain, a co-design of OS kernel and flash firmware to avoid the buffer overflow caused by page victimizations. Specifically, FastDrain can detect a triggering point where a near-future page victimization

  • An Adaptive Approach to Recoverable Mutual Exclusion
    arXiv.cs.OS Pub Date : 2020-06-12
    Sahil Dhoked; Neeraj Mittal

    Mutual exclusion (ME) is one of the most commonly used techniques to handle conflicts in concurrent systems. Traditionally, mutual exclusion algorithms have been designed under the assumption that a process does not fail while acquiring/releasing a lock or while executing its critical section. However, failures do occur in real life, potentially leaving the lock in an inconsistent state. This gives

  • Nefele: Process Orchestration for the Cloud
    arXiv.cs.OS Pub Date : 2020-06-12
    Mina Sedaghat; Pontus Sköldström; Daniell Turull; Vinay Yadhav; Joacim Halén; Madhubala Ganesan; Amardeep Mehta; Wolfgang John

    Virtualization, either at OS- or hardware level, plays an important role in cloud computing. It enables easier automation and faster deployment in distributed environments. While virtualized infrastructures provide a level of management flexibility, they lack practical abstraction of the distributed resources. A developer in such an environment still needs to deal with all the complications of building

  • The Art of CPU-Pinning: Evaluating and Improving the Performance of Virtualization and Containerization Platforms
    arXiv.cs.OS Pub Date : 2020-06-03
    Davood Ghatreh Samani; Chavit Denninnart; Josef Bacik; Mohsen Amini Salehi

    Cloud providers offer a variety of execution platforms in the form of bare-metal, VM, and containers. However, due to the pros and cons of each execution platform, choosing the appropriate platform for a specific cloud-based application has become a challenge for solution architects. The possibility to combine these platforms (e.g. deploying containers within VMs) offers new capacities that make the challenge

  • Flex: Closing the Gaps between Usage and Allocation
    arXiv.cs.OS Pub Date : 2020-06-02
    Tan N. Le; Zhenhua Liu

    Data centers are giant factories of Internet data and services. Worldwide, data centers consume more energy and produce more emissions than the airline industry. Unfortunately, most data centers are significantly underutilized. One of the major reasons is the big gaps between the real usage and the provisioned resources because users tend to over-estimate their demand and data center operators often rely on

  • Memory virtualization in virtualized systems: segmentation is better than paging
    arXiv.cs.OS Pub Date : 2020-05-30
    Boris Teabe; Peterson Yuhala; Alain Tchana; Fabien Hermenier; Daniel Hagimont; Gilles Muller

    The utilization of paging for virtual machine (VM) memory management is the root cause of memory virtualization overhead. This paper shows that paging is not necessary in the hypervisor. In fact, memory fragmentation, which explains paging utilization, is not an issue in virtualized datacenters thanks to VM memory demand patterns. Our solution Compromis, a novel Memory Management Unit, uses direct

  • Study of Firecracker MicroVM
    arXiv.cs.OS Pub Date : 2020-05-26
    Madhur Jain

    Firecracker is a virtualization technology that makes use of Kernel Virtual Machine (KVM). Firecracker belongs to a new virtualization class named the micro-virtual machines (MicroVMs). Using Firecracker, we can launch lightweight MicroVMs in non-virtualized environments in a fraction of a second, at the same time offering the security and workload isolation provided by traditional VMs and also the

  • Autonomous Task Dropping Mechanism to Achieve Robustness in Heterogeneous Computing Systems
    arXiv.cs.OS Pub Date : 2020-05-22
    Ali Mokhtari; Chavit Denninnart; Mohsen Amini Salehi

    Robustness of a distributed computing system is defined as the ability to maintain its performance in the presence of uncertain parameters. Uncertainty is a key problem in heterogeneous (and even homogeneous) distributed computing systems that perturbs system robustness. Notably, the performance of these systems is perturbed by uncertainty in both task execution time and arrival. Accordingly, our goal

  • A Way Around UMIP and Descriptor-Table Exiting via TSX-based Side-Channel Attack
    arXiv.cs.OS Pub Date : 2020-05-20
    Mohammad Sina Karvandi; Saleh Khalaj Monfared; Mohammad Sina Kiarostami; Dara Rahmati; Saeid Gorgin

    Nowadays, in operating systems, numerous protection mechanisms prevent or limit user-mode applications from accessing the kernel's internal information. This is regularly carried out by software-based defenses such as Address Space Layout Randomization (ASLR) and Kernel ASLR (KASLR). They play pronounced roles when the security of sandboxed applications such as web browsers is considered. Armed with

  • High Velocity Kernel File Systems with Bento
    arXiv.cs.OS Pub Date : 2020-05-19
    Samantha Miller (University of Washington); Kaiyuan Zhang (University of Washington); Danyang Zhuo (Duke University); Tom Anderson (University of Washington)

    High development velocity is critical for modern cloud systems. However, rapid development and release cycles have mostly skipped operating systems. Modifications to behavior in Linux, the most widely used server operating system in the cloud, must be done slowly to minimize risk of introducing bugs, be limited in scope, or be implemented in userspace with a potential performance penalty. We propose

  • Exploiting Inter- and Intra-Memory Asymmetries for Data Mapping in Hybrid Tiered-Memories
    arXiv.cs.OS Pub Date : 2020-05-10
    Shihao Song; Anup Das; Nagarajan Kandasamy

    Modern computing systems are embracing hybrid memory comprising DRAM and non-volatile memory (NVM) to combine the best properties of both memory technologies, achieving low latency, high reliability, and high density. A prominent characteristic of DRAM-NVM hybrid memory is that it has NVM access latency much higher than DRAM access latency. We call this inter-memory asymmetry. We observe that parasitic

  • Improving Phase Change Memory Performance with Data Content Aware Access
    arXiv.cs.OS Pub Date : 2020-05-10
    Shihao Song; Anup Das; Onur Mutlu; Nagarajan Kandasamy

    A prominent characteristic of write operation in Phase-Change Memory (PCM) is that its latency and energy are sensitive to the data to be written as well as the content that is overwritten. We observe that overwriting unknown memory content can incur significantly higher latency and energy compared to overwriting known all-zeros or all-ones content. This is because all-zeros or all-ones content is

  • On Failure Diagnosis of the Storage Stack
    arXiv.cs.OS Pub Date : 2020-05-06
    Duo Zhang; Om Rameshwar Gatla; Runzhou Han; Mai Zheng

    Diagnosing storage system failures is challenging even for professionals. One example is the "When Solid State Drives Are Not That Solid" incident that occurred at the Algolia data center, where Samsung SSDs were mistakenly blamed for failures caused by a Linux kernel bug. As system complexity keeps increasing, such obscure failures will likely occur more often. As one step to address the challenge, we

  • Dim Silicon and the Case for Improved DVFS Policies
    arXiv.cs.OS Pub Date : 2020-05-04
    Mathias Gottschlag; Yussuf Khalil; Frank Bellosa

    Due to thermal and power supply limits, modern Intel CPUs reduce their frequency when AVX2 and AVX-512 instructions are executed. As the CPUs wait for 670 µs before increasing the frequency again, the performance of some heterogeneous workloads is reduced. In this paper, we describe parallels between this situation and dynamic power management as well as between the policy implemented by these CPUs

  • Designing Robust API Monitoring Solutions
    arXiv.cs.OS Pub Date : 2020-05-01
    Simone Nicchi; Matteo Mariani; Matteo Marini; Federico Palmaro; Daniele Cono D'Elia

    Tracing the sequence of library and system calls made by a program is very helpful in the characterization of its interactions with the environment and ultimately of its semantics. Due to entanglements of real-world software stacks, this task can become challenging as we take accuracy, reliability, and transparency aspects into the equation. In this paper we report on our experience in designing and

  • Vilamb: Low Overhead Asynchronous Redundancy for Direct Access NVM
    arXiv.cs.OS Pub Date : 2020-04-20
    Rajat Kateja; Andy Pavlo; Gregory R. Ganger

    Vilamb provides efficient asynchronous system-redundancy for direct access (DAX) non-volatile memory (NVM) storage. Production storage deployments often use system-redundancy in the form of page checksums and cross-page parity. State-of-the-art solutions for maintaining system-redundancy for DAX NVM either incur a high performance overhead or require specialized hardware. The Vilamb user-space library maintains

  • A Linux Kernel Scheduler Extension for Multi-core Systems
    arXiv.cs.OS Pub Date : 2020-04-14
    Aleix Roca; Samuel Rodríguez; Albert Segura; Kevin Marquet; Vicenç Beltran

    The Linux kernel is mostly designed for multi-programmed environments, but high-performance applications have other requirements. Such applications are run standalone, and usually rely on runtime systems to distribute the application's workload on worker threads, one per core. However, due to current OS limitations, it is not feasible to track whether workers are actually running or blocked due to

  • Hardware Memory Management for Future Mobile Hybrid Memory Systems
    arXiv.cs.OS Pub Date : 2020-04-12
    Fei Wen; Mian Qin; Paul Gratz; Narasimha Reddy

    The current mobile applications have rapidly growing memory footprints, posing a great challenge for memory system design. Insufficient DRAM main memory will incur frequent data swaps between memory and storage, a process that hurts performance, consumes energy and deteriorates the write endurance of typical flash storage devices. Alternately, a larger DRAM has higher leakage power and drains the battery

  • Accelerating Filesystem Checking and Repair with pFSCK
    arXiv.cs.OS Pub Date : 2020-04-12
    David Domingo; Kyle Stratton; Sudarsun Kannan

    File system checking and recovery (C/R) tools play a pivotal role in increasing the reliability of storage software, identifying and correcting file system inconsistencies. However, with increasing disk capacity and data content, file system C/R tools notoriously suffer from long runtimes. We posit that current file system checkers fail to exploit CPU parallelism and high throughput offered by modern

  • Efficient Kernel Object Management for Tiered Memory Systems with KLOC
    arXiv.cs.OS Pub Date : 2020-04-09
    Sudarsun Kannan; Yujie Ren; Abhishek Bhattacharjee

    Software-controlled heterogeneous memory systems have the potential to improve performance, efficiency, and cost tradeoffs in emerging systems. Delivering on this promise requires efficient operating system (OS) mechanisms and policies for data management. Unfortunately, modern OSes do not support efficient tiering of data between heterogeneous memories. While this problem is known (and is being

  • μTiles: Efficient Intra-Process Privilege Enforcement of Memory Regions
    arXiv.cs.OS Pub Date : 2020-04-09
    Zahra Tarkhani; Anil Madhavapeddy

    With the alarming rate of security advisories and privacy concerns on connected devices, there is an urgent need for strong isolation guarantees in resource-constrained devices that demand very lightweight solutions. However, the status quo is that Unix-like operating systems do not offer privilege separation inside a process. Lack of practical fine-grained compartmentalization inside a shared address

  • Duplication of Windows Services
    arXiv.cs.OS Pub Date : 2016-08-13
    Zhiyong Shan; Xin Wang; Tzi-cker Chiueh; Rajiv Bagai

    OS-level virtualization, which virtualizes system resources at the system call interface, has the distinct advantage of smaller run-time resource requirements as compared to HAL-level virtualization techniques, and thus forms an important building block for virtualizing parallel and distributed applications such as HPC clusters. Because the Windows operating system puts certain critical functionalities

  • SoftWear: Software-Only In-Memory Wear-Leveling for Non-Volatile Main Memory
    arXiv.cs.OS Pub Date : 2020-04-07
    Christian Hakert; Kuan-Hsun Chen; Paul R. Genssler; Georg von der Brüggen; Lars Bauer; Hussam Amrouch; Jian-Jia Chen; Jörg Henkel

    Several emerging technologies for byte-addressable non-volatile memory (NVM) have been considered to replace DRAM as the main memory in computer systems during the last years. The disadvantage of a lower write endurance, compared to DRAM, of NVM technologies like Phase-Change Memory (PCM) or Ferroelectric RAM (FeRAM) has been addressed in the literature. As a solution, in-memory wear-leveling techniques

  • FLIC: A Distributed Fog Cache for City-Scale Applications
    arXiv.cs.OS Pub Date : 2020-03-25
    Jack West; Neil Klingensmith; George K. Thiruvathukal

    We present FLIC, a distributed software data caching framework for fogs that reduces network traffic and latency. FLIC is targeted toward city-scale deployments of cooperative IoT devices in which each node gathers and shares data with surrounding devices. As machine learning and other data processing techniques that require large volumes of training data are ported to low-cost and low-power IoT systems

  • Co-Optimizing Performance and Memory Footprint Via Integrated CPU/GPU Memory Management, an Implementation on Autonomous Driving Platform
    arXiv.cs.OS Pub Date : 2020-03-17
    Soroush Bateni; Zhendong Wang; Yuankun Zhu; Yang Hu; Cong Liu

    Cutting-edge embedded system applications, such as self-driving cars and unmanned drone software, are reliant on integrated CPU/GPU platforms for their DNNs-driven workload, such as perception and other highly parallel components. In this work, we set out to explore the hidden performance implication of GPU memory management methods of integrated CPU/GPU architecture. Through a series of experiments

  • Dynamic Budget Management with Service Guarantees for Mixed-Criticality Systems
    arXiv.cs.OS Pub Date : 2020-03-11
    Xiaozhe Gu; Arvind Easwaran

    Many existing studies on mixed-criticality (MC) scheduling assume that low-criticality budgets for high-criticality applications are known a priori. These budgets are primarily used as guidance to determine when the scheduler should switch the system mode from low to high. Based on this key observation, in this paper we propose a dynamic MC scheduling model under which low-criticality budgets for individual

  • Efficient Schedulability Test for Dynamic-Priority Scheduling of Mixed-Criticality Real-Time Systems
    arXiv.cs.OS Pub Date : 2020-03-11
    Xiaozhe Gu; Arvind Easwaran

    Systems in many safety-critical application domains are subject to certification requirements. In such a system, there are typically different applications providing functionalities that have varying degrees of criticality. Consequently, the certification requirements for functionalities at these different criticality levels are also varying, with very high levels of assurance required for a highly

  • Multi-Rate Fluid Scheduling of Mixed-Criticality Systems on Multiprocessors
    arXiv.cs.OS Pub Date : 2020-03-11
    Saravanan Ramanathan; Arvind Easwaran; Hyeonjoong Cho

    In this paper we consider the problem of mixed-criticality (MC) scheduling of implicit-deadline sporadic task systems on a homogeneous multiprocessor platform. Focusing on dual-criticality systems, algorithms based on the fluid scheduling model have been proposed in the past. These algorithms use a dual-rate execution model for each high-criticality task depending on the system mode. Once the system

  • Fissile Locks
    arXiv.cs.OS Pub Date : 2020-03-10
    Dave Dice; Alex Kogan

    Classic test-and-set (TS) mutual exclusion locks are simple, and enjoy high performance and low latency of ownership transfer under light or no contention. However, they do not scale gracefully under high contention and do not provide any admission order guarantees. Such concerns led to the development of scalable queue-based locks, such as a recent Compact NUMA-aware (CNA) lock, a variant of another

  • Data Centers Job Scheduling with Deep Reinforcement Learning
    arXiv.cs.OS Pub Date : 2019-09-16
    Sisheng Liang; Zhou Yang; Fang Jin; Yong Chen

    Efficient job scheduling on data centers under heterogeneous complexity is crucial but challenging since it involves the allocation of multi-dimensional resources over time and space. To adapt to the complex computing environment in data centers, we proposed an innovative Advantage Actor-Critic (A2C) deep reinforcement learning based approach called A2cScheduler for job scheduling. A2cScheduler consists

  • Bringing Inter-Thread Cache Benefits to Federated Scheduling -- Extended Results & Technical Report
    arXiv.cs.OS Pub Date : 2020-02-28
    Corey Tessler; Venkata P. Modekurthy; Nathan Fisher; Abusayeed Saifullah

    Multiprocessor scheduling of hard real-time tasks modeled by directed acyclic graphs (DAGs) exploits the inherent parallelism presented by the model. For DAG tasks, a node represents a request to execute an object on one of the available processors. In one DAG task, there may be multiple execution requests for one object, each represented by a distinct node. These distinct execution requests offer

  • Virtual Gang based Scheduling of Real-Time Tasks on Multicore Platforms
    arXiv.cs.OS Pub Date : 2019-12-23
    Waqar Ali; Rodolfo Pellizzoni; Heechul Yun

    We propose a virtual-gang based parallel real-time task scheduling approach for multicore platforms. Our approach is based on the notion of a virtual-gang, which is a group of parallel real-time tasks that are statically linked and scheduled together by a gang scheduler. We present a light-weight intra-gang synchronization framework, called RTG-Sync, and virtual gang formation algorithms that provide

  • Safe and Efficient Remote Application Code Execution on Disaggregated NVM Storage with eBPF
    arXiv.cs.OS Pub Date : 2020-02-25
    Kornilios Kourtis; Animesh Trivedi; Nikolas Ioannou

    With rapid improvements in NVM storage devices, the performance bottleneck is gradually shifting to the network, thus giving rise to the notion of "data movement wall". To reduce the amount of data movement over the network, researchers have proposed near-data computing by shipping operations and compute-extensions closer to storage devices. However, running arbitrary, user-provided extensions in a

  • A Recurrent Neural Network Based Patch Recommender for Linux Kernel Bugs
    arXiv.cs.OS Pub Date : 2020-02-19
    Anusha Bableshwar; Arun Ravindran; Manoj Iyer

    Software bugs in a production environment have an undesirable impact on quality of service, unplanned system downtime, and disruption in good customer experience, resulting in loss of revenue and reputation. Existing approaches to automated software bug repair focus on known bug templates detected using static code analysis tools and test suites, and on automatic generation of patch code for these

  • LibrettOS: A Dynamically Adaptable Multiserver-Library OS
    arXiv.cs.OS Pub Date : 2020-02-20
    Ruslan Nikolaev; Mincheol Sung; Binoy Ravindran

    We present LibrettOS, an OS design that fuses two paradigms to simultaneously address issues of isolation, performance, compatibility, failure recoverability, and run-time upgrades. LibrettOS acts as a microkernel OS that runs servers in an isolated manner. LibrettOS can also act as a library OS when, for better performance, selected applications are granted exclusive access to virtual hardware resources

  • Privaros: A Framework for Privacy-Compliant Drones
    arXiv.cs.OS Pub Date : 2020-02-16
    Rakesh Rajan Beck; Abhishek Vijeev; Vinod Ganapathy

    We present Privaros, a framework to enforce privacy policies on drones. Privaros is designed for commercial delivery drones, such as the ones that will likely be used by Amazon Prime Air. Such drones visit a number of host airspaces, each of which may have different privacy requirements. Privaros provides an information flow control framework to enforce the policies of these hosts on the guest delivery

  • ExplFrame: Exploiting Page Frame Cache for Fault Analysis of Block Ciphers
    arXiv.cs.OS Pub Date : 2019-05-30
    Anirban Chakraborty; Sarani Bhattacharya; Sayandeep Saha; Debdeep Mukhopadhyay

    Page Frame Cache (PFC) is a purely software cache, present in modern Linux based operating systems (OS), which stores the page frames that are recently being released by the processes running on a particular CPU. In this paper, we show that the page frame cache can be maliciously exploited by an adversary to steer the pages of a victim process to some pre-decided attacker-chosen locations in the memory

  • Intel Page Modification Logging, a hardware virtualization feature: study and improvement for virtual machine working set estimation
    arXiv.cs.OS Pub Date : 2020-01-26
    Stella Bitchebe; Djob Mvondo; Alain Tchana; Laurent Réveillère; Noël De Palma

    Intel Page Modification Logging (PML) is a novel hardware feature for tracking virtual machine (VM) accessed memory pages. This task is essential in today's data centers since it allows, among others, checkpointing, live migration and working set size (WSS) estimation. Relying on the Xen hypervisor, this paper studies PML from three angles: power consumption, efficiency, and performance impact on user

  • AppStreamer: Reducing Storage Requirements of Mobile Games through Predictive Streaming
    arXiv.cs.OS Pub Date : 2019-12-16
    Nawanol Theera-Ampornpunt; Shikhar Suryavansh; Sameer Manchanda; Rajesh Panta; Kaustubh Joshi; Mostafa Ammar; Mung Chiang; Saurabh Bagchi

    Storage has become a constrained resource on smartphones. Gaming is a popular activity on mobile devices and the explosive growth in the number of games coupled with their growing size contributes to the storage crunch. Even where storage is plentiful, it takes a long time to download and install a heavy app before it can be launched. This paper presents AppStreamer, a novel technique for reducing

  • SPARTA: A Divide and Conquer Approach to Address Translation for Accelerators
    arXiv.cs.OS Pub Date : 2020-01-20
    Javier Picorel; Seyed Alireza Sanaee Kohroudi; Zi Yan; Abhishek Bhattacharjee; Babak Falsafi; Djordje Jevdjic

    Virtual memory (VM) is critical to the usability and programmability of hardware accelerators. Unfortunately, implementing accelerator VM efficiently is challenging because the area and power constraints make it difficult to employ the large multi-level TLBs used in general-purpose CPUs. Recent research proposals advocate a number of restrictions on virtual-to-physical address mappings in order to
