Heterogeneous GPU reallocation

Wednesday July 5th, 1-2PM @ BA5205

Speaker: James Gleeson

Title:
Heterogeneous GPU reallocation

Abstract:
Emerging cloud markets like spot markets and batch computing services scale up services at the granularity of whole VMs. In this paper, we observe that GPU workloads underutilize GPU device memory, leading us to explore the benefits of reallocating heterogeneous GPUs within existing VMs. We outline approaches for upgrading and downgrading GPUs for OpenCL GPGPU workloads, and show how to minimize the chance of cloud operator VM termination by maximizing the heterogeneous environments in which applications can run.

Bio:
James is a PhD student under Eyal de Lara.  He has done research in mobile security for both physical and software attacks on Android phones.  His current research interests are in heterogeneous computing in data centers.

Crane: Fast and Migratable GPU Passthrough for OpenCL applications

Wednesday May 17th, 12-1PM @ BA5205

Speaker: James Gleeson

Title:
Crane: Fast and Migratable GPU Passthrough for OpenCL applications

Abstract:
General purpose GPU (GPGPU) computing in virtualized environments leverages PCI passthrough to achieve GPU performance comparable to bare-metal execution. However, GPU passthrough prevents service administrators from performing virtual machine migration between physical hosts.
Crane is a new technique for virtualizing OpenCL-based GPGPU computing that achieves within 5.25% of passthrough GPU performance while supporting VM migration. Crane interposes a virtualization-aware OpenCL library that makes it possible to reclaim and subsequently reassign physical GPUs to a VM without terminating the guest or its applications. Crane also enables continued GPU operation while the VM is undergoing live migration by transparently switching between GPU passthrough operation and API remoting.
 

Bio:
James is a PhD student under Eyal de Lara.  He has done research in mobile security for both physical and software attacks on Android phones.  His current research interests are in heterogeneous computing in data centers.