WIP: Asynchronous copy
Created by: leios
This is a WIP PR for asynchronous copies between CPU and GPU streams.
For the CPU variant, I am literally just calling unsafe_copyto!(...).
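For reference, here is a rough sketch of what a plain `unsafe_copyto!` copy on the CPU can look like. The helper name `cpu_copy_sketch!` and its signature are made up for illustration and are not the code in this PR; only `unsafe_copyto!`, `pointer`, and `GC.@preserve` are from Julia Base.

```julia
# Hypothetical helper for illustration only; not the function added in this PR.
function cpu_copy_sketch!(dest::Array{T}, src::Array{T}, n::Integer) where {T}
    # unsafe_copyto! performs no bounds checking, so validate the length first.
    0 <= n <= min(length(dest), length(src)) ||
        throw(ArgumentError("n must be between 0 and the shorter array length"))
    # GC.@preserve keeps both arrays alive while their raw pointers are in use.
    GC.@preserve dest src begin
        unsafe_copyto!(pointer(dest), pointer(src), n)
    end
    return dest
end

src = collect(1.0:4.0)
dest = zeros(4)
cpu_copy_sketch!(dest, src, 4)
```

This sketch is synchronous and assumes an `isbits` element type. Any event or dependency handling would sit around the copy and is left out here.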
I need to properly use pinned memory for the GPU->GPU case as well. Working on it.
This fixes #40 (closed) (hopefully)