groupreduction and subgroupreduction
Created by: brabreda
I am unsure why my previous PR closed but here are the changes.
- I added docs
- I added tests
It was my first time writing tests, and they passed. How are these tested on GPUs, and do I need more tests?