GPU & Architectures
!!! warn Work in progress The GPU support of SpeedyWeather.jl is still a work in progress and parts of this documentation may lag behind the latest state. We will extend this documentation over time. Don't hesitate to contact us via GitHub issues or mail if you have questions or want to collaborate.
Parts of SpeedyWeather.jl already support GPU acceleration, e.g. the barotropic model. Our development focusses on CUDA GPUs, but other architectures are conceivable in the future, as our approach relies on the device-agnostic KernelAbstractions.jl. The SpeedyWeather.jl submodule Architectures encodes all information about the device we run our models on. To initialize a model on a GPU, we need to load the CUDA package and pass the architecture to the SpectralGrid from which the model is constructed. For example, to initialize a barotropic model on a GPU, we can do the following:
using SpeedyWeather, CUDA
architecture = SpeedyWeather.GPU()
spectral_grid = SpectralGrid(trunc=41, nlayers=1, architecture=architecture)
CUDA.@allowscalar model = BarotropicModel(spectral_grid=spectral_grid)
CUDA.@allowscalar simulation = initialize!(model)
run!(simulation, period=Day(10))
Note that we need to use CUDA.@allowscalar, which permits element-wise (scalar) indexing of GPU arrays, during model construction and initialization. We do not yet support a fully GPU-accelerated model construction and initialization.
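For scripts that should also run on machines without a usable GPU, the architecture can be chosen at runtime. The following is a minimal sketch, not a recommended setup. It assumes that SpeedyWeather.CPU() is the CPU counterpart of SpeedyWeather.GPU() (it is not shown above) and uses CUDA.functional() from CUDA.jl to check for a usable device:

```julia
using SpeedyWeather, CUDA

# Use the GPU if CUDA.jl finds a usable device, otherwise fall back to the CPU.
# Assumption: SpeedyWeather.CPU() constructs the CPU architecture.
architecture = CUDA.functional() ? SpeedyWeather.GPU() : SpeedyWeather.CPU()

spectral_grid = SpectralGrid(trunc=41, nlayers=1, architecture=architecture)

# Scalar indexing only needs to be allowed for GPU arrays;
# on plain CPU arrays the macro changes nothing.
CUDA.@allowscalar model = BarotropicModel(spectral_grid=spectral_grid)
CUDA.@allowscalar simulation = initialize!(model)
run!(simulation, period=Day(10))
```

Because the architecture is only passed to the SpectralGrid, the rest of the script is identical for both devices.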
Architectures Utilities
To easily transfer our data structures between the CPU (e.g. for plotting and output) and the GPU, we have the following utilities that make use of the architecture object defined above:
SpeedyWeather.Architectures.on_architecture (Function)
on_architecture(arch::AbstractArchitecture, a)
Return the argument a, but on the architecture arch.
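As an illustration of on_architecture, the sketch below moves a plain array to the GPU and back. It is hedged in several ways: SpeedyWeather.CPU() is assumed to be the CPU counterpart of SpeedyWeather.GPU(), the transfer is assumed to work for ordinary arrays as the docstring suggests, and the GPU part is skipped when CUDA.functional() reports no usable device.

```julia
using SpeedyWeather, CUDA
using SpeedyWeather.Architectures: on_architecture

a = rand(Float32, 8, 4)                  # ordinary array in CPU memory

if CUDA.functional()
    gpu = SpeedyWeather.GPU()
    cpu = SpeedyWeather.CPU()            # assumption: CPU counterpart of GPU()

    a_gpu = on_architecture(gpu, a)      # same data, but on the GPU
    a_cpu = on_architecture(cpu, a_gpu)  # back on the CPU, e.g. for plotting

    @assert a_cpu == a                   # the round trip should preserve the values
end
```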
SpeedyWeather.Architectures.array_type (Function)
array_type(arch::AbstractArchitecture)
Return the array type that's used with the architecture arch.

array_type(::AbstractArchitecture, NF::Type, N::Int)
Return the concrete array type that's used with the architecture for a number type NF and dimension N.
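One possible use of array_type is to allocate arrays without hard-coding the device. The sketch below uses the CPU so that it runs without a GPU. It assumes that SpeedyWeather.CPU() constructs the CPU architecture and that the returned concrete type can be called with the usual undef constructor, as Julia array types generally can.

```julia
using SpeedyWeather
using SpeedyWeather.Architectures: array_type

arch = SpeedyWeather.CPU()                  # assumption: CPU architecture constructor

ArrayType  = array_type(arch)               # array type used with this architecture
MatrixType = array_type(arch, Float32, 2)   # concrete type: Float32 elements, 2 dimensions

# Allocate an uninitialized 4x3 array of that type and fill it with zeros
A = MatrixType(undef, 4, 3)
fill!(A, 0)
```

Swapping arch for SpeedyWeather.GPU() would then allocate on the GPU instead, without changing the allocation code.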
Benchmarks
More to follow...