The Rapids Training Workshop Prepares Your Data Team For GPU Computing & Analytics Easily
Rapids Training Workshop Overview
This half-day course introduces the open-source Python RAPIDS libraries for accelerating computation with GPUs (graphics processing units). Participants practice using the RAPIDS libraries for common ETL and machine learning workloads without having to program with low-level languages like C++.
We assume participants have prior experience using the Python language and, in particular, using standard Python tools for data analysis (notably NumPy, Pandas, Jupyter). No prior experience with GPU programming is required (although some prior exposure to Dask, while not mandatory, will be helpful).
At the conclusion of this course, participants will be able to:
●Verify the availability of GPU hardware on a given system for accelerated performance.
●Explain relevant GPU computing concepts in the context of data analysis pipelines.
●Identify opportunities for GPU computation in existing Python data analysis pipelines.
●Extend example Pandas/Scikit-Learn pipelines to scalable GPU-pipelines with RAPIDS.
●Construct scalable machine learning pipelines in Python using RAPIDS from scratch.
●Identify when GPU dataframes can benefit from using Dask-cuDF instead of cuDF for improved performance.