Neural Networks with Sparse Weights in Rust using GPUs, CPUs, and FPGAs via CUDA, OpenCL, and oneAPI
Updated May 18, 2024 - Rust
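The core idea behind the project in the title is storing only a network's nonzero weights. As an illustrative sketch (not the repository's actual code), a layer with sparse weights can be held in compressed sparse row (CSR) form, where the matrix-vector product skips every zero entry; the `CsrMatrix` type and field names below are assumptions for the example:

```rust
// Minimal CSR (compressed sparse row) representation of a sparse
// weight matrix, illustrating how sparse layers avoid storing zeros.
struct CsrMatrix {
    values: Vec<f32>,       // nonzero weights, row by row
    col_indices: Vec<usize>, // column of each nonzero value
    row_ptr: Vec<usize>,    // start offset of each row; length = rows + 1
}

impl CsrMatrix {
    // y = W * x, touching only the stored nonzero entries.
    fn matvec(&self, x: &[f32]) -> Vec<f32> {
        let rows = self.row_ptr.len() - 1;
        let mut y = vec![0.0f32; rows];
        for r in 0..rows {
            for i in self.row_ptr[r]..self.row_ptr[r + 1] {
                y[r] += self.values[i] * x[self.col_indices[i]];
            }
        }
        y
    }
}

fn main() {
    // The 2x3 matrix [[1, 0, 2], [0, 3, 0]] stored sparsely: 3 values
    // instead of 6.
    let w = CsrMatrix {
        values: vec![1.0, 2.0, 3.0],
        col_indices: vec![0, 2, 1],
        row_ptr: vec![0, 2, 3],
    };
    let y = w.matvec(&[1.0, 1.0, 1.0]);
    println!("{:?}", y); // [3.0, 3.0]
}
```

The same inner loop maps naturally onto GPU or FPGA kernels (one row per thread or processing element), which is what backends such as CUDA, OpenCL, and oneAPI parallelize.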
Tensors and Dynamic neural networks in Python with strong GPU acceleration
A fast, scalable, high-performance gradient boosting on decision trees library, used for ranking, classification, regression, and other machine learning tasks, with APIs for Python, R, Java, and C++. Supports computation on CPU and GPU.
SciML-Bench Benchmarks for Scientific Machine Learning (SciML), Physics-Informed Machine Learning (PIML), and Scientific AI Performance
SkyPilot: Run LLMs, AI, and Batch jobs on any cloud. Get maximum savings, highest GPU availability, and managed execution—all with a simple interface.
Open deep learning compiler stack for CPU, GPU, and specialized accelerators
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
On-device AI across mobile, embedded and edge for PyTorch
A Python package for identifying 42 kinds of animals, training custom models, and estimating distance from camera trap videos
GEOS Simulation Framework
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
Video stabilization using gyroscope data
A Julia package to perform Bifurcation Analysis
Efficient CPU/GPU/Vulkan ML Runtimes for VapourSynth (with built-in support for waifu2x, DPIR, RealESRGANv2/v3, Real-CUGAN, RIFE, SCUNet and more!)
GPU-accelerated, C/C++ neural network library.
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.