This is a suite of hands-on training materials that shows how to scale CV, NLP, and time-series forecasting workloads with Ray.
Updated Feb 13, 2024 - Jupyter Notebook
Boosting DL Service Throughput 1.5-4x via Ensemble Pipeline Serving with Concurrent CUDA Streams, Supporting PyTorch/LibTorch Frontends and TensorRT/CVCUDA (and Other) Backends
Building Real-Time Inference Pipelines with Ray Serve
A drop-in replacement for FastAPI that enables scalable, fault-tolerant deployments with Ray Serve
A Production-Ready, Scalable RAG-powered LLM-based Context-Aware QA App
This MLOps repository contains Python modules for distributed model training, tuning, and serving using PyTorch and Ray, a distributed computing framework.
Contains the basic structure that a model-serving application should have; the implementation is based on the Ray Serve framework.