Baseten

Welcome to Baseten

Baseten is an AI infrastructure platform. We combine applied performance research, distributed multi-cloud infrastructure, and developer tooling to run models of all modalities in production.

Get started:

Deploy an open-source model in two clicks from the model library.
Read our docs to package and serve a fine-tuned or custom model.

Pinned Loading

The simplest way to serve AI/ML models in production

Python 1.2k 107
Examples of models deployable with Truss

Python 227 63

Repositories

Showing 10 of 107 repositories

truss Public
The simplest way to serve AI/ML models in production

basetenlabs/truss’s past year of commit activity
basetenlabs/autocomp’s past year of commit activity

Python 0 BSD-3-Clause
14 0 8
Updated Jun 18, 2026
basetenlabs/DeepGEMM’s past year of commit activity

Cuda 0 MIT
1,058 0 0
Updated Jun 18, 2026
basetenlabs/baseten-go’s past year of commit activity

Go
1
MIT 0
1 0
Updated Jun 17, 2026
basetenlabs/baseten-cli’s past year of commit activity

Go
15
MIT 0
0 3
Updated Jun 17, 2026
ml-cookbook Public
Ready-to-use ML training recipes to help you build and deploy models on Baseten.

basetenlabs/ml-cookbook’s past year of commit activity

Python
55
MIT
8 0 21
Updated Jun 17, 2026
basetenlabs/langchain-baseten’s past year of commit activity

Python 0 MIT
1 0 1
Updated Jun 17, 2026
basetenlabs/truss-examples’s past year of commit activity
Model-Optimizer Public Forked from NVIDIA/Model-Optimizer
A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.

basetenlabs/Model-Optimizer’s past year of commit activity

Python
1
Apache-2.0
453 0 14
Updated Jun 15, 2026
genai-bench Public Forked from sgl-project/genai-bench
Genai-bench is a powerful benchmark tool designed for comprehensive token-level performance evaluation of large language model (LLM) serving systems.

basetenlabs/genai-bench’s past year of commit activity

Python
2
MIT
51 0 6
Updated Jun 13, 2026