◐ Shell
clean mode source ↗

Baseten

Welcome to Baseten

Baseten is an AI infrastructure platform. We combine applied performance research, distributed multi-cloud infrastructure, and developer tooling to run models of all modalities in production.

Get started:

  • Deploy an open-source model in two clicks from the model library.
  • Read our docs to package and serve a fine-tuned or custom model.

Pinned Loading

  1. The simplest way to serve AI/ML models in production

    Python 1.2k 107

  2. Examples of models deployable with Truss

    Python 227 63

Repositories

Showing 10 of 107 repositories

  • truss Public

    The simplest way to serve AI/ML models in production

    basetenlabs/truss’s past year of commit activity

  • basetenlabs/autocomp’s past year of commit activity

    Python 0 BSD-3-Clause

    14 0 8

    Updated Jun 18, 2026

  • basetenlabs/DeepGEMM’s past year of commit activity

    Cuda 0 MIT

    1,058 0 0

    Updated Jun 18, 2026

  • basetenlabs/baseten-go’s past year of commit activity

    Go

    1

    MIT 0

    1 0

    Updated Jun 17, 2026

  • basetenlabs/baseten-cli’s past year of commit activity

    Go

    15

    MIT 0

    0 3

    Updated Jun 17, 2026

  • ml-cookbook Public

    Ready-to-use ML training recipes to help you build and deploy models on Baseten.

    basetenlabs/ml-cookbook’s past year of commit activity

    Python

    55

    MIT

    8 0 21

    Updated Jun 17, 2026

  • basetenlabs/langchain-baseten’s past year of commit activity

    Python 0 MIT

    1 0 1

    Updated Jun 17, 2026

  • basetenlabs/truss-examples’s past year of commit activity

  • Model-Optimizer Public Forked from NVIDIA/Model-Optimizer

    A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.

    basetenlabs/Model-Optimizer’s past year of commit activity

    Python

    1

    Apache-2.0

    453 0 14

    Updated Jun 15, 2026

  • genai-bench Public Forked from sgl-project/genai-bench

    Genai-bench is a powerful benchmark tool designed for comprehensive token-level performance evaluation of large language model (LLM) serving systems.

    basetenlabs/genai-bench’s past year of commit activity

    Python

    2

    MIT

    51 0 6

    Updated Jun 13, 2026