Skip to content
@basetenlabs

Baseten

Machine learning infrastructure for developers

Welcome to Baseten

Baseten is an AI infrastructure platform. We combine applied performance research, distributed multi-cloud infrastructure, and developer tooling to run models of all modalities in production.

Get started:

  • Deploy an open-source model in two clicks from the model library.
  • Read our docs to package and serve a fine-tuned or custom model.

Pinned Loading

  1. truss Public

    The simplest way to serve AI/ML models in production

    Python 981 85

  2. truss-examples Public

    Examples of models deployable with Truss

    Python 169 41

Repositories

Showing 10 of 58 repositories
  • truss-examples Public

    Examples of models deployable with Truss

    Python 169 MIT 41 13 49 Updated Apr 25, 2025
  • truss Public

    The simplest way to serve AI/ML models in production

    Python 981 MIT 85 63 (5 issues need help) 17 Updated Apr 25, 2025
  • dynamo Public Forked from ai-dynamo/dynamo

    A Datacenter Scale Distributed Inference Serving Framework

    Rust 0 Apache-2.0 332 0 2 Updated Apr 24, 2025
  • TensorRT-LLM Public Forked from NVIDIA/TensorRT-LLM

    TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

    C++ 0 Apache-2.0 1,396 0 0 Updated Apr 21, 2025
  • TensorRT-Model-Optimizer Public Forked from NVIDIA/TensorRT-Model-Optimizer

    A unified library of state-of-the-art model optimization techniques such as quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM or TensorRT to optimize inference speed on NVIDIA GPUs.

    Python 0 67 0 2 Updated Apr 18, 2025
  • lws Public Forked from kubernetes-sigs/lws

    LeaderWorkerSet: An API for deploying a group of pods as a unit of replication

    Go 0 Apache-2.0 71 0 1 Updated Apr 16, 2025
  • action-slack Public Forked from 8398a7/action-slack

    Provides the function of slack notification to GitHub Actions.

    TypeScript 0 MIT 140 0 1 Updated Mar 28, 2025
  • create-pull-request Public Forked from peter-evans/create-pull-request

    A GitHub action to create a pull request for changes to your repository in the actions workspace

    TypeScript 0 MIT 513 0 0 Updated Mar 26, 2025
  • 0 1 0 0 Updated Mar 24, 2025
  • honeymarker Public Forked from reconbot/honeymarker

    Add Honeycomb Markers to your GitHub Actions workflows.

    Dockerfile 0 6 0 0 Updated Mar 17, 2025