Skip to content

Pinned Loading

  1. flashinfer flashinfer Public

    FlashInfer: Kernel Library for LLM Serving

    Cuda 2.7k 286

  2. whl whl Public

    Pre-built wheels for flashinfer python package.

    HTML 1

Repositories

Showing 10 of 12 repositories
  • flashinfer Public

    FlashInfer: Kernel Library for LLM Serving

    flashinfer-ai/flashinfer’s past year of commit activity
    Cuda 2,713 Apache-2.0 286 98 14 Updated Apr 22, 2025
  • tg4perfetto Public Forked from ihavnoid/tg4perfetto

    Simple python library for generating your own perfetto traces for your application. Can be used for both app instrumentation and custom trace generation (for your own purposes)

    flashinfer-ai/tg4perfetto’s past year of commit activity
    Python 0 Apache-2.0 5 0 0 Updated Apr 16, 2025
  • cutlass-viz Public
    flashinfer-ai/cutlass-viz’s past year of commit activity
    Python 55 Apache-2.0 1 0 0 Updated Apr 12, 2025
  • flashinfer-nightly Public

    FlashInfer Nightly

    flashinfer-ai/flashinfer-nightly’s past year of commit activity
    6 MIT 1 0 0 Updated Apr 9, 2025
  • whl Public

    Pre-built wheels for flashinfer python package.

    flashinfer-ai/whl’s past year of commit activity
    HTML 0 1 0 0 Updated Apr 7, 2025
  • flashinfer-ai/performance-tracking’s past year of commit activity
    4 Apache-2.0 0 0 0 Updated Apr 2, 2025
  • flashinfer-ai.github.io Public

    Project website of FlashInfer project

    flashinfer-ai/flashinfer-ai.github.io’s past year of commit activity
    SCSS 0 4 1 0 Updated Mar 17, 2025
  • web-data Public
    flashinfer-ai/web-data’s past year of commit activity
    0 Apache-2.0 0 0 0 Updated Mar 5, 2025
  • flashinfer-ai/llm-based-compression’s past year of commit activity
    Jupyter Notebook 2 0 0 0 Updated Jan 10, 2025
  • debug-print Public

    Debug print operator for cudagraph debugging

    flashinfer-ai/debug-print’s past year of commit activity
    Cuda 10 0 0 0 Updated Aug 2, 2024