Skip to content
View srush's full-sized avatar

Block or report srush

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

NumPy+Jax with named axes and an uncompromising attitude

Jupyter Notebook 20 1 Updated Mar 4, 2025

Minimalistic 4D-parallelism distributed training framework for education purpose

Python 985 78 Updated Mar 7, 2025

Stochastic Automatic Differentiation library for PyTorch.

Python 198 5 Updated Aug 30, 2024

A curated list for awesome discrete diffusion models resources.

294 12 Updated Feb 5, 2025

Custom triton kernels for training Karpathy's nanoGPT.

Python 18 Updated Oct 21, 2024

Minimal LLM inference in Rust

Rust 984 32 Updated Oct 24, 2024

Commit0: Library Generation from Scratch

Python 143 10 Updated Apr 9, 2025

Any model. Any hardware. Zero compromise. Built with @ziglang / @openxla / MLIR / @bazelbuild

Zig 2,205 78 Updated Apr 14, 2025

Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"

Python 22 Updated Aug 28, 2024

🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton

Python 2,258 145 Updated Apr 14, 2025

TensorDict is a pytorch dedicated tensor container.

Python 907 86 Updated Apr 14, 2025

Tile primitives for speedy kernels

Cuda 2,256 134 Updated Apr 14, 2025

Accelerated First Order Parallel Associative Scan

Python 181 8 Updated Aug 20, 2024

Linear algebra foundation for the Rust programming language

Rust 2,109 78 Updated Apr 14, 2025

A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.

Python 529 28 Updated Feb 19, 2025

I have no idea what I'm doing , but llm.c in rust

Python 12 Updated Jul 16, 2024

LLM training in simple, raw C/CUDA

Cuda 26,337 3,026 Updated Oct 2, 2024

CUDA Templates for Linear Algebra Subroutines

C++ 7,287 1,195 Updated Apr 10, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 13,186 1,507 Updated Apr 15, 2025

Designing bridge trusses with Pytorch autograd

Jupyter Notebook 61 4 Updated Feb 4, 2024

Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX

Python 83 4 Updated Jan 25, 2024

Extract full next-token probabilities via language model APIs

Python 240 13 Updated Feb 23, 2024

Mamba SSM architecture

Python 14,571 1,268 Updated Apr 1, 2025

utilities for decoding deep representations (like sentence embeddings) back to text

Python 793 90 Updated Apr 14, 2025
Jupyter Notebook 8,300 593 Updated Jun 16, 2024

Turn an epub or text file into an audiobook

Python 734 62 Updated Apr 13, 2025

Robust recipes to align language models with human and AI preferences

Python 5,127 440 Updated Nov 21, 2024

A work in progress. Trying to write about all interesting or necessary pieces in the current development of LLMs and generative AI. Gradually adding more topics.

Jupyter Notebook 193 12 Updated Sep 14, 2023
Next
Showing results