GPT Inference in C#

This is a port of Andrej Karpathy's llm.c, focused on CPU inference in C# for small to medium-sized models. See llm.c for the original implementation.