Implements the forward pass of GPT-2 in Haskell.
I chose not to use a tensor library like hasktorch (which seems semi-abandoned?) or one of the array combinator DSLs like accelerate, and instead implemented this with the OpenBLAS bindings in hmatrix.
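As a rough illustration (not the literal code in this repo), the building blocks reduce to dense matrix operations that hmatrix hands off to OpenBLAS. A minimal sketch, with illustrative names like `linear`, `gelu` and `layerNorm`:

```haskell
module ForwardSketch where

import Prelude hiding ((<>))
import Numeric.LinearAlgebra

-- Affine map: one row per token position, weights w :: dIn × dOut, bias b.
linear :: Matrix Double -> Vector Double -> Matrix Double -> Matrix Double
linear w b x = (x <> w) + fromRows (replicate (rows x) b)

-- GELU activation (the tanh approximation GPT-2 uses), applied elementwise.
gelu :: Matrix Double -> Matrix Double
gelu = cmap f
  where f v = 0.5 * v * (1 + tanh (sqrt (2 / pi) * (v + 0.044715 * v ** 3)))

-- Layer norm over the feature dimension, one row (token position) at a time.
layerNorm :: Vector Double -> Vector Double -> Matrix Double -> Matrix Double
layerNorm gamma beta = fromRows . map normRow . toRows
  where
    normRow r =
      let n   = fromIntegral (size r)
          mu  = sumElements r / n
          c   = cmap (subtract mu) r
          var = sumElements (c * c) / n
      in gamma * cmap (/ sqrt (var + 1e-5)) c + beta
```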
It performs better than I expected, though the constant factor on attention's quadratic slowdown with respect to context length is noticeably worse than PyTorch's (which uses fast attention kernels).
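To see where the quadratic term comes from, here is a hedged sketch of naive single-head causal attention in hmatrix (the names are made up for the example): an L×L score matrix is materialized and recomputed for the whole context on every forward pass, with no KV cache or fused kernel.

```haskell
module AttentionSketch where

import Prelude hiding ((<>))
import Numeric.LinearAlgebra

-- Naive single-head causal attention. `scores` and `masked` are L×L matrices
-- rebuilt for the full context every time: that is the quadratic cost.
attentionHead :: Matrix Double  -- q, L × dHead
              -> Matrix Double  -- k, L × dHead
              -> Matrix Double  -- v, L × dHead
              -> Matrix Double
attentionHead q k v = softmaxRows masked <> v
  where
    dHead  = fromIntegral (cols q) :: Double
    n      = rows q
    scores = scale (1 / sqrt dHead) (q <> tr k)               -- L × L
    -- causal mask: position i may only attend to positions j <= i
    mask   = build (n, n) (\i j -> if j <= (i :: Double) then 0 else -1/0)
    masked = scores + mask
    softmaxRows = fromRows . map softmax . toRows
    softmax r = let e = cmap exp (cmap (subtract (maxElement r)) r)
                in scale (1 / sumElements e) e
```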
Once the model is loaded, it easily manages 1 token/s on an old ThinkPad (until the context starts to grow).
The tokenizer is no fun at all, so this is decode-only and doesn't handle Unicode very well (the fact that the vocab JSON uses Unicode keys might be causing an issue with Aeson...).
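For the decode direction, the core is just inverting the token→id map from the vocab JSON. A minimal sketch, assuming the file has been downloaded as `encoder.json` and skipping the byte-level Unicode remapping that makes real GPT-2 decoding fiddly:

```haskell
{-# LANGUAGE OverloadedStrings #-}
module DecodeSketch where

import qualified Data.Aeson as A
import qualified Data.ByteString.Lazy as BL
import qualified Data.Map.Strict as M
import qualified Data.Text as T

-- Load GPT-2's encoder.json (token string -> id) and invert it, so token ids
-- coming out of the model can be mapped back to their string pieces.
loadDecoder :: FilePath -> IO (M.Map Int T.Text)
loadDecoder path = do
  bytes <- BL.readFile path
  case A.eitherDecode bytes :: Either String (M.Map T.Text Int) of
    Left err  -> error ("could not parse vocab json: " ++ err)
    Right voc -> pure (M.fromList [ (i, t) | (t, i) <- M.toList voc ])

-- Decode a sequence of token ids. Note: real GPT-2 decoding also has to undo
-- the byte-level remapping (e.g. "Ġ" stands for a leading space); that step
-- is omitted here.
decode :: M.Map Int T.Text -> [Int] -> T.Text
decode dec = T.concat . map (\i -> M.findWithDefault "<unk>" i dec)
```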
The best references for this are Karpathy's nanoGPT and llm.c, as well as Brendan Bycroft's LLM Visualizer.
There is a handy online interface for the tokenizer.
Download the open-source weights and run the model with:
./download_model.sh
cabal run