Full Changelog: 1.0.8...1.1.0
- v1.1.0 support GPU depolyment with Triton and TensorRT-LLM by @yuekaizhang in #944
- v1.0.10 support custom chat model by @huanglizhuo in #932
- v1.0.9 several fixes by @huanglizhuo @ZhikangNiu in #924 #926 #928
Full Changelog: 1.0.8...1.1.0