v0.4.2
Release date: 2023-03-30 23:10:21
Latest release of huggingface/text-generation-inference: v3.0.1 (2024-12-12 04:13:58)
Features
- benchmark: TUI-based benchmarking tool
- router: clear cache on error
- server: add mypy-protobuf
- server: reduce MLP and attention in one op for Flash NeoX
- image: AWS SageMaker-compatible image
Fix
- server: avoid try/except to determine the kind of AutoModel
- server: fix Flash NeoX rotary embedding