v1.52.9
版本发布时间: 2024-11-16 11:25:06
BerriAI/litellm最新发布版本:v1.56.9(2025-01-04 10:46:47)
What's Changed
- (feat) add bedrock/stability.stable-image-ultra-v1:0 by @ishaan-jaff in https://github.com/BerriAI/litellm/pull/6723
- [Feature]: Stop swallowing up AzureOpenAi exception responses in litellm's implementation for a BadRequestError by @ishaan-jaff in https://github.com/BerriAI/litellm/pull/6745
- [Feature]: json_schema in response support for Anthropic by @ishaan-jaff in https://github.com/BerriAI/litellm/pull/6748
- fix: import audio check by @IamRash-7 in https://github.com/BerriAI/litellm/pull/6740
- (fix) Cost tracking for
vertex_ai/imagen3
by @ishaan-jaff in https://github.com/BerriAI/litellm/pull/6752 - (feat) Vertex AI - add support for fine tuned embedding models by @ishaan-jaff in https://github.com/BerriAI/litellm/pull/6749
- LiteLLM Minor Fixes & Improvements (11/13/2024) by @krrishdholakia in https://github.com/BerriAI/litellm/pull/6729
- feat - add us.llama 3.1 models by @ishaan-jaff in https://github.com/BerriAI/litellm/pull/6760
- (Feat) Add Vertex Model Garden llama 3.1 models by @ishaan-jaff in https://github.com/BerriAI/litellm/pull/6763
- (fix) Fix - don't allow
viewer
roles to create virtual keys by @ishaan-jaff in https://github.com/BerriAI/litellm/pull/6764 - (feat) Use
litellm/
prefix when storing virtual keys in AWS secret manager by @ishaan-jaff in https://github.com/BerriAI/litellm/pull/6765
New Contributors
- @IamRash-7 made their first contribution in https://github.com/BerriAI/litellm/pull/6740
Full Changelog: https://github.com/BerriAI/litellm/compare/v1.52.8...v1.52.9
Docker Run LiteLLM Proxy
docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.52.9
Don't want to maintain your internal proxy? get in touch 🎉
Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat
Load Test LiteLLM Proxy Results
Name | Status | Median Response Time (ms) | Average Response Time (ms) | Requests/s | Failures/s | Request Count | Failure Count | Min Response Time (ms) | Max Response Time (ms) |
---|---|---|---|---|---|---|---|---|---|
/chat/completions | Failed ❌ | 280.0 | 314.28547197285235 | 6.039371468840217 | 0.0 | 1805 | 0 | 226.56484299994872 | 2776.9337409999935 |
Aggregated | Failed ❌ | 280.0 | 314.28547197285235 | 6.039371468840217 | 0.0 | 1805 | 0 | 226.56484299994872 | 2776.9337409999935 |
1、 load_test.html 1.59MB
2、 load_test_stats.csv 540B