Releases: epam/ai-dial-adapter-openai
Releases · epam/ai-dial-adapter-openai
0.19.0
Other
- add integration tests (#185)
- pin poetry to 1.8.5 (#189)
- bump epam/ai-dial-ci from 1.10.0 to 1.10.2 (#187)
- bump virtualenv from 20.24.5 to 20.26.6 (#190)
0.18.0
Features
- introduce application config that simplifies handling of env variables and eases testing (#178)
Fixes
- tolerate tokenizer failures (#183)
- improve error handling (#182)
Other
- bump aidial-sdk from 0.15.0 to 0.16.0; bump protobuf from 5.29.0 to 5.29.1 (#181)
0.17.2
Fixes
- Supported approximate tokenization of tools and functions (#170)
0.17.1
Fixes
- gpt-4o: fixed failing tool/function calling (#175, #176)
- removed
Content-Encoding
header from response to avoid decoding errors from gzip (#174, #176)
- truncated long error messages in the plain-text tokenizer (#172, #176)
Other
- bump aiohttp from 3.10.2 to 3.10.11 (#173, #176)
0.17.0
Features
- supported multi-modal embeddings from Azure AI Vision service (#162)
Fixes
- fixed incorrect calculation of image tokens for GTP-4o mini models (#166)
- bugfix: image tokenizer for chat completion requests ignored
messages[*].content[*].image_url.detail
field (#167, #169)
Other
- bump epam/ai-dial-ci from 1.9.3 to 1.9.4 (#163)
0.16.0
Features
- supported content parts for multi-modal models (GPT-4V and GPT-4o) (#154)
- added Docker healthcheck (#147)
Other
- bump epam/ai-dial-ci from 1.9.1 to 1.9.2 (#159)
- bump epam/ai-dial-ci from 1.9.2 to 1.9.3 (#161)
- bump starlette from 0.36.3 to 0.40.0; fastapi from 0.109.2 to 0.115.2 (#160)
0.15.0
Features
- supported message truncation for multi-modal models (GPT4V and GPT-4o) (#150, #153)
- supported streaming emulation for deployments specified in
NON_STREAMING_DEPLOYMENTS
env var (#157)
- introduced
ELIMINATE_EMPTY_CHOICES
env var as a replacement for FIX_STREAMING_ISSUES_IN_NEW_API_VERSIONS
(#139)
Fixes
- fixed incorrect handling of usage reported by the upstream OpenAI model (#151)
Other
- bumped github workflow versions (#158)
0.14.0
Features
- added
code=truncate_prompt_error
to errors raised by prompt truncation algorithm (#142)
Fixes
- token usage isn't computed if upstream model has itself reported it (#138)
- changed type of
created
field in chat completion chunk from str to int (#141)
- improved display of DIAL links leading to the
public
bucket (#148)
Other
- bump aidial-sdk from 0.8.0 to 0.13.0 (#142)
- bump aiohttp from 3.9.4 to 3.10.2 (#143)
- bump cryptography from 42.0.5 to 43.0.1 (#146)
0.13.2
Fixes
- disabled retries in openai client accidentally introduced in 0.13.0 (#144) (#145)
0.13.1
Other
- bump setuptools from 68.2.2 to 70.0.0 (#134)
- bump epam/ai-dial-ci from 1.8.3 to 1.9.0 (#136)