Skip to content

能不能用ai优化一下性能,总感觉不够流畅。 #7215

Discussion options

You must be logged in to vote

@18651619390 @minemine-m 这里有个 trade off 要看下的:

如果自动总结不阻塞发送消息,那么有可能会出现没有总结完,带的是老的历史总结消息的情况,进而可能会导致效果上受影响。

目前实现的做法是为了保证历史总结消息是最新的,然后这样最新的历史记录总结 + 最近的 N 条消息,构成最完整的上下文。


我的建议是选一个 token 生成飞快的历史消息总结模型(比如 openai gpt4o-mini,甚至 groq 的Deepseek R1 Distill Llama 70B),这样就能兼具效果和性能

Replies: 47 comments

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
8 participants
Converted from issue

This discussion was converted from issue #7158 on March 29, 2025 03:38.