
[Model] Reuse RoPE positions for Deepseek-v2 model #3084

Merged
merged 1 commit into from
Jan 8, 2025


MasterJH5574
Member

This PR updates the Deepseek-v2 model implementation to use the updated RoPE position arrays. Before this PR, the model queried the RoPE positions once in every layer. These arrays are the same for all layers, so they can be reused and a single query is sufficient.
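To illustrate the idea, here is a minimal, self-contained sketch in plain Python. It is not the code from this PR: `KVCacheStub`, `query_positions`, `apply_rope`, and the two `forward_*` functions are hypothetical names invented for the example, and each "layer" is reduced to a bare rotary rotation. It only shows that hoisting the position query out of the layer loop leaves the result unchanged while cutting the number of queries from one per layer to one in total.

```python
import math


class KVCacheStub:
    """Hypothetical stand-in for whatever object owns the position array."""

    def __init__(self, positions):
        self._positions = list(positions)
        self.num_queries = 0  # counts how often positions were requested

    def query_positions(self):
        self.num_queries += 1
        return list(self._positions)


def apply_rope(vec, pos, theta=10000.0):
    """Rotate consecutive pairs of `vec` by position-dependent angles."""
    dim = len(vec)
    out = []
    for i in range(0, dim, 2):
        angle = pos / (theta ** (i / dim))
        c, s = math.cos(angle), math.sin(angle)
        x, y = vec[i], vec[i + 1]
        out.extend([x * c - y * s, x * s + y * c])
    return out


def forward_query_per_layer(cache, hidden, num_layers):
    """Old pattern: every layer asks for the positions again."""
    for _ in range(num_layers):
        positions = cache.query_positions()
        hidden = [apply_rope(h, p) for h, p in zip(hidden, positions)]
    return hidden


def forward_reuse(cache, hidden, num_layers):
    """New pattern: query once, then pass the same array to every layer."""
    positions = cache.query_positions()
    for _ in range(num_layers):
        hidden = [apply_rope(h, p) for h, p in zip(hidden, positions)]
    return hidden


hidden = [[1.0, 0.0, 0.5, -0.5], [0.0, 1.0, 2.0, 1.0]]  # two toy tokens
old_cache = KVCacheStub([3, 4])
new_cache = KVCacheStub([3, 4])
old_out = forward_query_per_layer(old_cache, hidden, num_layers=4)
new_out = forward_reuse(new_cache, hidden, num_layers=4)
```

In this toy setup both functions produce the same hidden states. Only the query count recorded on the stub differs.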

MasterJH5574 merged commit cf7ae82 into mlc-ai:main on Jan 8, 2025
2 checks passed