-
Notifications
You must be signed in to change notification settings - Fork 1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
graph: fix the intermediate data types in SDPA patterns #2894
base: main
Are you sure you want to change the base?
Changes from 1 commit
7a0285f
d32cdd2
79f3e18
d55039e
fdea8ad
abe08ff
80f4f02
df61ffb
76f2cf9
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -52,7 +52,6 @@ Graph operations support bf16 and f16 data types. | |
|
||
A TypeCast operation performing down conversion should be inserted clearly to | ||
indicate the use of low numeric precision. oneDNN Graph implementation fully | ||
honors the API-specified numeric precision and only performs the computation | ||
using the API-specified or higher numeric precision. | ||
honors the API-specified numeric precision. | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Just to make sure we are aligned. This still allows to use f32 values to store f16/bf16 data, as long as we respect roundings to f16/bf16 accuracy, right? There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Yes, in my understanding, it's still allowed for backend implementations. From this perspective, it seems I need to keep the original statement. My intention here was to align the implementations. As the original statement sounds like different backends (eg. DNNL & GC, CPU & GPU) can have different numerical behaviors. |
||
|
||
@img{bf16_programming.jpg,Figure 2: Overview of bf16 programming model.,80%,} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
What does intermediate data type mean when users construct graph? Do we need to describe it in text or picture?