Driver quantize fp8 update #3715
Merged
41 commits
70336db Initial (CharlieL7)
b41c8b6 Merge branch 'develop' of github.com:ROCm/AMDMIGraphX into ocp_to_fnuz (CharlieL7)
bdebeb5 progress (CharlieL7)
a1fb21e cleanup (CharlieL7)
b8e2041 remove unneeded files (CharlieL7)
8366434 Fix bit_cast kernel (CharlieL7)
a15e5a4 Merge branch 'develop' of github.com:ROCm/AMDMIGraphX into bit_cast_op (CharlieL7)
be5d9a0 Merge branch 'develop' of github.com:ROCm/AMDMIGraphX into bit_cast_op (CharlieL7)
3e08ab2 Merge branch 'develop' of github.com:ROCm/AMDMIGraphX into bit_cast_op (CharlieL7)
7b40796 Merge branch 'bit_cast_op' of github.com:ROCm/AMDMIGraphX into bit_ca… (CharlieL7)
697d459 progress (CharlieL7)
4b6c8c1 fix template for gpu bit_cast (CharlieL7)
531150f Merge branch 'develop' into bit_cast_op (CharlieL7)
d53ac35 Merge branch 'bit_cast_op' of github.com:ROCm/AMDMIGraphX into ocp_to… (CharlieL7)
95a3cd7 first implementation (CharlieL7)
98d8760 progress (CharlieL7)
7357367 Merge branch 'develop' of github.com:ROCm/AMDMIGraphX into ocp_to_fnuz (CharlieL7)
e3d84fc Fixes and first test works (CharlieL7)
dac07c2 formatting (CharlieL7)
06b94b8 Added ref tests (CharlieL7)
3e5d3a8 Merge branch 'develop' of github.com:ROCm/AMDMIGraphX into ocp_to_fnuz (CharlieL7)
df0202e Cleanup (CharlieL7)
0318f32 initial (CharlieL7)
3b48242 temporary (CharlieL7)
b373d10 disable simpilify_qdq in quantization_8bits (CharlieL7)
28aab5f revert (CharlieL7)
7e0142f disable extra passes after quantize_8bits (CharlieL7)
0a4d6bf add verify test (CharlieL7)
c94c520 Fix bug with __builtin_nan(string) (CharlieL7)
d025e47 Merge branch 'develop' into ocp_to_fnuz (CharlieL7)
0cddfbf separate quantizable ops (CharlieL7)
3c36b9b Merge branch 'ocp_to_fnuz' of github.com:ROCmSoftwarePlatform/AMDMIGr… (CharlieL7)
b62a304 Merge branch 'ocp_to_fnuz' of github.com:ROCm/AMDMIGraphX into driver… (CharlieL7)
302adac Merge branch 'develop' of github.com:ROCm/AMDMIGraphX into driver_qua… (CharlieL7)
9ba6fe2 Merge branch 'develop' of github.com:ROCm/AMDMIGraphX into driver_qua… (CharlieL7)
1de1d22 Merge branch 'develop' into driver_quantize_fp8_update (TedThemistokleous)
dac22d0 Merge branch 'develop' into driver_quantize_fp8_update (causten)
083a9da Fix the gpu context test (CharlieL7)
39592ae Merge branch 'driver_quantize_fp8_update' of github.com:ROCm/AMDMIGra… (CharlieL7)
1ddf7a1 Merge branch 'develop' of github.com:ROCm/AMDMIGraphX into driver_qua… (CharlieL7)
9fc2e97 licensing update (CharlieL7)
Why is this removed on serialization?
I added this earlier, while getting FP8 OCP support in, so that the gfx number could be queried from the driver. We could keep it, but nothing would use it anymore.