clBLAS-2.6.0 Release for ACL 1.0 Beta 1
This clBLAS release, tagged v2.6, is part of AMD Compute Libraries (ACL) 1.0 beta 1. It is based on a merge from the develop branch to the master branch.
The highlights of the release:
- Introduced offline kernel compilation
- Improved performance (with offline kernel compilation) of:
  - sgemm small matrices NN, TN, NT
  - sgemm large matrices NN, TN, NT
  - zgemm large matrices NT for m, n, k multiples of 32, 64, 8 respectively
  - dtrsm large matrices for m, n multiples of 192
- Incorporated some CMake configuration changes
- Released binaries now include an offline-compiled library for certain devices and drivers.
  - "clBLAS-2.6.0-Windows-x64-Hawaii-14502.zip" is a binary built for the Hawaii device with the 14.502 driver on the Windows platform
  - A binary built for the Hawaii device with the 14.502 driver on the Linux platform will be released once the driver is released
  - (update 08/06/2015) "clBLAS-2.6.0-Linux-x64-Hawaii-14502.tar.gz" is a binary built for the Hawaii device with the 14.502 driver on the Linux platform
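
The size conditions listed in the highlights (zgemm NT with m, n, k multiples of 32, 64, 8, and dtrsm with m, n multiples of 192) can be checked up front in application code. The sketch below is only an illustration: the helper names are hypothetical and not part of the clBLAS API, and which kernel clBLAS actually selects also depends on the device, the driver, and what this release treats as a "large" matrix, none of which is modeled here.

```python
# Hypothetical helpers (not part of clBLAS): they only test the size
# multiples quoted in these release notes.

def is_multiple(value, factor):
    """Return True if value is a positive multiple of factor."""
    return value > 0 and value % factor == 0


def zgemm_nt_sizes_match(m, n, k):
    """zgemm NT note: m, n, k multiples of 32, 64, 8 respectively."""
    return is_multiple(m, 32) and is_multiple(n, 64) and is_multiple(k, 8)


def dtrsm_sizes_match(m, n):
    """dtrsm note: m and n both multiples of 192."""
    return is_multiple(m, 192) and is_multiple(n, 192)
```

For example, `zgemm_nt_sizes_match(64, 128, 16)` holds, while `dtrsm_sizes_match(200, 192)` does not because 200 is not a multiple of 192.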