-
Notifications
You must be signed in to change notification settings - Fork 141
Pull requests: ROCm/composable_kernel
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[CK_TILE] moe sorting ex kernel to support expert > 128
#1840
opened Jan 26, 2025 by
carlushuang
•
Draft
Add make_kernel_pt for specific architecture compilation guards
#1824
opened Jan 17, 2025 by
alugorey
Loading…
7 tasks
Change flag to CK_GFX90A_DENORM_WORKAROUND
#1817
opened Jan 15, 2025 by
darren-amd
Loading…
3 tasks done
Update LICENSE to 2025 (#1797)
ci:docs-only
Skip most non-doc CI for this PR
documentation
Improvements or additions to documentation
#1801
opened Jan 7, 2025 by
spolifroni-amd
Loading…
6 tasks
Cross GPU Reduce Operator Initial Development
#1795
opened Jan 6, 2025 by
ThomasNing
Loading…
4 of 6 tasks
[CK_TILE] Sync fmha fwd splitkv minor optimizations
#1785
opened Jan 1, 2025 by
poyenc
Loading…
2 tasks done
device_prop.hpp: move static map to helper function and initialize there
#1763
opened Dec 18, 2024 by
coconutruben
Loading…
3 of 6 tasks
[Ck tile] Use raw store to improve layernorm performance
#1752
opened Dec 16, 2024 by
rocking5566
Loading…
disable atomicAdd for C Output Vector Length = 1 with 16bit data type
#1737
opened Dec 10, 2024 by
zjing14
Loading…
Apply universal gemm to bwd_weight_cshuffle operator
#1658
opened Nov 12, 2024 by
mozga-amd
Loading…
[do not review] int4 scale based on jzhang's pre work
noCI
Disable testing on supported CI systems: math libraries CI has this feature enabled..
[DO NOT REVIEW] Add int4 dequant with scale only supports based on ZhangJing's PR #1572
noCI
Disable testing on supported CI systems: math libraries CI has this feature enabled..
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.