Skip to content

Pull requests: neuralmagic/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Tpu v1 mgoin
#53 opened Jan 9, 2025 by mgoin Draft
ignore
#51 opened Jan 8, 2025 by afeldman-nm Draft
updated
#50 opened Jan 3, 2025 by robertgshaw2-neuralmagic Loading…
Bump jinja2 from 3.1.4 to 3.1.5 dependencies Pull requests that update a dependency file
#49 opened Dec 27, 2024 by dependabot bot Loading…
Add: Support for Sparse24Bitmask Compressed Models
#47 opened Dec 17, 2024 by rahul-tuli Loading…
1 task
merged
#46 opened Dec 15, 2024 by robertgshaw2-neuralmagic Loading…
Logprobs
#45 opened Dec 15, 2024 by robertgshaw2-neuralmagic Loading…
Proto
#44 opened Dec 12, 2024 by robertgshaw2-neuralmagic Loading…
Cutlass grouped gemm
#42 opened Dec 10, 2024 by ElizaWszola Loading…
[DRAFT] use cutlass for 24
#33 opened Nov 15, 2024 by rahul-tuli Draft
Semi structured v2
#32 opened Nov 13, 2024 by ilmarkov Loading…
Add hf_transfer to testing image
#29 opened Nov 6, 2024 by mgoin Loading…
Hqq support
#21 opened Oct 14, 2024 by ElizaWszola Draft
Update cpu_extension.cmake stale
#12 opened Sep 23, 2024 by ProExpertProg Loading…
test
#7 opened Aug 28, 2024 by robertgshaw2-neuralmagic Draft
ProTip! Mix and match filters to narrow down what you’re looking for.