Skip to content
@efeslab

Efeslab

Efeslab at the University of Washington

Popular repositories Loading

  1. Nanoflow Nanoflow Public

    A throughput-oriented high-performance serving framework for LLMs

    Cuda 713 29

  2. Atom Atom Public

    [MLSys'24] Atom: Low-bit Quantization for Efficient and Accurate LLM Serving

    Cuda 291 25

  3. fiddler fiddler Public

    [ICLR'25] Fast Inference of MoE Models with CPU-GPU Orchestration

    Python 180 16

  4. lapidary lapidary Public

    Creating beautiful gem5 simulations

    C++ 47 14

  5. DMon-AE DMon-AE Public

    DMon Prototype for OSDI 2021 Artifact Evaluation

    C++ 21 1

  6. optimus-hypervisor optimus-hypervisor Public

    18

Repositories

Showing 10 of 97 repositories
  • vllm Public Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    efeslab/vllm’s past year of commit activity
    Python 0 Apache-2.0 5,402 0 0 Updated Jan 24, 2025
  • genmc Public Forked from MPI-SWS/genmc

    Generic model checker for concurrent C programs (mirror repository)

    efeslab/genmc’s past year of commit activity
    C++ 0 GPL-3.0 21 0 0 Updated Jan 19, 2025
  • wiredtiger Public Forked from wiredtiger/wiredtiger

    WiredTiger's source tree

    efeslab/wiredtiger’s past year of commit activity
    C 0 398 0 0 Updated Jan 8, 2025
  • rocksdb-squint Public Forked from facebook/rocksdb

    A library that provides an embeddable, persistent key-value store for fast storage.

    efeslab/rocksdb-squint’s past year of commit activity
    C++ 0 GPL-2.0 6,545 0 0 Updated Jan 7, 2025
  • alice Public Forked from madthanu/alice
    efeslab/alice’s past year of commit activity
    C 0 26 0 0 Updated Jan 6, 2025
  • leveldb Public Forked from google/leveldb

    LevelDB is a fast key-value storage library written at Google that provides an ordered mapping from string keys to string values.

    efeslab/leveldb’s past year of commit activity
    C++ 0 BSD-3-Clause 8,108 0 0 Updated Dec 28, 2024
  • efeslab/UopRepl-Artifact’s past year of commit activity
    C++ 0 Apache-2.0 0 0 0 Updated Nov 30, 2024
  • fiddler Public

    [ICLR'25] Fast Inference of MoE Models with CPU-GPU Orchestration

    efeslab/fiddler’s past year of commit activity
    Python 180 Apache-2.0 16 1 0 Updated Nov 18, 2024
  • Nanoflow Public

    A throughput-oriented high-performance serving framework for LLMs

    efeslab/Nanoflow’s past year of commit activity
    Cuda 713 Apache-2.0 29 8 2 Updated Sep 21, 2024
  • ispy-ripple Public
    efeslab/ispy-ripple’s past year of commit activity
    C++ 7 Apache-2.0 2 0 0 Updated Jul 4, 2024

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…