Skip to content

Latest commit

 

History

History
115 lines (79 loc) · 11.4 KB

mlops-tools.md

File metadata and controls

115 lines (79 loc) · 11.4 KB

MLOps Tools

I avoided discussing tools in the book because tooling is ephemeral -- all software will eventually become legacy software. However, I did mention a few tools in my book when they were useful to illustrate a point. Several early readers told me that the book helped them discover several useful tools, and asked me to recommend more, so here is my attempt.

In this doc, I’ll start with long lists of tools other people have compiled. Then we’ll go over a short list of open-source tools that I think are pretty cool. I exclude some of the already well known tools in the data science community to keep the list short. Feel free to open an issue or submit a PR to recommend a new tool. Thank you!

We also have a channel (#tools-watch) on our MLOps Discord to discuss interesting tools.

Long lists of MLOps tools

I get it -- there are hundreds of MLOps tools out there. There have been multiple long lists of MLOps tools, each with hundreds so I won’t create another list here. However, if you want a long list of tools, here are some of the more popular resources.

Notes

  • Many of these lists were created by vendor-sponsored communities. There’s nothing wrong with taking sponsorships -- communities have operational expenses too -- and being able to get sponsorships is impressive for a community. However, sometimes, there might be conflicts of interests between vendor-sponsorships and vendor-neutral reviews.
  • Many of these lists divide tools into categories and evaluate them by features. However, the landscape is still evolving quickly, so categories are not yet well-defined. At the same time, best practices are still converging, so it can be hard for companies to delineate what features they need.
  1. MLOps Community’s Learn: currently, it consists of 3 categories (Feature Store, Monitoring, Metadata Storage & Management -- which includes Model Store) with one in progress (Deploy). You can add 2 tools to compare them side-by-side.
  2. TwiML Solutions Guide: Tools are grouped into categories with features supported and where they can be deployed.
  3. StackShare: You can browse different devtools, see what companies are using them, see jobs that mention them, check out alternatives, even read reviews. You can also see the tech stacks of popular tech companies. You can also compare tools side by side. In my opinion, StackShare is what many MLOps solution guides aspire to become, but StackShare is general for all tech stacks, not just MLOps.
  4. AI Infrastructure Landscape: a membership-based foundation that focuses on AI infrastructure (more general than LFAI & Data, which focuses on open-source). They do a lot of comprehensive research and analysis on MLOps tooling.
  5. Matt Turck’s annual Machine Learning, AI and Data (MAD) Landscape: with high-level analysis from an investor’s perspective.
  6. Leigh Marie Braswell’s Startup Opportunities in Machine Learning Infrastructure: analysis of the gaps in MLOps landscape from another investor’s perspective.
  7. Yours truly’s MLOps Landscape. It comes with a Google Sheets list of 284 tools (last updated in December 2020) and interactive visualization. I stopped maintaining the list because it takes a ton of time and it’s not like no one else is doing it.

Short lists of MLOps open-source tools

I star cool open source tools whenever I come across them. You can check out the full list here.

Disclaimer: this is not supposed to be a comprehensive list.

Pandas alternatives

Table formats like Pandas DataFrame are great for data manipulation. However, Pandas is slow, quirky, and doesn’t leverage GPUs. Naturally, there have been many projects aiming to fix these problems. Here are a few of them.

  • modin: a drop-in replacement of pandas (using import modin.pandas as pd instead of import pandas as pd). Ponder is working on the enterprise version built on top of it.
  • dask and cuDF: both projects were at some point maintained by the RAPIDS AI team at NVIDIA, until a core developer of dask left NVIDIA to start Coiled, which provides the enterprise version built on top of dask.
  • Polars: built with Rust on top of Apache Arrow, Polars promises to be fast!

H2O did a fun database-like ops benchmark (though I hope they’ll consider another color scheme for their graphs).

alt_text

Data and features

Interpretability and fairness

Model development and evaluation

Use case specific frameworks

Just me nerding out on those cool use cases -- don’t mind me.

Dev environment

DevOps for MLOps