Software Engineering

Newsletter Edition #7 - Notebooks in Production: The eternal MLOps debate

Hamza Tahir
Aug 30, 2024
3 mins

Hi everyone!

August turned out to be an unexpectedly busy month, far from the slow summer pace I anticipated. Let me catch you up on what's been happening!

First, thank you to everyone who responded to my question about ZenML's core value proposition. Your diverse and nuanced replies were incredibly helpful in understanding how our community perceives the framework. Internally, we've spent considerable time this month revisiting our fundamentals and examining current MLOps practices, especially on hyperscalers.

Does this resonate with you? We've heard countless stories like:

  • "We set up Kubeflow in-house, but data scientists aren't using it."
  • "Infrastructure mandates AWS, but our ML teams dislike Sagemaker."
  • "GCP Vertex AI is overly complex with long spin-up times - we've reverted to hosted notebooks."
  • "Our internal K8s cluster is great, but data scientists struggle with kubectl."
  • "AzureML and Databricks are too costly and restrictive in coding approach."

If these scenarios sound familiar, ZenML might be just what you need. Give our new quickstart a try to see how we can help!

Now, let's dive into this month's updates...

🥂Dishing out new features, together in Istanbul

This month the entire ZenML team got together at a central meeting point in the beautiful and historical city of Istanbul. As a remote team, this was a moment where a lot of us met for the first time in the real world! We got to share many laughs, memories, and brainstormings about the future of MLOps, LLMOps, and the place ZenML takes in it!

I know this is a newsletter about MLOps, but I couldn't help but share this moment with everyone here. At least, you can now put some faces to the people behind the product!

📓ZenML and Notebooks: A love-hate relationship (or is it?)

ZenML has historically been used outside of notebooks - as it is usually in the phase of development where things need to be put into a pipeline, therefore after that fast, iterative notebook phase is over. There have been endless debates of how to bridge that gap between these two phases, and while I don't think ZenML will ever advocate for deploying notebooks in production, we now have something that is a great compromise for this eternal discussion.

From 0.64.0 onwards, you are now able to run pipelines (and even single steps) directly from a notebook on to your production orchestrator. This is really a big change for ZenML, and I believe will spur a new usage pattern we have not quite seen before. I expect many users to iterate and deploy their ZenML workloads directly from Jupyter now. Will this put the eternal notebook debate to rest? We'll have to wait and see!

Run an example

💻 A New Onboarding Experience for ZenML

So, remember how I mentioned trying out our new quickstart? Well, there's a bit more to that story. Our great OSS contributer team (shoutout to Tess and Jonathan!) have cooked up this nifty VSCode extension that basically gives you a guided tour of ZenML right in your IDE. No more jumping between docs and your code editor - it's all right there in one place.

Curious to try it out? Just head over to our GitHub repo, hit that 'Code' button, and spin up a new Codespace. Easy!

Try it Out!

🫱🏾‍🫲🏼 Meet me (Hamza) at AI Tinkerers Paris!

I'll be giving a talk next week at the AI Tinkerers event in Paris, featuring Mistral. It's on Tuesday, September 3rd, 2024. The talk is about scaling GenAI systems - specifically, how to build and manage a system that can handle thousands of personalized AI models. I'll be using a demo that showcases mass AI personalization using Flux.1 model finetunings with DreamBooth.

If you're in Paris next week, come check it out and hit me up! If not, I'll share some key takeaways in the next newsletter.

Hit me up!

😎 The rest...

  • MLOps, why it matters, and how to implement ot [Blog]
  • We have a brand-new quickstart. Experienced users should also try [Colab]
  • Bridge the gap between Composer and Vertex AI pipelines on GCP [Tutorial]
  • ZenML now has an AzureML Pipelines orchestrator [Docs]
  • 🤗Accelerate + ZenML: Multi-node, multi-gpu training [Docs]
  • Latest versions of ZenML have a smarter way of reusing docker builds [Notes]
  • Intricacies of implementing DAG visualizations in VS Code [Blog]
  • ZenML vs. Apache Airflow: A Comparative Analysis for MLOps [Tutorial]

Cheers,

Hamza

Co-founder, ZenML

Looking to Get Ahead in MLOps & LLMOps?

Subscribe to the ZenML newsletter and receive regular product updates, tutorials, examples, and more articles like this one.
We care about your data in our privacy policy.