Product
DATA SCience
Iterate at warp speed
Accelerate your ML workflow seamlessly
Auto-track everything
Automatic logging and versioning
Shared ML building blocks
Boost team productivity with reusable components
Infrastructure
Backend flexibility, zero lock-in
One framework for all your MLOps and LLMOps needs
Limitless scaling
Effortlessly deploy across clouds
Streamline cloud expenses
Gain clarity on resource usage and costs
Organization
ZenML Pro
Our managed control plane for MLOps
ZenML vs Other Tools
Compare ZenML to other ML tools
Integrations
50+ integrations to ease your workflow
Solutions
GENAI & LLMS
Finetuning LLMs
Customize large language models for specific tasks
Productionalizing a RAG application
Deploy and scale RAG systems
LLMOps Database
A curated knowledge base of real-world implementations
mlops
Abstract cloud compute
Simplify management of cloud-based ML resources
Automate workflows
Streamline and optimize ML pipelines
Track metrics and metadata
Monitor and analyze ML model performance and data
Success Stories
Adeo Leroy Merlin
Retail
Brevo
Email Marketing
Developers
Documentation
Docs
Comprehensive guides to use ZenML
Deploying ZenML
Understanding ZenML system architecture
Tutorials
Comprehensive guides to use ZenML
GUIDES
Quickstart
Quickly get your hands dirty
Showcase
Projects of ML use cases built with ZenML
Starter Guide
Get started with the basics
COMMUNITY
Slack
Join our Slack Community
Changelog
Discover what’s new on ZenML
Roadmap
Join us on our MLOps journey
Pricing
Blog
Log in
Sign In
Start Free
LLMOps Database
A curated knowledge base of real-world LLMOps implementations, with detailed summaries and technical notes.
Submit your case study.
Filters
Clear
Search filter
Clear
Filter by tag
Clear
amazon_aws
anthropic
api_gateway
argilla
aws
cache
caption_generation
chatbot
chromadb
chunking
cicd
circleci
classification
cloudflare
code_generation
code_interpretation
cohere
compliance
content_moderation
continuous_deployment
continuous_integration
cost_optimization
crewai
customer_support
data_analysis
data_cleaning
data_integration
databases
databricks
devops
docker
document_processing
documentation
elasticsearch
embeddings
error_handling
fallback_strategies
fastapi
few_shot
fine_tuning
fraud_detection
google
google_gcp
guardrails
healthcare
high_stakes_application
hugging_face
human_in_the_loop
instruction_tuning
internet_of_things
knowledge_distillation
kubernetes
langchain
latency_optimization
legacy_system_integration
llama_index
load_balancing
meta
microservices
microsoft
microsoft_azure
mistral
model_optimization
monitoring
multi_agent_systems
multi_modality
nvidia
open_source
openai
orchestration
pinecone
poc
postgresql
prompt_engineering
pytorch
qdrant
question_answering
rag
realtime_application
redis
regulatory_compliance
reliability
reranking
scalability
scaling
security
semantic_search
serverless
spacy
speech_recognition
sqlite
structured_output
summarization
system_prompts
tensorflow
token_optimization
translation
triton
unstructured_data
vector_search
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.
Looking to Get Ahead in LLMOps?
Subscribe to the ZenML newsletter and receive regular product updates, tutorials, examples, and more.
Tag
Showing
0
of
0
results
Microsoft
LLMs for Cloud Incident Management and Root Cause Analysis
Tech
2023
Microsoft
Building a Production-Ready Business Analytics Assistant with ChatGPT
Tech
2023
Microsoft
Building Analytics Applications with LLMs for E-commerce Review Analysis
E-commerce
2023
Microsoft
Real-time Question-Answering System with Two-Stage LLM Architecture for Sales Content Recommendations
Tech
2024
Microsoft
Best Practices for AI Agent Development and Deployment
Tech
2023
Microsoft
Lessons from Enterprise LLM Deployment: Cross-functional Teams, Experimentation, and Security
Tech
2024
Moonhub
Best Practices for Implementing LLMs in High-Stakes Applications
Healthcare
2023
Morgan Stanley
Enterprise Knowledge Management with LLMs: Morgan Stanley's GPT-4 Implementation
Finance
2024
MosaicML
Training and Deploying MPT: Lessons Learned in Large Scale LLM Development
Tech
2023
Myaskai
Network Security Block Page Implementation
Tech
2024
NICE
Natural Language to SQL System with Production Safeguards for Contact Center Analytics
Telecommunications
2024
NICE Actimize
Generative AI Integration in Financial Crime Detection Platform
Finance
2024
NICE Actimize
Leveraging Vector Embeddings for Financial Fraud Detection
Finance
2024
NTT Data
GenAI-Powered Work Order Management System POC
Other
2024
NVIDIA
Security Learnings from LLM Production Deployments
Tech
2023
Neeva
Overcoming LLM Production Deployment Challenges
Tech
2023
New Computer
Enhancing Memory Retrieval Systems Using LangSmith Testing and Evaluation
Tech
2024
Nextdoor
Improving Email Engagement Using Generative AI with Rejection Sampling
Tech
2023
Nextdoor
Optimizing Email Engagement Using LLMs and Rejection Sampling
Tech
2023
Notion
Scaling Data Infrastructure for AI Features and RAG
Tech
2024
Numbers Station
Integrating Foundation Models into the Modern Data Stack: Challenges and Solutions
Tech
2023
Numbers Station
Building Production-Ready SQL and Charting Agents with RAG Integration
Tech
Nvidia
Automated CVE Analysis and Remediation Using Event-Driven RAG and AI Agents
Tech
2024
OLX
Automating Job Role Extraction Using Prosus AI Assistant in Production
E-commerce
2024
OLX
Building a Conversational Shopping Assistant with Multi-Modal Search and Agent Architecture
E-commerce
2023
ONE
From SMS to AI: Lessons from 5 Years of Chatbot Development for Social Impact
Other
2024
Outerbounds / AWS
AWS Trainium & Metaflow: Democratizing Large-Scale ML Training Through Infrastructure Evolution
Tech
2024
PagerDuty
Rapid Development and Deployment of Enterprise LLM Features Through Centralized LLM Service Architecture
Tech
2023
Paradigm
Scaling Parallel Agent Operations with LangChain and LangSmith Monitoring
Tech
2024
Paramount+
Video Content Summarization and Metadata Enrichment for Streaming Platform
Media & Entertainment
2023
Parcha
Building Production-Grade AI Agents with Distributed Architecture and Error Recovery
Finance
2023
Parcha
Building Production-Ready AI Agents for Enterprise Operations
Finance
2023
Parlance Labs
Practical LLM Deployment: From Evaluation to Fine-tuning
Consulting
2023
Perplexity
Building a Complex AI Answer Engine with Multi-Step Reasoning
Tech
2024
Perplexity
Building a Production-Grade LLM Orchestration System for Conversational Search
Tech
2023
Perplexity AI
Scaling an AI-Powered Search and Research Assistant from Prototype to Production
Tech
2023
Picnic
Enhancing E-commerce Search with LLM-Powered Semantic Retrieval
E-commerce
2024
Podium
Optimizing Agent Behavior and Support Operations with LangSmith Testing and Observability
Tech
2024
Prem AI
Optimizing Production Vision Pipelines for Planet Image Generation
Tech
2024
Prosus
Plus One: Internal LLM Platform for Cross-Company AI Adoption
Tech
2023
Prosus
Agent-Based AI Assistants for Enterprise and E-commerce Applications
E-commerce
2024
Q4
SQL Generation and RAG for Financial Data Q&A Chatbot
Finance
2023
Qatar Computing Research Institute
T-RAG: Tree-Based RAG Architecture for Question Answering Over Organizational Documents
Research & Academia
2024
QuantumBlack
Data Engineering Challenges and Best Practices in LLM Production
Consulting
2023
QuantumBlack
LLM Applications in Drug Discovery and Call Center Analytics
Healthcare
2023
Rakuten
Building Enterprise-Scale AI Applications with LangChain and LangSmith
E-commerce
2024
Ramp
AI-Powered Tour Guide for Financial Platform Navigation
Finance
2024
Rasgo
Production Lessons from Building and Deploying AI Agents
Tech
2024
RealChar
Building a Production-Ready AI Phone Call Assistant with Multi-Modal Processing
Tech
2023
Renovai
Building Production-Ready LLM Agents with State Management and Workflow Engineering
Tech
2023
Replit
Building Production-Ready LLMs for Automated Code Repair: A Scalable IDE Integration Case Study
Tech
2024
Replit
Optimizing LLM Server Startup Times for Preemptable GPU Infrastructure
Tech
2023
Replit
Building Reliable AI Agents for Application Development with Multi-Agent Architecture
Tech
2024
Replit
Building a Production-Ready Multi-Agent Coding Assistant
Tech
2023
Replit
Advanced Agent Monitoring and Debugging with LangSmith Integration
Tech
2024
Replit
Building and Scaling Production Code Agents: Lessons from Replit
Tech
2023
Rexera
Evolving Quality Control AI Agents with LangGraph
Tech
2024
Runway
Multimodal Feature Stores and Research-Engineering Collaboration
Media & Entertainment
2024
Salesforce
Enterprise-Scale LLM Integration into CRM Platform
Tech
2023
Salesforce
Large-Scale Enterprise Copilot Deployment: Lessons from Einstein Copilot Implementation
Tech
2024
Previous
Next
amazon_aws
,
anthropic
,
api_gateway
,
argilla
,
aws
,
cache
,
caption_generation
,
chatbot
,
chromadb
,
chunking
,
cicd
,
circleci
,
classification
,
cloudflare
,
code_generation
,
code_interpretation
,
cohere
,
compliance
,
content_moderation
,
continuous_deployment
,
continuous_integration
,
cost_optimization
,
crewai
,
customer_support
,
data_analysis
,
data_cleaning
,
data_integration
,
databases
,
databricks
,
devops
,
docker
,
document_processing
,
documentation
,
elasticsearch
,
embeddings
,
error_handling
,
fallback_strategies
,
fastapi
,
few_shot
,
fine_tuning
,
fraud_detection
,
google
,
google_gcp
,
guardrails
,
healthcare
,
high_stakes_application
,
hugging_face
,
human_in_the_loop
,
instruction_tuning
,
internet_of_things
,
knowledge_distillation
,
kubernetes
,
langchain
,
latency_optimization
,
legacy_system_integration
,
llama_index
,
load_balancing
,
meta
,
microservices
,
microsoft
,
microsoft_azure
,
mistral
,
model_optimization
,
monitoring
,
multi_agent_systems
,
multi_modality
,
nvidia
,
open_source
,
openai
,
orchestration
,
pinecone
,
poc
,
postgresql
,
prompt_engineering
,
pytorch
,
qdrant
,
question_answering
,
rag
,
realtime_application
,
redis
,
regulatory_compliance
,
reliability
,
reranking
,
scalability
,
scaling
,
security
,
semantic_search
,
serverless
,
spacy
,
speech_recognition
,
sqlite
,
structured_output
,
summarization
,
system_prompts
,
tensorflow
,
token_optimization
,
translation
,
triton
,
unstructured_data
,
vector_search
,
Start Your Free Trial Now
No new paradigms - Bring your own tools and infrastructure
No data leaves your servers, we only track metadata
Free trial included - no strings attached, cancel anytime
Try Free
Book a Demo