LLMOps Database
A curated knowledge base of real-world LLMOps implementations, with detailed summaries and technical notes.
Filter by tag
amazon_aws
anthropic
api_gateway
argilla
aws
cache
caption_generation
chatbot
chromadb
chunking
cicd
circleci
classification
cloudflare
code_generation
code_interpretation
cohere
compliance
content_moderation
continuous_deployment
continuous_integration
cost_optimization
crewai
customer_support
data_analysis
data_cleaning
data_integration
databases
databricks
devops
docker
document_processing
documentation
elasticsearch
embeddings
error_handling
fallback_strategies
fastapi
few_shot
fine_tuning
fraud_detection
google
google_gcp
guardrails
healthcare
high_stakes_application
hugging_face
human_in_the_loop
instruction_tuning
internet_of_things
knowledge_distillation
kubernetes
langchain
latency_optimization
legacy_system_integration
llama_index
load_balancing
meta
microservices
microsoft
microsoft_azure
mistral
model_optimization
monitoring
multi_agent_systems
multi_modality
nvidia
open_source
openai
orchestration
pinecone
poc
postgresql
prompt_engineering
pytorch
qdrant
question_answering
rag
realtime_application
redis
regulatory_compliance
reliability
reranking
scalability
scaling
security
semantic_search
serverless
spacy
speech_recognition
sqlite
structured_output
summarization
system_prompts
tensorflow
token_optimization
translation
triton
unstructured_data
vector_search
A2I
Multilingual Document Processing Pipeline with Human-in-the-Loop Validation
Tech
2024
AWS GenAIIC
Building Production-Grade Heterogeneous RAG Systems
Tech
2024
AWS GenAIIC
Optimizing RAG Systems: Lessons from Production
Tech
2024
Accenture
Implementing Generative AI in Manufacturing: A Multi-Use Case Study
Tech
2023
Accenture
Enterprise Knowledge Base Assistant Using Multi-Model GenAI Architecture
Healthcare
2023
Activeloop
Enterprise-Grade Memory Agents for Patent Processing with Deep Lake
Legal
2023
Adyen
Smart Ticket Routing and Support Agent Copilot using LLMs
Finance
2023
Agmatix
Generative AI Assistant for Agricultural Field Trial Analysis
Other
2024
Airbnb
Evolving a Conversational AI Platform for Production LLM Applications
Tech
2024
Airbnb
LLM Integration for Customer Support Automation and Enhancement
Tech
2022
Airtrain
Cost Reduction Through Fine-tuning: Healthcare Chatbot and E-commerce Product Classification
Healthcare
2024
Alaska Airlines
AI-Powered Natural Language Flight Search Implementation
Tech
2024
Allianz
AI-Powered Insurance Claims Chatbot with Continuous Feedback Loop
Insurance
2023
Amazon
HIPAA-Compliant LLM-Based Chatbot for Pharmacy Customer Service
Healthcare
2023
Amazon
Building a Commonsense Knowledge Graph for E-commerce Product Recommendations
E-commerce
2024
Amazon (Alexa)
Managing Model Updates and Robustness in Production Voice Assistants
Tech
2023
Amazon Finance
Scaling RAG Accuracy from 49% to 86% in Finance Q&A Assistant
Finance
2024
AngelList
LLM-Powered Investment Document Analysis and Processing
Finance
2023
Anzen
Using LLMs to Scale Insurance Operations at a Small Company
Insurance
2023
Anzen
Building Robust Legal Document Processing Applications with LLMs
Insurance
2023
Applaud
Lessons from Deploying an HR-Aware AI Assistant: Five Key Implementation Insights
HR
2024
Arcade AI
Building a Tool Calling Platform for LLM Agents
Tech
2024
Assembled
Automating Test Generation with LLMs at Scale
Tech
2023
Athena Intelligence
Optimizing Research Report Generation with LangChain Stack and LLM Observability
Tech
2024
Austrian Post Group
LLM-Based Agents for User Story Quality Enhancement in Agile Development
Government
2024
BNY Mellon
Enterprise-Wide Virtual Assistant for Employee Knowledge Access
Finance
2024
Babbel
Building an AI-Assisted Content Creation Platform for Language Learning
Education
2023
Bainbridge Capital
Deploying LLM-Based Recommendation Systems in Private Equity
Finance
2024
Barclays
MLOps Evolution and LLM Integration at a Major Bank
Finance
2024
Barclays
Enterprise Challenges and Opportunities in Large-Scale LLM Deployment
Tech
2024
BenchSci
Domain-Specific LLMs for Drug Discovery Biomarker Identification
Healthcare
2023
Bito
Multi-Model LLM Orchestration with Rate Limit Management
Tech
2023
Blueprint AI
Automated Software Development Insights and Communication Platform
Tech
2023
Bosch
Enterprise-Wide Generative AI Implementation for Marketing Content Generation and Translation
Tech
2023
Bud Financial / Scotts Miracle-Gro
Building Personalized Financial and Gardening Experiences with LLMs
Finance
2024
Build Great AI
LLM-Powered 3D Model Generation for 3D Printing
Tech
2024
BuzzFeed
Production-Ready LLM Integration Using Retrieval-Augmented Generation and Custom ReAct Implementation
Media & Entertainment
2023
Cambrium
LLMs and Protein Engineering: Building a Sustainable Materials Platform
Tech
2023
Campfire AI
Four Critical Lessons from Building 50+ Global Chatbots: A Practitioner's Guide to Real-World Implementation
Tech
2024
Canva
Automating Post Incident Review Summaries with GPT-4
Tech
2023
Canva
Systematic LLM Evaluation Framework for Content Generation
Tech
2023
Canva
LLM Feature Extraction for Content Categorization and Search Query Understanding
Tech
2023
Cesar
Practical Implementation of LLMs for Automated Test Case Generation
Research & Academia
2023
Chaos Labs
Multi-Agent System for Prediction Market Resolution Using LangChain and LangGraph
Finance
2024
Checkr
Streamlining Background Check Classification with Fine-tuned Small Language Models
HR
2024
CircleCI
AI Error Summarizer Implementation: A Tiger Team Approach
Tech
2023
CircleCI
Building and Testing Production AI Applications at CircleCI
Tech
2023
Cisco
Enterprise LLMOps: Development, Operations and Security Framework
Tech
2023
Clari
Real-time Data Streaming Architecture for AI Customer Support
Other
2023
Cleric AI
AI-Powered SRE Agent for Production Infrastructure Management
Tech
2023
Clipping
Building an AI Tutor with Enhanced LLM Accuracy Through Knowledge Base Integration
Education
2023
CoActive AI
Scaling AI Systems for Unstructured Data Processing: Logical Data Models and Embedding Optimization
Tech
2023
Convirza
Multi-LoRA Serving for Agent Performance Analysis at Scale
Tech
2024
Couchbase
Vector Search and RAG Implementation for Enhanced User Search Experience
Finance
2023
Coval
Agent Testing and Evaluation Using Autonomous Vehicle Simulation Principles
Tech
2023
Cox 2M
Integrating Gemini for Natural Language Analytics in IoT Fleet Management
Tech
2024
Credal
Lessons from Building a Production RAG System: Data Formatting and Prompt Engineering
Tech
2023
Credal
Enterprise AI Adoption Journey: From Experimentation to Core Operations
Tech
2023
DXC
LLM-Powered Multi-Tool Architecture for Oil & Gas Data Exploration
Energy
2024
Danswer
Scaling Enterprise RAG with Advanced Vector Search Migration
Tech
2024