Product
DATA SCience
Iterate at warp speed
Accelerate your ML workflow seamlessly
Auto-track everything
Automatic logging and versioning
Shared ML building blocks
Boost team productivity with reusable components
Infrastructure
Backend flexibility, zero lock-in
One framework for all your MLOps and LLMOps needs
Limitless scaling
Effortlessly deploy across clouds
Streamline cloud expenses
Gain clarity on resource usage and costs
Organization
ZenML Pro
Our managed control plane for MLOps
ZenML vs Other Tools
Compare ZenML to other ML tools
Integrations
50+ integrations to ease your workflow
Solutions
GENAI & LLMS
Finetuning LLMs
Customize large language models for specific tasks
Productionalizing a RAG application
Deploy and scale RAG systems
LLMOps Database
A curated knowledge base of real-world implementations
mlops
Abstract cloud compute
Simplify management of cloud-based ML resources
Automate workflows
Streamline and optimize ML pipelines
Track metrics and metadata
Monitor and analyze ML model performance and data
Success Stories
Adeo Leroy Merlin
Retail
Brevo
Email Marketing
Developers
Documentation
Docs
Comprehensive guides to use ZenML
Deploying ZenML
Understanding ZenML system architecture
Tutorials
Comprehensive guides to use ZenML
GUIDES
Quickstart
Quickly get your hands dirty
Showcase
Projects of ML use cases built with ZenML
Starter Guide
Get started with the basics
COMMUNITY
Slack
Join our Slack Community
Changelog
Discover what’s new on ZenML
Roadmap
Join us on our MLOps journey
Pricing
Blog
Log in
Sign In
Start Free
LLMOps Database
A curated knowledge base of real-world LLMOps implementations, with detailed summaries and technical notes.
Submit your case study.
Filters
Clear
Search filter
Clear
Filter by tag
Clear
amazon_aws
anthropic
api_gateway
argilla
aws
cache
caption_generation
chatbot
chromadb
chunking
cicd
circleci
classification
cloudflare
code_generation
code_interpretation
cohere
compliance
content_moderation
continuous_deployment
continuous_integration
cost_optimization
crewai
customer_support
data_analysis
data_cleaning
data_integration
databases
databricks
devops
docker
document_processing
documentation
elasticsearch
embeddings
error_handling
fallback_strategies
fastapi
few_shot
fine_tuning
fraud_detection
google
google_gcp
guardrails
healthcare
high_stakes_application
hugging_face
human_in_the_loop
instruction_tuning
internet_of_things
knowledge_distillation
kubernetes
langchain
latency_optimization
legacy_system_integration
llama_index
load_balancing
meta
microservices
microsoft
microsoft_azure
mistral
model_optimization
monitoring
multi_agent_systems
multi_modality
nvidia
open_source
openai
orchestration
pinecone
poc
postgresql
prompt_engineering
pytorch
qdrant
question_answering
rag
realtime_application
redis
regulatory_compliance
reliability
reranking
scalability
scaling
security
semantic_search
serverless
spacy
speech_recognition
sqlite
structured_output
summarization
system_prompts
tensorflow
token_optimization
translation
triton
unstructured_data
vector_search
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.
Looking to Get Ahead in LLMOps?
Subscribe to the ZenML newsletter and receive regular product updates, tutorials, examples, and more.
Tag
Showing
0
of
0
results
Databricks
Building a Custom LLM for Automated Documentation Generation
Tech
2023
Dataworkz
RAG-Powered Customer Service Call Center Analytics
Insurance
2024
Deepgram
Domain-Specific Small Language Models for Call Center Intelligence
Telecommunications
2023
Deepgram
Building Production-Ready Conversational AI Voice Agents: Latency, Voice Quality, and Integration Challenges
Tech
2024
Defense Innovation Unit
Dark Vessel Detection System Using SAR Imagery and ML
Government
2023
Delivery Hero
Semantic Product Matching Using Retrieval-Rerank Architecture
E-commerce
2024
Devin Kearns
Building Production AI Agents with Vector Databases and Automated Data Collection
Consulting
2023
Digits
Production-Ready Question Generation System Using Fine-Tuned T5 Models
Finance
2023
Discord
Building and Scaling LLM Applications at Discord
Tech
2024
DoorDash
Generative AI Contact Center Solution with Amazon Bedrock and Claude
E-commerce
2024
Doordash
Building an Enterprise LLMOps Stack: Lessons from Doordash
E-commerce
2023
Doordash
LLM-Based Dasher Support Automation with RAG and Quality Controls
E-commerce
2024
Doordash
Building a Product Knowledge Graph Using LLMs for Attribute Extraction and Catalog Management
E-commerce
2024
Doordash
Strategic Framework for Generative AI Implementation in Food Delivery Platform
E-commerce
2023
Doordash
Building a High-Quality RAG-based Support System with LLM Guardrails and Quality Monitoring
E-commerce
2024
Doordash
Scaling LLMs for Product Knowledge and Search in E-commerce
E-commerce
2024
Doordash
LLMs for Enhanced Search Retrieval and Query Understanding
E-commerce
2024
Dropbox
Building a Silicon Brain for Universal Enterprise Search
Tech
2024
Dropbox
Control Character-Based Prompt Injection Attack Discovery in ChatGPT
Tech
2023
Dropbox
Scaling AI-Powered File Understanding with Efficient Embedding and LLM Architecture
Tech
2024
Dropbox
LLM Security: Discovering and Mitigating Repeated Token Attacks in Production Models
Tech
2024
Dropbox
Detecting and Mitigating Prompt Injection via Control Characters in ChatGPT
Tech
2023
Duolingo
AI-Powered Lesson Generation System for Language Learning
Education
2023
Duolingo
GitHub Copilot Integration for Enhanced Developer Productivity
Education
2024
Dust.tt
Building a Horizontal Enterprise Agent Platform with Infrastructure-First Approach
Tech
2024
ElevenLabs
Scaling Voice AI with GPU-Accelerated Infrastructure
Media & Entertainment
2024
Ellipsis
Building and Operating Production LLM Agents: Lessons from the Trenches
Tech
2023
Emergent Methods
Production-Scale RAG System for Real-Time News Processing and Analysis
Media & Entertainment
2023
Faber Labs
Building Goal-Oriented Retrieval Agents for Low-Latency Recommendations at Scale
E-commerce
2024
Factory
LangSmith Integration for Automated Feedback and Improved Iteration in SDLC
Tech
2024
Factory.ai
Building Reliable Agentic Systems in Production
Tech
Faire
Fine-tuning and Scaling LLMs for Search Relevance Prediction
E-commerce
2024
Farfetch
Scaling Recommender Systems with Vector Database Infrastructure
E-commerce
2024
Farfetch
Multimodal Search and Conversational AI for Fashion E-commerce Catalog
E-commerce
2023
FeedYou
Production Intent Recognition System for Enterprise Chatbots
Tech
2023
Fiddler
Building a RAG-Based Documentation Chatbot: Lessons from Fiddler's LLMOps Journey
Tech
2023
First Orion
Leveraging Amazon Q for Integrated Cloud Operations Data Access and Automation
Telecommunications
2024
Five Sigma
Legacy PDF Document Processing with LLM
Tech
2024
Fuzzy Labs
Scaling Self-Hosted LLMs with GPU Optimization and Load Testing
Tech
2024
Ghostwriter
Building an AI-Powered Email Writing Assistant with Personalized Style Matching
Tech
2024
Github
Evolving GitHub Copilot through LLM Experimentation and User-Centered Design
Tech
2023
Github
Building Production-Grade LLM Applications: An Architectural Guide
Tech
2023
Github
Enterprise LLM Application Development: GitHub Copilot's Journey
Tech
2024
Github
Improving Contextual Understanding in GitHub Copilot Through Advanced Prompt Engineering
Tech
2024
Github
Evolution of LLM Integration in GitHub Copilot Development
Tech
2023
Gitlab
Building Production-Scale Code Completion Tools with Continuous Evaluation and Prompt Engineering
Tech
2023
Gitlab
LLM Validation and Testing at Scale: GitLab's Comprehensive Model Evaluation Framework
Tech
2024
Gitlab
Dogfooding AI Features in GitLab's Development Workflow
Tech
2024
Glean
Building Robust Enterprise Search with LLMs and Traditional IR
Tech
2023
Globant
LLM Production Case Studies: Consulting Database Search, Automotive Showroom Assistant, and Banking Development Tools
Consulting
2023
GoDaddy
From Mega-Prompts to Production: Lessons Learned Scaling LLMs in Enterprise Customer Support
E-commerce
2024
Golden State Warriors
AI-Powered Personalized Content Recommendations for Sports and Entertainment Venue
Media & Entertainment
2023
Gong
Implementing Question-Answering Over Sales Conversations with Deal Me at Gong
Tech
2023
Google
Optimizing Security Incident Response with LLMs at Google
Tech
2024
Google
Building and Testing a Production LLM-Powered Quiz Application
Education
2023
Google / NotebookLLM
Source-Grounded LLM Assistant with Multi-Modal Output Capabilities
Tech
2024
Google / Vertex AI
Lessons Learned from Production AI Agent Deployments
Tech
2024
Grab
LLM-Powered Data Classification System for Enterprise-Scale Metadata Generation
Tech
2023
Grab
Enhancing Vector Similarity Search with LLM-Based Reranking
Tech
2024
Grab
LLM-Powered Automated Data Classification and Governance System
Tech
2023
Previous
Next
amazon_aws
,
anthropic
,
api_gateway
,
argilla
,
aws
,
cache
,
caption_generation
,
chatbot
,
chromadb
,
chunking
,
cicd
,
circleci
,
classification
,
cloudflare
,
code_generation
,
code_interpretation
,
cohere
,
compliance
,
content_moderation
,
continuous_deployment
,
continuous_integration
,
cost_optimization
,
crewai
,
customer_support
,
data_analysis
,
data_cleaning
,
data_integration
,
databases
,
databricks
,
devops
,
docker
,
document_processing
,
documentation
,
elasticsearch
,
embeddings
,
error_handling
,
fallback_strategies
,
fastapi
,
few_shot
,
fine_tuning
,
fraud_detection
,
google
,
google_gcp
,
guardrails
,
healthcare
,
high_stakes_application
,
hugging_face
,
human_in_the_loop
,
instruction_tuning
,
internet_of_things
,
knowledge_distillation
,
kubernetes
,
langchain
,
latency_optimization
,
legacy_system_integration
,
llama_index
,
load_balancing
,
meta
,
microservices
,
microsoft
,
microsoft_azure
,
mistral
,
model_optimization
,
monitoring
,
multi_agent_systems
,
multi_modality
,
nvidia
,
open_source
,
openai
,
orchestration
,
pinecone
,
poc
,
postgresql
,
prompt_engineering
,
pytorch
,
qdrant
,
question_answering
,
rag
,
realtime_application
,
redis
,
regulatory_compliance
,
reliability
,
reranking
,
scalability
,
scaling
,
security
,
semantic_search
,
serverless
,
spacy
,
speech_recognition
,
sqlite
,
structured_output
,
summarization
,
system_prompts
,
tensorflow
,
token_optimization
,
translation
,
triton
,
unstructured_data
,
vector_search
,
Start Your Free Trial Now
No new paradigms - Bring your own tools and infrastructure
No data leaves your servers, we only track metadata
Free trial included - no strings attached, cancel anytime
Try Free
Book a Demo