LLMOps Database
A curated knowledge base of real-world LLMOps implementations, with detailed summaries and technical notes.
Filter by tag
amazon_aws
anthropic
api_gateway
argilla
aws
cache
caption_generation
chatbot
chromadb
chunking
cicd
circleci
classification
cloudflare
code_generation
code_interpretation
cohere
compliance
content_moderation
continuous_deployment
continuous_integration
cost_optimization
crewai
customer_support
data_analysis
data_cleaning
data_integration
databases
databricks
devops
docker
document_processing
documentation
elasticsearch
embeddings
error_handling
fallback_strategies
fastapi
few_shot
fine_tuning
fraud_detection
google
google_gcp
guardrails
healthcare
high_stakes_application
hugging_face
human_in_the_loop
instruction_tuning
internet_of_things
knowledge_distillation
kubernetes
langchain
latency_optimization
legacy_system_integration
llama_index
load_balancing
meta
microservices
microsoft
microsoft_azure
mistral
model_optimization
monitoring
multi_agent_systems
multi_modality
nvidia
open_source
openai
orchestration
pinecone
poc
postgresql
prompt_engineering
pytorch
qdrant
question_answering
rag
realtime_application
redis
regulatory_compliance
reliability
reranking
scalability
scaling
security
semantic_search
serverless
spacy
speech_recognition
sqlite
structured_output
summarization
system_prompts
tensorflow
token_optimization
translation
triton
unstructured_data
vector_search
Grab
Productionizing LLM-Powered Data Governance with LangChain and LangSmith
Tech
2024
Grab
LLM-Powered Data Classification System for Large-Scale Enterprise Data Governance
Tech
2023
Grab
LLM-Powered Data Discovery and Documentation Platform
Tech
2024
Grab
RAG-Powered LLM System for Automated Analytics and Fraud Investigation
Tech
2024
Gradient Labs
Building Production-Ready Customer Support AI Agents: Challenges and Solutions
Tech
Grammarly
Building a Delicate Text Detection System for Content Safety
Tech
2024
Grammarly
Specialized Text Editing LLM Development through Instruction Tuning
Tech
2023
Hansard
Building a Modern Search Engine for Parliamentary Records with RAG Capabilities
Government
2024
Harvard
Building an AI Teaching Assistant: ChatLTV at Harvard Business School
Education
2023
HealthInsuranceLLM
Building an On-Premise Health Insurance Appeals Generation System
Healthcare
2023
HeyRevia
AI-Powered Call Center Agents for Healthcare Operations
Healthcare
2023
Holiday Extras
Enterprise AI Transformation: Holiday Extras' ChatGPT Enterprise Implementation Case Study
Other
2024
Honeycomb
Building and Scaling an LLM-Powered Query Assistant in Production
Tech
2023
Honeycomb
The Hidden Complexities of Building Production LLM Features: Lessons from Honeycomb's Query Assistant
Tech
2024
Honeycomb
Implementing LLM Observability for Natural Language Querying Interface
Tech
2023
Honeycomb
Natural Language Query Interface with Production LLM Integration
Tech
2023
Hotelplan Suisse
Generative AI-Powered Knowledge Sharing System for Travel Expertise
Other
2024
Humanloop
Best Practices for LLM Production Deployments: Evaluation, Prompt Management, and Fine-tuning
Tech
2023
Humanloop
Building a Foundation Model Operations Platform
Tech
2023
Humanloop
Pitfalls and Best Practices for Production LLM Applications
Tech
2023
IDinsight
Optimizing Text-to-SQL Pipeline Using Agent Experiments
Tech
2024
Incident.io
Building and Deploying an AI-Powered Incident Summary Generator
Tech
2024
IncludedHealth
Building a Comprehensive LLM Platform for Healthcare Applications
Healthcare
2024
Instacart
Advanced Prompt Engineering Techniques for Production LLM Applications
E-commerce
2023
Instacart
Building and Scaling an Enterprise AI Assistant with GPT Models
E-commerce
2023
Instacart
Enhancing E-commerce Search with LLMs at Scale
E-commerce
2023
InsuranceDekho
Transforming Insurance Agent Support with RAG-Powered Chat Assistant
Insurance
2024
Intercom
Multilingual Content Navigation and Localization System
Media & Entertainment
2024
Invento Robotics
Challenges in Building Enterprise Chatbots with LLMs: A Banking Case Study
Finance
2024
Jockey
Building a Scalable Conversational Video Agent with LangGraph and Twelve Labs APIs
Media & Entertainment
2024
Johns Hopkins
Medical AI Assistant for Battlefield Care Using LLMs
Healthcare
2023
Kapa.ai
Production RAG Best Practices: Implementation Lessons at Scale
Tech
2024
Kentauros AI
Building Production-Grade AI Agents: Overcoming Reasoning and Tool Challenges
Tech
2023
Klarna
AI Assistant for Global Customer Service Automation
Finance
2024
Komodo
Healthcare Data Analytics Democratization with MapAI and LLM Integration
Healthcare
2024
Large Gaming Company
Fine-tuning LLMs for Toxic Speech Classification in Gaming
Media & Entertainment
2023
LeBonCoin
LLM-Powered Search Relevance Re-Ranking System
E-commerce
2023
Lemonade
Troubleshooting and Optimizing RAG Pipelines: Lessons from Production
Insurance
2024
Lime
AI-Powered Customer Support Automation for Global Transportation Service
Tech
2024
Lindy.ai
Evolution from Open-Ended LLM Agents to Guided Workflows
Tech
2024
LinkedIn
Productionizing Generative AI Applications: From Exploration to Scale
Tech
2023
LinkedIn
Building and Deploying Large Language Models for Skills Extraction at Scale
Tech
2023
LinkedIn
Pragmatic Product-Led Approach to LLM Integration and Prompt Engineering
Tech
2023
LinkedIn
Building and Scaling a Production Generative AI Assistant for Professional Networking
Tech
2024
MLflow
MLflow's Production-Ready Agent Framework and LLM Tracing
Tech
2024
MSD
Text-to-SQL System for Complex Healthcare Database Queries
Healthcare
2024
Malt
Building a Scalable Retriever-Ranker Architecture: Malt's Journey with Vector Databases and LLM-Powered Freelancer Matching
Tech
2024
Mark43
Secure Generative AI Integration for Public Safety Applications
Tech
2024
Mastercard
Linguistic-Informed Approach to Production LLM Systems
Finance
2023
Mendable
Leveraging LangSmith for Debugging Tools & Actions in Production LLM Applications
Tech
2024
Mendix
Integrating Generative AI into Low-Code Platform Development with Amazon Bedrock
Tech
2024
Mercado Libre
Real-World LLM Implementation: RAG, Documentation Generation, and Natural Language Processing at Scale
E-commerce
2024
Mercado Libre
Building a Scalable LLM Gateway for E-commerce Recommendations
E-commerce
2023
Mercado Libre
GitHub Copilot Deployment at Scale: Enhancing Developer Productivity
E-commerce
2024
Mercado Libre / Grupo Boticario
Enhancing E-commerce Search with Vector Embeddings and Generative AI
E-commerce
2024
Mercari
Fine-Tuning and Quantizing LLMs for Dynamic Attribute Extraction
E-commerce
2024
Mercari
Building AI Assist: LLM Integration for E-commerce Product Listings
E-commerce
2023
Meta
Automated Unit Test Improvement Using LLMs for Android Applications
Tech
2024
Meta
Scaling LLM Infrastructure: Building and Operating 24K GPU Clusters for LLaMA Training
Tech
2024
Meta
Scaling AI Image Animation System with Optimized Latency and Traffic Management
Tech
2024