Integrations
Prodigy
and
ZenML logo in purple, representing machine learning pipelines and MLOps framework.
Streamline Data Annotation with Prodigy and ZenML
The image is blank. No elements are visible for description or keyword inclusion.
Prodigy
All integrations

Prodigy

Streamline Data Annotation with Prodigy and ZenML
Add to ZenML
COMPARE
related resources
No items found.

Streamline Data Annotation with Prodigy and ZenML

Enhance your machine learning workflows by integrating Prodigy, a modern annotation tool, with ZenML. This powerful combination enables efficient data labeling, data inspection, and error analysis, streamlining your ML pipeline and improving model performance.

Features with ZenML

  • Seamless Integration:
    Easily incorporate Prodigy as a data annotation step within your ZenML pipelines.
  • Efficient Data Labeling:
    Leverage Prodigy's intuitive and optimized interface for fast and accurate data annotation.
  • Flexible Workflow Customization:
    Customize annotation workflows using Prodigy's pre-built components and ZenML's extensible architecture.
  • Streamlined Data Management:
    Effortlessly manage datasets, annotations, and metadata within the ZenML framework.

Main Features

  • Intuitive and efficient web-based annotation interface
  • Pre-built workflows for various annotation tasks
  • Customizable scripts for data loading, saving, and annotation logic
  • Extensible front-end with custom HTML and JavaScript support
  • Optimized for fast and accurate data labeling

How to use ZenML with
Prodigy
# zenml annotator register prodigy --flavor prodigy
# optionally also pass in --custom_config_path="&alt;PATH_TO_CUSTOM_CONFIG_FILE>"
# zenml stack register prodigy -o default -a default -an prodigy --set

# wget https://raw.githubusercontent.com/explosion/prodigy-recipes/master/example-datasets/news_headlines.jsonl

# Now annotate your data
# zenml annotator dataset annotate your_dataset --command="textcat.manual news_topics ./news_headlines.jsonl --label Technology,Politics,Economy,Entertainment"

# access the data later on using Python in your pipelines
from zenml import step
from zenml.client import Client

@step
def import_annotations() -> List[Dict[str, Any]]:
    zenml_client = Client()
    annotations = zenml_client.active_stack.annotator.get_labeled_data(dataset_name="your_dataset")
    # Do something with the annotations
    return annotations
    

This code snippet demonstrates how to import annotations from Prodigy within a ZenML step. It uses the ZenML client to access the active stack's annotator component and retrieves the labeled data for a specific dataset. The annotations can then be processed further in the pipeline.

Additional Resources
ZenML Prodigy Integration Docs
Prodigy Documentation
Blog: How to annotate image data for object detection with Prodigy

Streamline Data Annotation with Prodigy and ZenML

Enhance your machine learning workflows by integrating Prodigy, a modern annotation tool, with ZenML. This powerful combination enables efficient data labeling, data inspection, and error analysis, streamlining your ML pipeline and improving model performance.
Prodigy

Start Your Free Trial Now

No new paradigms - Bring your own tools and infrastructure
No data leaves your servers, we only track metadata
Free trial included - no strings attached, cancel anytime
Dashboard displaying machine learning models, including versions, authors, and tags. Relevant to model monitoring and ML pipelines.

Connect Your ML Pipelines to a World of Tools

Expand your ML pipelines with Apache Airflow and other 50+ ZenML Integrations
AzureML Pipelines
Google Artifact Registry
Microsoft Azure
WhyLabs whylogs
XGBoost
PyTorch
Docker
Lightning AI
Google Cloud Vertex AI Pipelines
Pigeon
Comet