Hamilton
Visit ToolApache Hamilton helps data scientists and engineers define testable, modular, and self-documenting dataflows. It encodes lineage and metadata, running and scaling wherever Python does.
At a glance
Trending
Apache Hamilton helps data scientists and engineers define testable, modular, and self-documenting dataflows. It encodes lineage and metadata, running and scaling wherever Python does.
Trending
About
Apache Hamilton is a lightweight Python library designed for creating directed acyclic graphs (DAGs) of data transformations. It enables data scientists and engineers to define testable, modular, and self-documenting dataflows that encode lineage, tracing, and metadata. The library is highly portable, running anywhere Python does, including scripts, notebooks, Airflow pipelines, and FastAPI servers. Hamilton emphasizes separation of concerns, allowing data scientists to focus on problem-solving while engineers manage production pipelines. It supports data and schema validation, built-in coding styles, and a plugin-based architecture for custom integrations. The Apache Hamilton UI provides automatic visualization, cataloging, and monitoring of execution, including data cataloging, dataset profiling, and execution tracking.
Capabilities
Pricing & Plans
Open Source
Free
FAQs
Trending