ai-data-science-team is a Python library of specialized AI agents that speed up common data science workflows. Its flagship application, AI Pipeline Studio, turns data science work into a visual, reproducible pipeline. The library provides agent building blocks and multi-agent workflows covering data loading and inspection, cleaning, wrangling, feature engineering, visualization, EDA, modeling and evaluation (with H2O and MLflow tools), and SQL database interaction. Notable agents include Data Loader Tools, Data Wrangling, Data Cleaning, Data Visualization, EDA Tools, Feature Engineering, SQL Database, H2O ML, MLflow Tools, and a Supervisor Agent. It supports OpenAI models as well as local models through Ollama.
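The supervisor-plus-specialists pattern described above can be illustrated with a minimal, library-independent sketch. Everything here is hypothetical: `load_data`, `clean_data`, `summarize`, `AGENTS`, and `supervisor` are plain Python stand-ins invented for illustration, not the library's classes or API, and the real agents are LLM-driven rather than hard-coded functions.

```python
from typing import Callable, Dict, List

# Hypothetical stand-ins for specialized agents (NOT the library's API).
# Each "agent" takes the shared pipeline state and returns it updated.

def load_data(state: dict) -> dict:
    # Pretend to load a small dataset; one row has a missing value.
    state["rows"] = [
        {"x": 1.0, "y": 2.0},
        {"x": None, "y": 4.0},
        {"x": 3.0, "y": 6.0},
    ]
    return state

def clean_data(state: dict) -> dict:
    # Drop every row that contains a missing value.
    state["rows"] = [
        row for row in state["rows"]
        if all(value is not None for value in row.values())
    ]
    return state

def summarize(state: dict) -> dict:
    # A tiny EDA step: row count and the mean of column "x".
    xs = [row["x"] for row in state["rows"]]
    state["summary"] = {"n": len(xs), "mean_x": sum(xs) / len(xs)}
    return state

# Registry mapping a step name to the agent that handles it.
AGENTS: Dict[str, Callable[[dict], dict]] = {
    "load": load_data,
    "clean": clean_data,
    "eda": summarize,
}

def supervisor(plan: List[str]) -> dict:
    # Run each step in order and log it, so the run can be replayed
    # from the recorded plan (the "reproducible pipeline" idea).
    state: dict = {"log": []}
    for step in plan:
        state = AGENTS[step](state)
        state["log"].append(step)
    return state

result = supervisor(["load", "clean", "eda"])
print(result["summary"])
print(result["log"])
```

Keeping the ordered list of executed steps alongside the results is one simple way to make a multi-agent run repeatable; how the library itself records pipelines is not specified here.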
Best used for
Ideal for data scientists who need to accelerate data loading and cleaning, generate visualizations and run EDA, and build and evaluate machine learning models. It is especially valuable for creating reproducible data science pipelines with AI agents.
Common actions