probabl-ai/skills — methodology for agents
For AI coding agents

Methodology for agents, by the scikit-learn maintainers.

probabl-ai/skills is a collection of thirteen skills that bring scikit-learn, skrub, and Skore methodology into any agentic coding tool. The agent not only writes a pipeline, but it also builds, evaluates, and iterates with the rigor your team would otherwise enforce by hand.

By the scikit-learn core maintainersOpen source · BSD-3-ClauseAny agent
Install the skill pack
npx skills add github.com/probabl-ai/skills
The skill pack

Thirteen skills, organized the way the best data science teams already work, improved daily.

04 / API references

Any library, indexed on demand.

One skill that discovers the public API of any installed package — so the agent reads real signatures instead of hallucinating them.

See it in practice

Install the pack, then prompt the way you already do.

The skills are plain markdown files. The agent reads them as part of its session and reaches for the right one when the task fits. You don't have to invoke them by name.

  1. 01

    Install the pack

    One command into your agent's skills directory: npx skills add github.com/probabl-ai/skills. BSD-3-Clause — fork it if you want.

  2. 02

    Prompt the workflow, not the tool

    "Build a churn model on this CSV and tell me what's weakest." The agent routes through the right skills.

  3. 03

    Get a Skore report, not a notebook

    Structured evaluation, fold-level diagnostics, per-slice metrics. The same report object whether you ran it or the agent did.

  4. 04

    Iterate, audit, ship

    Use the iterate-from-* skills to source the next experiment. Sync to Skore Hub or MLflow when it's ready for production.

agent_session.py
# Skill: build-ml-pipeline
import skrub
from sklearn.linear_model import Ridge
model = skrub.tabular_pipeline(Ridge())

# Skill: evaluate-ml-pipeline
import skore
report = skore.evaluate(model, df, y, splitter=5)
report

# Skill: iterate-from-diagnostic
# The agent inspects the report:
#   → calibration drifts on the over-65 slice
#   → next experiment: per-slice isotonic calibration

# Skill: skore-api → push when you're ready
project = skore.Project("churn", mode="hub")
project.put("baseline", report)
Why we built this

Agentic AI is fast. Methodology is what makes it trustworthy.

AI assistants ship scikit-learn pipelines in seconds - downloads doubled from 100M to 200M monthly in nine months. The bottleneck isn't compute, it's the absence of shared standards. Skills are how we put the maintainers' methodology in the loop, by default, every time the agent writes code.

Pitfall warnings before training
Task-appropriate metrics
Fold-level CV diagnostics
Reports the team can review
Hands-on support

Want this wired into your team's workflow?

Probabl runs Forward Deployed Engineering engagements for teams putting agentic ML on rails. We'll audit your pipeline, integrate the skill pack alongside your existing stack, and pair with your team on the first reports that go to production.