Fundamental ML
Proficiency in fundamental machine learning algorithms, knowing when to reach for a model, and when deep learning would be overkill.
The Associate Practitioner Certification is for junior data scientists. A 90 minute exam covering fundamental ML, preprocessing, model selection, and evaluation, designed and graded by the maintainers of scikit-learn.
The certification is designed to ensure that our certified professionals possess both the conceptual understanding and the practical skills of a junior data scientist. The exam is graded against the eight areas below.
Proficiency in fundamental machine learning algorithms, knowing when to reach for a model, and when deep learning would be overkill.
Comfort in Python, especially scikit-learn, Pandas, and NumPy. The everyday surface a junior DS lives in.
Cleaning, manipulating, and preprocessing data using Python libraries. Reshape it before you fit it.
Using Python plotting tools to inspect data and communicate results. Matplotlib first, seaborn for shape.
Working understanding of statistics, probability, and hypothesis testing, enough to interpret a model score.
Cross-validation, confusion matrices, ROC curves. Knowing what a good number actually means in context.
Strong attention to detail to ensure data accuracy and model reliability. The work behind reproducibility.
Logical analysis of issues, including design choices for data pipelines and how to evaluate them.
Each topic block lists the concepts and the scikit-learn surface area you will be tested on. If you can read the snippet on the right and explain what it does, you are on track.
The mental model. What a learning algorithm is, how it learns, and what can go wrong.
from sklearn.model_selection \
import train_test_split
X_tr, X_te, y_tr, y_te = \
train_test_split(X, y)
Fit, predict, score. The everyday loop, plus what the score actually tells you.
from sklearn.linear_model \
import LogisticRegression
model = LogisticRegression()
model.fit(X_tr, y_tr)
score = model.score(X_te, y_te)
Plotting results and explaining them to people who do not read confusion matrices for fun.
from sklearn.metrics import \
ConfusionMatrixDisplay
ConfusionMatrixDisplay\
.from_estimator(
model, X_te, y_te,
).plot()
Most of the job. Loading, cleaning, encoding, the work that decides whether the model can learn anything at all.
from sklearn.compose \
import ColumnTransformer
pre = ColumnTransformer([
("num", StandardScaler(), num),
("cat", OneHotEncoder(), cat),
])
Choosing the right model, tuning it honestly, and knowing how stable the answer is.
from sklearn.model_selection \
import GridSearchCV
grid = GridSearchCV(
pipeline,
param_grid={"C": [.1, 1, 10]},
cv=5,
).fit(X_tr, y_tr)
Three certifications, each matching a level and a typical data scientist career path. You are looking at the first one.
Junior data scientist. Fundamental ML, preprocessing, evaluation.
Mid-level. Pipelines, custom estimators, calibration, leakage.
Senior practitioner. Production ML, scaling, governance.
Each course matches a certification level and reflects a data scientist typical career path. Start with the Associate course, paced lessons, notebooks, and practice questions written by the scikit-learn team.
Everything you need to plan your sitting, in six lines.
No. The exam tests conceptual understanding alongside the API. If you have worked through a course like the Skolar Associate track, or have a few months of hands-on Python data work, you have the surface area.
Not in the Associate exam. Questions are multiple-choice, including short code-reading questions where you identify what a snippet does or what is wrong with it. The Professional and Expert exams include hands-on tasks.
One retake is included in the $299 fee. After that, retakes are $149. There is a 14-day cool-down between attempts so you can revisit weak topics on Skolar.
Yes. Every passing candidate gets a credential ID and a public verification page hosted on probabl.ai. Recruiters can confirm validity without contacting you.
The Associate certification is valid for 3 years. Renew by passing the Level 02 exam, or by re-taking the Associate exam at a 50% discount.
90 minutes. $299 USD. A credential issued by the maintainers themselves.