Advanced ML knowledge
Proficiency in a broad range of machine learning algorithms and the ability to select appropriate models for specific problems.
The Professional Practitioner Certification is for working data scientists. Regularization, ensembles, feature engineering, nested cross-validation, and the judgement to pick a model and defend it to a stakeholder.
The Professional certification is designed to ensure that our certified professionals possess both the conceptual understanding and the practical skills of a mid-level data scientist. The exam is graded against the seven areas below.
Proficiency in a broad range of machine learning algorithms and the ability to select appropriate models for specific problems.
Strong coding skills in Python, with experience in optimizing code for performance and scalability.
Ability to handle large datasets, including data extraction, transformation, and loading processes.
Experience in creating and selecting features to improve model performance.
Proficiency in hyperparameter tuning, model selection, and ensemble methods to improve model performance.
Approach complex problems systematically and evaluate multiple solutions, including diagnosing issues in a model pipeline.
How ML projects align with business goals and how to translate technical results into actionable business insights.
A step beyond Associate. You need to recognize when a model is regularized correctly, when a CV strategy leaks, and how to communicate that to non-technical readers.
The advanced mental model. Probabilistic outputs, regularization regimes, and what overfitting does to soft predictions.
from sklearn.linear_model \
import LogisticRegression
clf = LogisticRegression(
penalty="elasticnet",
l1_ratio=0.5,
solver="saga",
)
Pick the baseline, regularize the noise, ensemble when warranted, and choose the metric that fits the problem.
from sklearn.ensemble import \ HistGradientBoostingClassifier from sklearn.metrics import \ average_precision_score clf = HistGradientBoostingClassifier() clf.fit(X_tr, y_tr) ap = average_precision_score( y_te, clf.predict_proba(X_te)[:,1] )
Read the plot, name the failure mode, explain it without using the word probability twice.
from sklearn.metrics import \
PrecisionRecallDisplay
PrecisionRecallDisplay\
.from_estimator(
clf, X_te, y_te,
).plot()
Heatmaps, PCA, polynomial features, label propagation. The shaping work that makes a real-world dataset trainable.
from sklearn.pipeline import \
FeatureUnion
from sklearn.preprocessing import \
PolynomialFeatures, SplineTransformer
union = FeatureUnion([
("poly", PolynomialFeatures(2)),
("spline", SplineTransformer()),
])
Group structure, non i.i.d. data, nested CV, stable hyperparameters across folds.
from sklearn.model_selection \
import GridSearchCV, GroupKFold
inner = GridSearchCV(pipe, grid, cv=5)
outer = cross_val_score(
inner, X, y, groups=g,
cv=GroupKFold(5),
)
Three certifications, each matching a level and a typical data scientist career path. Associate is the prerequisite mindset, Professional is the working bar, Expert is the bar of the people who maintain the library.
Junior data scientist. Fundamental ML, preprocessing, evaluation.
Mid-level. Regularization, ensembles, feature engineering, nested CV.
Senior practitioner. Production ML, scaling, governance.
The Professional track on Skolar matches this exam: regularization, ensembles, feature unions, and nested validation, with notebooks and practice questions written by the scikit-learn team.
Everything you need to plan your sitting, in six lines.
No. Associate is recommended as a stepping stone but not required. If you have a year or two of working data science with scikit-learn, you can sit Professional directly.
Yes. The Professional exam adds one hands-on lab on top of the multiple-choice questions. You will write and tune a small pipeline against a held-out dataset, in a sandboxed scikit-learn environment.
One retake is included with your registration. After that, retakes are discounted. There is a 21-day cool-down between attempts so you can revisit weak topics on Skolar.
Yes. Every passing candidate gets a credential ID and a public verification page on probabl.ai. Recruiters can confirm validity without contacting you.
The Professional certification is valid for 3 years. Renew by passing the Level 03 (Expert) exam, or by re-taking Professional at a discount.
120 minutes. $349 USD. Multiple-choice plus a hands-on lab, a credential issued by the maintainers themselves.