Support end-to-end ML workflows (!25) · repos / research / Research Datasets

Fabian Kaelin requested to merge revert_risk_model into main Apr 26, 2024

End-to-end ML training workflow for the revert risk model
research_ml module for shared ML abstractions, with support for distributed and local training and prediction using xgboost
ML workflow notebook as an example of using research_ml to train models interactively

A number of other changes

introduced a base features step/job, in anticipation of separating the computation of the ML features from the generation of a training dataset
updated the risk observatory batch prediction
moved support for repartitioning into the stratified sample transformation
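
For readers unfamiliar with the last item, the sketch below illustrates the general idea of a stratified sample transformation that also handles repartitioning. It is not the code from this merge request: it uses plain Python lists instead of a distributed dataframe, and the function name `stratified_sample`, its parameters, and the `rev_id` / `is_reverted` fields are all hypothetical names chosen for illustration.

```python
import random
from collections import defaultdict


def stratified_sample(rows, key, fractions, num_partitions=1, seed=0):
    """Sample each stratum at its own rate, then split the result into partitions.

    rows:           list of dicts (hypothetical stand-in for a dataframe)
    key:            field whose value defines the stratum
    fractions:      mapping stratum -> sampling fraction in [0, 1];
                    strata missing from the mapping are dropped
    num_partitions: number of round-robin partitions to return
    """
    rng = random.Random(seed)

    # Group rows by stratum.
    by_stratum = defaultdict(list)
    for row in rows:
        by_stratum[row[key]].append(row)

    # Draw a fixed-size sample without replacement from each stratum.
    sample = []
    for stratum, members in by_stratum.items():
        k = round(len(members) * fractions.get(stratum, 0.0))
        sample.extend(rng.sample(members, k))

    # Repartition inside the transformation: shuffle, then deal rows round-robin
    # so that partitions differ in size by at most one row.
    rng.shuffle(sample)
    return [sample[i::num_partitions] for i in range(num_partitions)]


# Hypothetical usage: keep every reverted revision, downsample the rest.
revisions = [{"rev_id": i, "is_reverted": i % 10 == 0} for i in range(1000)]
partitions = stratified_sample(
    revisions, "is_reverted", {True: 1.0, False: 0.1}, num_partitions=4, seed=42
)
```

Doing the repartition inside the transformation means callers get evenly sized partitions even after a heavily skewed sample, without a separate step.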