Machine learning pipelines for research
Stats about Wikimedia train deploys
Data pipelines operationalised by the Generated Data Platform team.
A simple script to deploy and run spark3 on WMF stat machines
An interface to Phabricator for use in MediaWiki projects
Don't leave Wikipedia articles and sections without images: here's the image suggestions data pipeline.
Friends call me ALIS and SLIS.
Investigations and QA around data captured through the Metrics Platform.
API for detecting and surfacing copyedits for Wikipedia articles
Aligning named templates on Wikipedia
A repo for creating model cards and datasheets for algorithms and datasets currently in production at WMF.