Stats about Wikimedia train deploys
machine learning pipelines for research
Datapipelines operationalised by the Generated Data Platform team.
A repo for creating model cards and datasheets for algorithms and datasets currently in production at WMF.
Aligning named tempaltes on Wikipedia
API for detecting and surfacting copyedits for Wikipedia articles
A simple script to deploy and run spark3 on WMF stat machines
An interface into Phabricator for use in Mediawiki projects
Don't leave Wikipedia articles and sections without images, here's the image suggestions data pipeline.
Friends call me ALIS and SLIS.
Investigations and QA around data captured through the Metrics Platform.