machine learning pipelines for research
Stats about Wikimedia train deploys
A simple script to deploy and run spark3 on WMF stat machines
Investigations and QA around data captured through the Metrics Platform.
A repo for creating model cards and datasheets for algorithms and datasets currently in production at WMF.
Aligning named tempaltes on Wikipedia
Don't leave Wikipedia articles and sections without images, here's the image suggestions data pipeline.
Friends call me ALIS and SLIS.
API for detecting and surfacting copyedits for Wikipedia articles
An interface into Phabricator for use in Mediawiki projects
Experiments with airflow dags and jobs running on WMF analytics infrastructure.
Tools for processing Wikimedia content for natural language processing
Small one-off analyses/code around Wikimedia projects