Explore projects
-
-
repos / releng / Train Stats
GNU General Public License v3.0 or laterStats about Wikimedia train deploys
Updated -
repos / releng / GitLab Migration Stats
GNU General Public License v3.0 or laterUpdated -
Updated
-
repos / data-engineering / Mediawiki Event Enrichment
Apache License 2.0Updated -
Htriedman / stat-spark3
Creative Commons Zero v1.0 UniversalA simple script to deploy and run spark3 on WMF stat machines
Updated -
Updated
-
Investigations and QA around data captured through the Metrics Platform.
Updated -
Htriedman / algo-accountability
Creative Commons Zero v1.0 UniversalA repo for creating model cards and datasheets for algorithms and datasets currently in production at WMF.
Updated -
-
Don't leave Wikipedia articles and sections without images, here's the image suggestions data pipeline.
Friends call me ALIS and SLIS.
Updated -
API for detecting and surfacting copyedits for Wikipedia articles
Updated -
-
Experiments with airflow dags and jobs running on WMF analytics infrastructure.
Updated -
toolforge-repos / computer-aided-tagging-test
MIT LicenseUpdated -
repos / research / Wiki NLP Tools
MIT LicenseTools for processing Wikimedia content for natural language processing
Updated -
Isaac Johnson / miscellaneous-wikimedia
MIT LicenseSmall one-off analyses/code around Wikimedia projects
Updated