Explore projects
-
Libraries for working with the Wikimedia Foundation's Event Platform.
Updated -
Gmodena / streaming-imagematching
Apache License 2.0PoC for consuming revision and android interaction streams
Updated -
repos / search-platform / mjolnir
MIT LicenseUpdated -
A set of Apache Spark powered tools which are used to transform data and metrics via a set of Airflow DAGs.
Updated -
repos / wikidata-platform / Wikidata Query Service / WDQS Streaming Producer
Apache License 2.0Java Flink application which creates a Kafka stream of Wikidata page modifications in RDF format.
Updated -
Creates an artifact, a packed conda environment, to be deployed across the data engineering cluster. Currently contains Pyspark3 and JupyterHub.
Updated -
toolforge-repos / digero
MIT LicenseUpdated