Explore projects
-
Creates an artifact, a packed conda environment, to be deployed across the data engineering cluster. Currently contains Pyspark3 and JupyterHub.
Updated -
Creates an artifact, a packed conda environment, to be deployed across the data engineering cluster. Currently contains Pyspark3 and JupyterHub.
Updated -
repos / search-platform / mjolnir
MIT LicenseUpdated -
Updated
-
Bking / mjolnir
MIT LicenseUpdated -
Updated
-
Aqu / Analytics Refinery Source
Apache License 2.0Updated -
-
-
Updated
-
repos / generated-data-platform / event-driven-poc
Apache License 2.0Updated -
Updated
-
repos / generated-data-platform / Spark Metrics
Apache License 2.0This is a Work In Progress fork of https://github.com/banzaicloud/spark-metrics/ modified to meet WMF conventions.
Updated -
Gmodena / privacy-spark-pipeline
Apache License 2.0Differentially private analytics with Apache Spark (demo)
Updated -
Gmodena / streaming-imagematching
Apache License 2.0PoC for consuming revision and android interaction streams
Updated