Explore projects

repos / data-engineering / MediaWiki Stream Enrichment

Archived 4 0 2

Updated Dec 14, 2022
repos / data-engineering / conda-analytics

Creates an artifact, a packed conda environment, to be deployed across the data engineering cluster. Currently contains Pyspark3 and JupyterHub.

2 1 2

Updated Jul 23, 2024

repos / generated-data-platform / Image Suggestions Feedback

1 0 0

Updated Aug 30, 2022

repos / generated-data-platform / Spark Metrics
Apache License 2.0

This is a Work In Progress fork of https://github.com/banzaicloud/spark-metrics/ modified to meet WMF conventions.

1 0 0

Updated Feb 24, 2022

repos / generated-data-platform / event-driven-poc
Apache License 2.0

1 0 0

Updated May 16, 2022

DCausse / mediawiki-page-state

1 0 0

Updated Jun 10, 2022

TChin / Generic Flink Source-to-Sink

Experimental configurable generic Flink pipelines

1 0 0

Updated Aug 18, 2022

repos / search-platform / mjolnir
MIT License

1 1 0

Updated Jun 06, 2024

repos / data-engineering / patches / WMF SparkSQLCLIDriver

0 0 0

Updated Apr 27, 2023

Gmodena / webrequests-deequ

0 0 0

Updated Dec 01, 2023

Gmodena / privacy-spark-pipeline
Apache License 2.0

Differentially private analytics with Apache Spark (demo)

0 0 0

Updated Nov 29, 2021

Bking / mjolnir
MIT License

0 0 0

Updated Sep 14, 2023

Aqu / Analytics Refinery Source
Apache License 2.0

0 0 0

Updated Nov 02, 2022

repos / search-platform / Spark MDLP Discretization
Other

0 0 0

Updated Feb 13, 2023

repos / search-platform / Spark Infotheoretic Feature Selection
Other

0 0 0

Updated Feb 13, 2023

Gmodena / spark-metrics
Apache License 2.0

Spark metrics related custom classes and sinks (e.g. Prometheus)

Archived 0 0 0

Updated Feb 09, 2022

Gmodena / spark-pushgateway

Experiments with PrometheusSink and PushGateway

0 0 0

Updated Aug 19, 2022

Gmodena / streaming-imagematching
Apache License 2.0

PoC for consuming revision and android interaction streams

PoC of a str...

0 0 1

Updated Sep 13, 2021
Neil Shah-Quinn (WMF) / conda-analytics

Creates an artifact, a packed conda environment, to be deployed across the data engineering cluster. Currently contains Pyspark3 and JupyterHub.

0 0 0

Updated Jul 23, 2024

TChin / flink-test

0 0 0

Updated Apr 25, 2022
