
Refactor code to python package and add CI

Merged: Gmodena requested to merge add-ci-config into main on Jul 11, 2022
  • Overview 1
  • Commits 19
  • Pipelines 7
  • Changes 15

This MR adds a Python package (differential_privacy) containing the PySpark DP jobs.

The MR also adds a GitLab CI pipeline for the repo (.gitlab-ci.yml). The pipeline makes it possible to:

  • Automatically run unit tests on push
  • Automatically run linting (flake8) on push
  • Manually run a build job that produces a conda-dist archive of dependencies, compatible with WMF airflow deployments
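
For readers unfamiliar with GitLab CI, the three jobs listed above could be laid out roughly as in the sketch below. This is not the actual .gitlab-ci.yml from this MR: the job names, the base image, the install commands, the `build-conda-dist.sh` script, and the `dist/` output directory are all assumptions made for illustration. Only the overall shape (two automatic jobs on push, one manual build job that publishes an archive) follows the description.

```yaml
# Hypothetical sketch only; names, image, and commands are assumptions,
# not the contents of the .gitlab-ci.yml added by this MR.
stages:
  - test
  - build

default:
  image: python:3.9              # assumed base image

unit-tests:
  stage: test                    # runs automatically on every push
  script:
    - pip install -e .           # assumes the package is pip-installable
    - pip install pytest         # assumes pytest is the test runner
    - pytest

lint:
  stage: test                    # runs automatically on every push
  script:
    - pip install flake8
    - flake8 differential_privacy

conda-dist:
  stage: build
  when: manual                   # only runs when triggered by hand
  script:
    - ./build-conda-dist.sh      # hypothetical placeholder for the real build command
  artifacts:
    paths:
      - dist/                    # assumed location of the built archive
```

Marking the build job with `when: manual` keeps the expensive packaging step out of the per-push pipeline while still letting a maintainer trigger it from the pipeline view.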

Testing

This MR has been tested by running the existing tmlt pipelines notebook with the conda environment published at https://gitlab.wikimedia.org/repos/security/differential-privacy/-/packages/158

TODO

The following will be tackled in follow-up MRs.

  • [ ] Try to build python-flint from source.
  • [ ] Try to reduce the conda-dist size by removing the pyspark deps (assuming they are available on stat/airflow nodes).
  • [ ] Simplify package management by using pyproject and/or poetry. This might break compatibility with our internal tooling and needs testing.
Edited Jul 12, 2022 by Gmodena
Source branch: add-ci-config