Skip to content

Article quality for historical revisions

Fabian Kaelin requested to merge cleanup into main

The changes in this MR were done to facilitate the computation of article scores for all historical revisions

  • add the revision timestamp to output schemas (features and scores)
  • optimizations to the SQL queries to avoid joins, avoid shuffling wikitext, filter out data before joins
  • restructure the main application arguments to facilitate configuration

The production output tables for the article quality dag are in the article_quality table.

Merge request reports