Big section titles denylist

Marco Fossati requested to merge dev into main

This MR introduces a one-off script that populates a denylist of section titles to be excluded from the pipeline at section extraction time.

The denylist lives as a project resource in a JSON file, which now contains the outcome of https://phabricator.wikimedia.org/T323504.

Note that all manually curated denylists are propagated to all Wikipedias where section alignment is available.
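
As a rough illustration of how such a denylist could be applied at section extraction time, here is a minimal sketch. It is not the code in this MR: the JSON layout (wiki code mapped to a list of titles), the function names `load_denylist` and `filter_sections`, the sample titles, and the case-insensitive matching are all assumptions made for the example.

```python
import json


def load_denylist(json_text: str) -> dict[str, set[str]]:
    """Parse a denylist from JSON text.

    Assumed (hypothetical) layout: {"<wiki code>": ["<section title>", ...]}.
    Titles are normalized to lowercase so that matching is case-insensitive.
    """
    raw = json.loads(json_text)
    return {
        wiki: {title.strip().lower() for title in titles}
        for wiki, titles in raw.items()
    }


def filter_sections(
    wiki: str, titles: list[str], denylist: dict[str, set[str]]
) -> list[str]:
    """Return the section titles of `wiki` that are not denylisted."""
    denied = denylist.get(wiki, set())  # wikis without an entry keep everything
    return [t for t in titles if t.strip().lower() not in denied]


# Hypothetical sample data, not the actual project resource.
EXAMPLE_JSON = '{"enwiki": ["References", "External links"], "itwiki": ["Note"]}'

denylist = load_denylist(EXAMPLE_JSON)
kept = filter_sections("enwiki", ["History", "References", "Geography"], denylist)
```

Keeping the denylist in a JSON resource, rather than hard-coding it, means the one-off script can regenerate it without any change to the extraction code.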

Edited by Marco Fossati