91a34365 Xcollazo authored

* Rename HQL files to better reflect content.
* Declare User-Agent. Try not using proxy.
* Fix parameter name.
* Fix text name.
* Address review comments from @otto.
* Delete old mismatch data only from current wiki_db run.
* Fix runtime bugs on proxy access. Improve the results table DDL.
* Attempt to fix module reference issue. Part two.
* Attempt to fix module reference issue.
* Add console script for emit_mismatch_rows.py.
* Add page_id to results table.
* Incorporate @milimetric's mismatch categorization improvements. As per https://phabricator.wikimedia.org/T368176#9973938
* PySpark job to emit mismatch rows.