
Replace wikipedia2vec with outlink embeddings

AKhatun requested to merge new-baseline into language-agnostic-main

This MR

  • replaces wikipedia2vec with outlink embeddings
  • consolidates the 2 Python envs into 1, with corresponding changes to the requirements
  • removes all files related to MySQL/SQLite and the backend API (they refer to wikipedia2vec files, and we will eventually change the way we store and access files)

Current baseline for language-dependent models (with outlink embeddings):

wiki         precision   recall
arwiki       0.808       0.340
bnwiki       0.721       0.349
bowiki       0.982       0.613
cswiki       0.785       0.437
dewiki       0.816       0.453
dzwiki       1.0         0.1
ganwiki      0.843       0.300
piwiki       nan         0.0
ptwiki       0.835       0.433
simplewiki   0.783       0.398
viwiki       0.875       0.572
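
For readers wondering about the `nan` precision reported for piwiki: one plausible explanation (an assumption, not confirmed by this MR) is that the model made no positive predictions for that wiki, so precision is 0/0. The sketch below is not the evaluation code from this MR. `precision_recall` is a hypothetical helper and the counts are made-up illustrations, chosen only to show how values like `nan / 0.0` and `1.0 / 0.1` can arise on very small test sets.

```python
import math


def precision_recall(tp, fp, fn):
    """Compute precision and recall from raw counts.

    Hypothetical helper for illustration; returns NaN where the
    denominator is zero instead of raising ZeroDivisionError.
    """
    predicted = tp + fp  # number of positive predictions made
    actual = tp + fn     # number of positives in the ground truth
    precision = tp / predicted if predicted > 0 else math.nan
    recall = tp / actual if actual > 0 else math.nan
    return precision, recall


# Illustrative counts only (not taken from the MR's data):
# no predictions at all -> precision undefined, recall zero
print(precision_recall(tp=0, fp=0, fn=5))
# a single correct prediction out of ten true links
print(precision_recall(tp=1, fp=0, fn=9))
```

Under this reading, the dzwiki and piwiki rows would mostly reflect very small evaluation sets rather than model quality, so they may be worth treating separately from the larger wikis.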
Edited by AKhatun
