Replace wikipedia2vec with outlink embeddings
This MR:
- replaces wikipedia2vec with outlink embeddings
- as a consequence, consolidates the 2 Python envs into 1 and updates the requirements
- removes all files related to MySQL/SQLite and the backend API, because they refer to wikipedia2vec files and we will eventually change how we store and access files

Current baseline for language-dependent models (with outlink embeddings):

wiki | precision | recall |
---|---|---|
arwiki | 0.808 | 0.340 |
bnwiki | 0.721 | 0.349 |
bowiki | 0.982 | 0.613 |
cswiki | 0.785 | 0.437 |
dewiki | 0.816 | 0.453 |
dzwiki | 1.0 | 0.1 |
ganwiki | 0.843 | 0.300 |
piwiki | nan | 0.0 |
ptwiki | 0.835 | 0.433 |
simplewiki | 0.783 | 0.398 |
viwiki | 0.875 | 0.572 |
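
A note on reading the `nan` and `0.0` entries: under the standard definitions (precision = TP / (TP + FP), recall = TP / (TP + FN)), a `nan` precision together with a recall of 0.0, as for piwiki, is what you get when the model makes no positive predictions at all on the evaluation set. The sketch below illustrates this. The function name and the counts are hypothetical, chosen only to reproduce the pattern; they are not the actual evaluation code or counts from this MR.

```python
import math
from typing import Tuple


def precision_recall(tp: int, fp: int, fn: int) -> Tuple[float, float]:
    """Return (precision, recall); a metric is nan when its denominator is zero."""
    # Precision is undefined when nothing was predicted as positive.
    precision = tp / (tp + fp) if (tp + fp) > 0 else math.nan
    # Recall is undefined when there are no true positives to find.
    recall = tp / (tp + fn) if (tp + fn) > 0 else math.nan
    return precision, recall


# Hypothetical counts: no links predicted, 5 true links missed (piwiki-like pattern).
p, r = precision_recall(tp=0, fp=0, fn=5)
print(p, r)  # nan 0.0

# Hypothetical counts: 1 correct prediction, 9 true links missed (dzwiki-like pattern).
p2, r2 = precision_recall(tp=1, fp=0, fn=9)
print(p2, r2)  # 1.0 0.1
```

Rows such as dzwiki (1.0 / 0.1) are consistent with very few predictions on a small test set, so those numbers should probably be read with caution.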