set up two envs and use two versions of wikipedia2vec
zhwiki
and fywiki
Unicode errors were resolved by using wikipedia2vec==2.0.0, but it gave rise to IndexError for several other languages that had run successfully before. To make the script work for all languages the following changes were made:
- Revert back to having 2 envs: conda-analytics with python3.10 and venv with python3.7
- conda has w2v==2.0.0 installed and will be used to run w2v in
zhwiki
andfywiki
- venv has w2v==1.0.5 installed (as before) and is used to run w2v for the rest of the models
All models now run successfully.