Sentence Tokenization: language-specific sentence joiner
In src/wikinlptools/benchmarking/bmark_sentence.py, instead of joining sentences with a whitespace for all languages, we'll eventually want to replace this hard-coded " " space joiner with a language-specific joiner -- e.g., something like delimiters.get(language, " "), where delimiters is a mapping we build from language codes to joiners, covering languages that don't use whitespace after full stops.
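A minimal sketch of what this could look like. The mapping name, the language codes included, and the helper function are illustrative assumptions, not the module's actual API:

```python
# Hypothetical mapping from language code to sentence joiner.
# Languages listed here are illustrative examples of scripts that
# typically don't put a space after a full stop.
SENTENCE_JOINERS = {
    "zh": "",  # Chinese
    "ja": "",  # Japanese
}

def join_sentences(sentences, language):
    """Join tokenized sentences with a language-appropriate delimiter.

    Falls back to a single space for languages not in the mapping.
    """
    joiner = SENTENCE_JOINERS.get(language, " ")
    return joiner.join(sentences)

print(join_sentences(["你好。", "再见。"], "zh"))   # no space between sentences
print(join_sentences(["Hello.", "Goodbye."], "en"))  # space-joined default
```

Using a dict with a default via .get keeps the whitespace behavior unchanged for every language until it is explicitly added to the mapping.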