Bangla abbreviation list could use a review
In particular, while it's mostly Latin characters, the word হয়।
is included, which seems to be a very common legitimate word to end a sentence in and is triggering what I think are a lot of false positives. Example: https://bn.wikipedia.org/wiki/%E0%A6%B6%E0%A7%87%E0%A6%96_%E0%A6%AE%E0%A7%81%E0%A6%9C%E0%A6%BF%E0%A6%AC%E0%A7%81%E0%A6%B0_%E0%A6%B0%E0%A6%B9%E0%A6%AE%E0%A6%BE%E0%A6%A8
I'm not sure if this is as simple as just removing this word from the Bangla list (because it's a rare edge case) or points to a need to more aggressively filter these abbreviation lists or turn them off by default except in languages where they've been vetted.