Split sentences on newline characters?
Currently, we don't seem to split when encountering the newline symbol. Maybe it is ok to assume that users split in paragraphs before sentence-tokenization. But I was surprised when getting really long sentences for disambiguation pages (where individual bullet points are only separated by “\n”). Do we want to consider adding this as a punctuation symbol?