TBX: X.X, English apostrophe tokenization rules
The current apostrophe tokenization rules fail to correctly segment English words such as “he’s” and “it’s”.
(from redmine: issue id 878, created on 2014/06/19 by Matthieu Decorde)
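For illustration, the expected segmentation can be sketched as follows. This is a minimal Python sketch, not the rules actually used by TXM: the `CLITIC` pattern, the list of clitics, and the `split_clitic` helper are all hypothetical names and assumptions chosen for the example.

```python
import re

# Assumption: a small, non-exhaustive list of English clitics, introduced by
# either a straight (') or a curly (’) apostrophe. Not taken from TXM.
CLITIC = re.compile(r"(n['’]t|['’](?:s|re|ve|ll|d|m))$", re.IGNORECASE)


def split_clitic(word):
    """Split a trailing English clitic off a single word.

    Returns a one-element list when no clitic is found, or when the
    whole word is itself a clitic.
    """
    match = CLITIC.search(word)
    if match is None or match.start() == 0:
        return [word]
    return [word[:match.start()], word[match.start():]]


for w in ["he’s", "it's", "don't", "l'homme"]:
    print(w, split_clitic(w))
```

Under these assumptions, “he’s” is split into “he” and “’s”, while a word such as “l'homme” is left untouched because its right-hand part is not in the English clitic list.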
changed milestone to %TXM 0.7.7
The English tokenization rules have been set in the TokenizerClasses class.
(from redmine: written on 2021/03/10 by Matthieu Decorde)
closed
assigned to @mdecorde