谢谢ArthurW的分享。有关MSTTR和MATTR,可见https://cran.r-project.org/web/packages/koRpus/koRpus.pdf
的66页和67页。
MSTTR (Mean Segmental Type-Token Ratio):
(1) segments the text into fixed-length segments (e.g., 100 words per segment).
(2) For each segment, the TTR is calculated as the ratio of unique words...