I've finally given up trying to make out the differentiation between Z-scores calculated by SARA and the method proposed in Yang's book, 'An Introduction to Corpus Linguistics'. I took 动态语法's suggestion "最好是找到不同语料库的相匹配的原始数据,然后用同一个统计软件计算", and left this issue unsolved.
However, by making a few...