Thank you very much, Dr. Xiao! But what can we do about chemical formulas such as CO2, CH3CHO, and NaCl? They keep appearing in my corpus, and WS5 seems to fail to recognize them...
Also, my small raw corpus is only 1.03 million words. Is that large enough for word frequency counting? I've tried to do some collocation research, but I really don't know where to start. Do you have any suggestions? Thx!