The normalization was obtained through this formula: NF = (AF/CS), that is, the normalized frequency (NF) is equal to the absolute frequency (AF) divided by the corpus size
and composition: Evidence from the inflectional morphology of nouns in Old English and Old Frisian.
1998) The effect of corpus size
in predicting reaction time in a basic word recognition task: Moving on from Kucera and Francis.
This result shows that a larger the corpus size
is the better acoustic model.
is given by the number of tokens in every corpus, that is, by the total of running words.
Keywords: lexical unit, lexical unit identification, token/type ratio, dice score, corpus size
, average minimum law
is obviously a matter of considerable discussion and is not the point of this particular paper but the subject of further research.