Generated files are not counted in language statistics and are hidden by default in diffs on GitHub.

As the following figure indicates, Kurdish is not the only less-resourced language; most of the world's languages are still less-resourced.

Semi-supervised acoustic model training by discriminative data selection from multiple ASR systems' hypotheses.