How to automatically count word tokens


Is there a way to automatically count the word tokens in an ELAN transcription?

Thank you,


There are several ways to calculate that number:

  • File->Export (Multiple Files) As->List of Words...: this export function has an option to include a word count.
  • Use Search->Find and Replace... or a multiple-file variant, e.g. Search->Structured Search Multiple EAF..., with a regular expression along the lines of \b[^\s]+\b. The number of matches is then the number of tokens.
  • Use Tier->Tokenize Tier... to create tiers with one word or token per annotation, then check the number in the View->Annotation Statistics... window. This is overkill if counting tokens is the only goal.
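If you would rather do the counting outside ELAN, the regular expression from the second option can also be applied directly to the .eaf file, which is plain XML. The following is only a rough sketch, not an ELAN feature: the function names are made up for illustration, and it assumes annotation text sits in ANNOTATION_VALUE elements inside TIER elements identified by a TIER_ID attribute, as in the standard EAF format. Python's regex engine may also differ slightly from the one ELAN uses, so counts could deviate in edge cases.

```python
import re
import xml.etree.ElementTree as ET

# Same pattern as suggested for the ELAN search: a run of non-whitespace
# characters that begins and ends at a word boundary.
TOKEN_RE = re.compile(r"\b[^\s]+\b")


def count_tokens_in_root(root, tier_id=None):
    """Count regex matches in all annotation values under `root`.

    If `tier_id` is given, only the tier with that TIER_ID is counted.
    (Hypothetical helper, not part of ELAN.)
    """
    total = 0
    for tier in root.iter("TIER"):
        if tier_id is not None and tier.get("TIER_ID") != tier_id:
            continue
        for value in tier.iter("ANNOTATION_VALUE"):
            # Empty annotations have no text node, so fall back to "".
            total += len(TOKEN_RE.findall(value.text or ""))
    return total


def count_tokens_in_file(path, tier_id=None):
    """Parse an .eaf file from disk and count its word tokens."""
    return count_tokens_in_root(ET.parse(path).getroot(), tier_id)
```

Calling count_tokens_in_file("my_recording.eaf", tier_id="utterance") (both names are placeholders) would restrict the count to one tier; leaving out tier_id counts every tier, including translation or gloss tiers, which is usually not what you want.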