ASR language identifier

A language identifier is part of the grammar header that specifies the language to be used when performing decodes.

The format of the language identifier follows the convention set out by RFC 3066. In a nutshell, the identifier is a language and country pair, like "en-US" for United States English. The new DNN engine is accent agnostic so there is no longer a need to specify en-US or en-GB for English - you can simply specify en. Should you want dialect specific spelling then you would need to specify either en-US or en-GB for regional spelling.

The LumenVox Speech Engine currently supports the following languages:

Language Code	Description
en-US	American English acoustic model and dictionary
en-AU	Australian English acoustic model and dictionary
en-GB	U.K. English acoustic model and dictionary
en-IN	Indian English acoustic model and dictionary
es-MX	Mexican Spanish acoustic model and dictionary
es-CO	South American Spanish acoustic model and dictionary
fr-CA	French Canadian acoustic model and dictionary
pt-BR	Brazilian Portuguese acoustic model and dictionary
de-DE	German acoustic model and dictionary
it-IT	Italian acoustic model and dictionary

To specify the interaction mode in a grammar, use the following syntax in your grammar:

ABNF

language en-US;

GrXML

Was this article helpful?