ASR language identifier
A language identifier is part of the grammar header that specifies the language to be used when performing decodes.
The format of the language identifier follows the convention set out by RFC 3066. In a nutshell, the identifier is a language and country pair, like "en-US" for United States English. The new DNN engine is accent agnostic so there is no longer a need to specify en-US or en-GB for English - you can simply specify en. Should you want dialect specific spelling then you would need to specify either en-US or en-GB for regional spelling.
The LumenVox Speech Engine currently supports the following languages:
Language Code | Description |
en-US | American English acoustic model and dictionary |
en-AU | Australian English acoustic model and dictionary |
en-GB | U.K. English acoustic model and dictionary |
en-IN | Indian English acoustic model and dictionary |
es-MX | Mexican Spanish acoustic model and dictionary |
es-CO | South American Spanish acoustic model and dictionary |
fr-CA | French Canadian acoustic model and dictionary |
pt-BR | Brazilian Portuguese acoustic model and dictionary |
de-DE | German acoustic model and dictionary |
it-IT | Italian acoustic model and dictionary |
To specify the interaction mode in a grammar, use the following syntax in your grammar:
ABNF
language en-US;
GrXML
<grammar language="en-US" ... >