ASR language identifier

The language identifier is the part of the grammar header that specifies which language the engine uses when performing decodes.

The format of the language identifier follows the convention set out in RFC 3066. In short, the identifier is a language and country pair, such as "en-US" for United States English. The new DNN engine is accent-agnostic, so you no longer need to specify en-US or en-GB for English; you can simply specify en. If you want dialect-specific spelling, specify en-US or en-GB to get the regional spelling.
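As a rough illustration of the language and country pair described above, the following Python sketch splits an identifier into its two parts and applies the conventional casing (lowercase language, uppercase country). The function name `parse_language_identifier` is hypothetical and not part of any LumenVox API, and the pattern only covers the simple "language" or "language-COUNTRY" shape, not every tag that RFC 3066 permits.

```python
import re

# Simple "language" or "language-COUNTRY" shape only (an assumption of this
# sketch); RFC 3066 allows further sub-tags that are ignored here.
_TAG = re.compile(r"^([A-Za-z]{2,3})(?:-([A-Za-z]{2}))?$")


def parse_language_identifier(tag):
    """Return (language, country); country is None for a bare language."""
    match = _TAG.match(tag.strip())
    if match is None:
        raise ValueError(f"unrecognized language identifier: {tag!r}")
    language, country = match.groups()
    # Tags are case-insensitive; normalize to the conventional casing.
    normalized_country = country.upper() if country else None
    return language.lower(), normalized_country
```

With this helper, `parse_language_identifier("en-us")` yields the pair `("en", "US")`, and a bare `"en"` yields `("en", None)`.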

The LumenVox Speech Engine currently supports the following languages:

  

Language Code    Description
en-US            American English acoustic model and dictionary
en-AU            Australian English acoustic model and dictionary
en-GB            U.K. English acoustic model and dictionary
en-IN            Indian English acoustic model and dictionary
es-MX            Mexican Spanish acoustic model and dictionary
es-CO            South American Spanish acoustic model and dictionary
fr-CA            French Canadian acoustic model and dictionary
pt-BR            Brazilian Portuguese acoustic model and dictionary
de-DE            German acoustic model and dictionary
it-IT            Italian acoustic model and dictionary
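An application may want to check a language code against the list above before loading a grammar. The Python sketch below shows one way to do that. The names `SUPPORTED_LANGUAGE_CODES` and `is_listed_language_code` are hypothetical, and the set is simply copied from the list above, so it may not match what a given engine release actually supports. A bare `en`, which the DNN engine is said to accept, is not in the list and is therefore reported as unlisted by this helper.

```python
# Codes copied from the list above (an assumption: the engine's real list
# may differ between releases).
SUPPORTED_LANGUAGE_CODES = frozenset({
    "en-US", "en-AU", "en-GB", "en-IN", "es-MX",
    "es-CO", "fr-CA", "pt-BR", "de-DE", "it-IT",
})


def is_listed_language_code(code):
    """Case-insensitive membership test against the codes listed above."""
    language, _, country = code.strip().partition("-")
    if country:
        normalized = f"{language.lower()}-{country.upper()}"
    else:
        normalized = language.lower()
    return normalized in SUPPORTED_LANGUAGE_CODES
```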

  

To specify the language in a grammar, use the following syntax:

ABNF

 
language en-US;
 

GrXML

 
<grammar language="en-US" ... >
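To show where the ABNF declaration sits in a complete grammar header, here is a small Python sketch that renders the `language` line and embeds it in a minimal grammar. Both function names are hypothetical. The `#ABNF 1.0 UTF-8;` header, the `mode` and `root` declarations, and the `$yesno` rule follow the W3C SRGS ABNF form and are illustrative assumptions, not taken from this article.

```python
def abnf_language_declaration(code):
    """Render the ABNF header line that declares the grammar language."""
    return f"language {code};"


def minimal_abnf_grammar(code):
    """Build a tiny ABNF grammar whose header declares the given language."""
    # Everything except the language line is illustrative (W3C SRGS ABNF form);
    # the yes/no rule is a placeholder and is not localized per language.
    lines = [
        "#ABNF 1.0 UTF-8;",
        abnf_language_declaration(code),
        "mode voice;",
        "root $yesno;",
        "$yesno = yes | no;",
    ]
    return "\n".join(lines)
```

Calling `abnf_language_declaration("en-US")` produces the same `language en-US;` line shown in the ABNF example above.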
 


Copyright (C) 2001-2024, Ai Software, LLC d/b/a LumenVox