Powered by Universal Speech Solutions LLC

Menu

Google Speech Recognition (GSR) Plugin 1.9.0 to the UniMRCP Server (UMS) has been released.

The plugin is based on the following components:

  • UniMRCP Server 1.5.0
  • Google Speech API v1
  • gRPC 1.7.3
  • Protobuf 3.4.0

The binaries are currently available for the following Linux distributions:

  • Red Hat / CentOS 7
  • Ubuntu 16.04 LTS

Speech contexts now can optionally be used not only as a hint but also a basic grammar. If the new attribute 'speech-complete' is set to true, then input will complete as soon as an interim result matches one of the phrases specified in the speech context. Each phrase may also be assigned an arbitrary tag to be set as an instance in the returned NLSML results.

This release also allows to specify the format of the confidence score returned in the NLSML results via a new configuration parameter 'confidence-format'. By default, the configuration parameter is set to 'auto', which means the format is implicitly determined based on the version of the protocol being used. In order to use the floating point format in the range of 0..1 consistently, the parameter must be set to 'mrcpv2'; and for the integer format in the range of 0..100 - 'mrcpv1'.

There is also a new configuration parameter 'alternatives-below-threshold', which allows to control whether or not to include alternatives with a confidence score below the specified threshold in the returned NLSML results.

Everyone is encouraged to upgrade.

The detailed list of changes introduced in this release follows.
New Features
  • Speech contexts now can optionally be used not only as a hint but also a basic grammar.
  • Allow to specify the format of the confidence score returned in the NLSML results based on a new configuration parameter.
  • Allow to control whether or not to include alternatives with a confidence score below the specified threshold in the returned NLSML results.
  • Keep track of the maximum number of channels used concurrently. This number is logged by default within the statement 'GSR Usage' and can also be written in the status file, if enabled.
Fixed Problems
  • Fixed an issue in the SDI detector, encountered when 'speech-incomplete-timeout' is significantly longer than 'speech-complete-timeout'.
  • Fixed processing of the header field 'Recognition-Timeout' to set the SDI 'speech-input-timeout' accordingly. Only the global configuration parameter was in effect before.
Configuration Parameters
  • Added a new attribute 'confidence-format' to the element 'streaming-recognition', which defaults to 'auto' and also accepts 'mrcpv2' and 'mrcpv1'.
  • Added a new boolean attribute 'alternatives-below-threshold' to the element 'streaming-recognition', which defaults to 'false'.
Miscellaneous
  • Updated the Usage Guide to reflect the changes introduced in this release.

Visit the GSR plugin page for more information.

http://www.unimrcp.org/gsr

Thank you for using UniMRCP.

--
Arsen Chaloyan
Author of UniMRCP
http://www.unimrcp.org

Latest News


This release introduces support for function calling and response requesting. The release also provides a fe minor fixes and improvements. ...Read more
IVR platforms can now utilize OpenAI Realtime Transcription WebSocket API via UniMRCP Server. ...Read more
This release fixes the format of KeepAlive messages. ...Read more
View all posts
Google Cloud

Products Provided By

Universal Speech Solutions LLC

Microsoft Azure

Products Provided By

Universal Speech Solutions LLC

IBM Watson

Products Provided By

Universal Speech Solutions LLC

Amazon Web Services

Products Provided By

Universal Speech Solutions LLC

Yandex Cloud

Products Provided By

Universal Speech Solutions LLC

Misc

Products Provided By

Universal Speech Solutions LLC

previous arrow
next arrow