MRCP
Watson SR Plugin
Administrator Guide
Distribution: Red Hat / Cent OS
Created: July 4, 2018
Last updated: March 16, 2021
Author: Arsen Chaloyan
Table of Contents
2.3 Watson SR Plugin Installation
3.2 Package Installation Order
5 Obtaining Service Credentials
5.1 Watson Speech to Text API Subscription
5.2 Installation of Credentials
6 Configuring Server and Plugin
6.1 Plugin Factory Configuration
6.3 Watson SR Plugin Configuration
This guide describes how to obtain and install binary packages for the IBM Watson Speech Recognition (SR) plugin to plugin to the UniMRCP server on Red Hat-based Linux distributions. The document is intended for system administrators and developers.
Instructions provided in this guide are applicable to the following versions.
UniMRCP 1.5.0 and above UniMRCP Watson SR Plugin 1.0.0 and above |
UniMRCP RPMs are currently available for x86_64 (64-bit) architecture only.
Operating System |
Released |
End of Support |
Red Hat / Cent OS 7 |
July 2018 |
TBA |
Red Hat / Cent OS 8 |
January 2021 |
TBA |
Note: packages for other distributions can be made available upon request. For more information, contact services@unimrcp.org.
UniMRCP binary packages are available to authenticated users only. In order to register a free account with UniMRCP, please visit the following page.
Note: a new account needs to be verified and activated prior further proceeding.
Using the Yellowdog Updater, Modifier (yum), a command-line package management utility for Red Hat-based distributions, is recommended for installation of UniMRCP binary packages.
The content of a typical yum configuration file, to be placed in /etc/yum.repos.d/unimrcp.repo, is provided below.
[unimrcp] name=UniMRCP Packages for Red Hat / Cent OS-$releasever $basearch baseurl=https://username:password@unimrcp.org/repo/yum/main/rhel$releasever/$basearch/ enabled=1 sslverify=1 gpgcheck=1 gpgkey=https://unimrcp.org/keys/unimrcp-gpg-key.public
[unimrcp-noarch] name=UniMRCP Packages for Red Hat / Cent OS-$releasever noarch baseurl=https://username:password@unimrcp.org/repo/yum/main/rhel$releasever/noarch/ enabled=1 sslverify=1 gpgcheck=1 gpgkey=https://unimrcp.org/keys/unimrcp-gpg-key.public |
The username and password fields included in the HTTPS URI must be replaced with the corresponding account credentials.
In order to verify that yum can properly connect and access the UniMRCP repository, the following command can be used.
yum repolist unimrcp yum repolist unimrcp-noarch |
where unimrcp and unimrcp-noarch are names of the sections set in the yum configuration file above.
In order to retrieve a list of packages the UniMRCP repository provides, the following command can be used.
yum --disablerepo="*" --enablerepo="unimrcp" list available yum --disablerepo="*" --enablerepo="unimrcp-noarch" list available |
In order to install the Watson SR plugin, including all the dependencies, use the following command.
yum install unimrcp-watson-sr |
In order to install the additional data files for the sample client application umc, the following command can be used.
yum install umc-addons |
Note: this package is optional and provides additional data which can be used for validation of basic setup.
UniMRCP RPM packages can be installed manually using the rpm utility. Note, however, that the system administrator should take care of package dependencies and install all the packages in appropriate order.
The RPM packages have the following naming convention:
$packagename-$universion-$packageversion.el$rhelversion.$arch.rpm |
where
· packagename is the name of a package
· universion is the UniMRCP version
· packageversion is the RPM release version
· rhelversion is the Red Hat version
· arch is the architecture (x86_64, i686, …)
The following is a list of UniMRCP RPM packages required for the installation of the Watson SR plugin.
Package Name |
Description |
unimrcp-watson-sr |
Watson SR plugin to the server. |
unilibevent |
UniMRCP edition of the libevent library. |
umc-addons |
Sample en-US data files used with umc. [Optional] |
unilicnodegen |
Node information retrieval tool, required for license deployment. |
unimrcp-server |
Shared library and application of the server. |
unimrcp-client |
Shared libraries and sample applications of the client. [Optional] |
unimrcp-demo-plugins |
Set of demo plugins to the server. [Optional] |
unimrcp-common |
Data common for the client and the server. |
uniapr |
UniMRCP edition of the Apache Portable Runtime (APR) library. |
uniapr-util |
UniMRCP edition of the Apache Portable Runtime Utility (APR-Util) library. |
unisofia-sip |
UniMRCP edition of the Sofia SIP library. |
Note that all the RPM packages provided by UniMRCP are signed by a GNU Privacy Guard (GPG) key. Before starting the installation, you may need to import the public key in order to allow the rpm utility to verify the packages.
rpm --import https://unimrcp.org/keys/unimrcp-gpg-key.public |
Packages for the APR, APR-Util and Sofia-SIP libraries must be installed first.
rpm -ivh uniapr-$aprversion-$packageversion.el$rhelversion.$arch.rpm rpm -ivh uniapr-util-$apuversion-$packageversion.el$rhelversion.$arch.rpm rpm -ivh unisofia-sip-$sofiaversion-$packageversion.el$rhelversion.$arch.rpm |
Then, a package containing common data for the client and the server, and a package for the server should follow.
rpm -ivh unimrcp-common-$universion-$packageversion.el$rhelversion.$arch.rpm rpm -ivh unimrcp-server-$universion-$packageversion.el$rhelversion.$arch.rpm |
Next, a package containing the utility tool unilicnodegen, required for license deployment.
rpm -ivh unilicnodegen-$toolversion-$packageversion.el$rhelversion.$arch.rpm |
Next, a package containing the libevent library.
rpm -ivh unilibevent-$libeventversion-$packageversion.el$rhelversion.$arch.rpm |
Finally, a package containing the Watson SR plugin should follow.
rpm -ivh unimrcp-watson-sr-$universion-$packageversion.el$rhelversion.noarch.rpm |
The Watson SR plugin to the UniMRCP server is a commercial product, which requires a license file to be installed.
The following license types are available:
· Trial
· Production
· Test and Development
The license files are bound to a node the product is installed on. In order to obtain a license, the corresponding node information needs to be retrieved and submitted for generation of a license file.
Use the installed tool unilicnodegen to retrieve the node information.
/opt/unimrcp/bin/unilicnodegen |
As a result, a text file uninode.info will be saved in the current directory. Submit the file uninode.info for license generation to services@unimrcp.org by mentioning the product name in the subject.
The license file needs to be placed into the directory /opt/unimrcp/data.
cp umswatsonsr_*.lic /opt/unimrcp/data |
Instructions provided in this section are for the new IAM token based authentication, supported since WSR 1.4.0. The legacy username and password based authentication is no longer supported. |
In order to utilize the IBM Watson Speech to Text API, corresponding service credentials need to be retrieved from the IBM Cloud portal and further installed to the UniMRCP server.
Subscribe to the Watson Speech to Text API and obtain the credentials.
1. Navigate to the Speech to Text API portal. https://console.bluemix.net/catalog/services/speech-to-text 2. Log in and create a service. 3. Select the Service credentials tab and create a new credential. 4. View and copy the credentials, which should be available in the following format { "apikey": "3df... ...Y7Pc9", "iam_apikey_description": "Auto generated apikey during resource-key operation for...", "iam_apikey_name": "auto-generated-apikey-31b336bc-2d6a-41c3-a8b2-e05ec6db19b4", "iam_role_crn": "crn:v1:bluemix:public:iam::::serviceRole:Manager", "iam_serviceid_crn": "crn:v1:bluemix:public:iam-identity::a/57d48380...::serviceid:...", "url": "https://gateway-lon.watsonplatform.net/speech-to-text/api" } |
Paste the credentials into a text file watsonsr.credentials, which needs to be placed to the directory /opt/unimrcp/data.
nano /opt/unimrcp/data/watsonsr.credentials |
In order to load the Watson SR plugin into the UniMRCP server, open the file unimrcpserver.xml, located in the directory /opt/unimrcp/conf, and add the following entry under the XML element <plugin-factory>. Disable other recognition plugins, if available. The remaining demo plugins might also be disabled, if not installed.
<!-- Factory of plugins (MRCP engines) --> <plugin-factory> <engine id="Demo-Synth-1" name="demosynth" enable="true"/> <engine id="Demo-Recog-1" name="demorecog" enable="false"/> <engine id="Demo-Verifier-1" name="demoverifier" enable="true"/> <engine id="Recorder-1" name="mrcprecorder" enable="true"/> <engine id="Watson-SR-1" name="umswatsonsr" enable="true"/> </plugin-factory> |
In order to enable log output from the plugin and set filtering rules, open the configuration file logger.xml, located in the directory /opt/unimrcp/conf, and add the following entry under the element <sources>.
<source name=" WATSONSR-PLUGIN" priority="INFO" masking="NONE"/> |
The configuration file of the plugin is located in /opt/unimrcp/conf/umswatsonsr.xml. Default settings should be sufficient for general use.
Refer to the Usage Guide for more information.
Validate your setup by using the sample UniMRCP client and server applications on the same host. The default configuration and data files should be sufficient for a basic test.
Launch the UniMRCP server application.
cd /opt/unimrcp/bin ./unimrcpserver |
In the server log output, check whether the plugin is normally loaded.
[INFO] Load Plugin [Watson-SR-1] [/opt/unimrcp/plugin/umswatsonsr.so] |
Next, check for the license information.
[NOTICE] UniMRCP WSR License
-product name: umswatsonsr -product version: 1.0.0 -license owner: - -license type: trial -issue date: 2018-06-28 -exp date: 2018-07-28 -channel count: 2 -feature set: 0 |
Note: the optional package umc-addons must be installed for this test to work.
Launch the sample UniMRCP client application umc.
cd /opt/unimrcp/bin ./umc |
Run a typical speech recognition scenario by issuing the command run gsr1 from the console of the umc client application.
run gsr1 |
This command sends a RECOGNIZE request to the server and then starts streaming a sample audio input file callsteve.pcm to recognize.
Check for the NLSML results to be returned as expected.
<?xml version="1.0"?> <result> <interpretation grammar="command" confidence="0.92"> <instance>call Steve</instance> <input mode="speech">call Steve</input> </interpretation> </result> |
Visually inspect the log output for any possible warnings or errors.
Note that utterances are stored in the var directory, if the corresponding parameter is enabled in the configuration file umswatsonsr.xml and/or requested by the client.