Voice recognition using Raspberry Pi 3

In this tutorial we perform voice recognition on the Raspberry Pi 3. Recognizing a voice is an extremely complex auditory task, yet we do it almost instantaneously, and our own recognition ability is far more robust than any computer software can hope to be: we can recognize the voices of several thousand individuals we have met during our lifetime. The focus here is on a recognition setup that does not depend on heavy computation on the Pi itself. Voice recognition software is installed on the Raspberry Pi 3 and works with the help of the internet. The Raspberry Pi 3 has built-in WiFi, which suits this application very well, as internet access is easily available from an access point or even from a hotspot. Click here to get the details for configuring WiFi on the RPi 3.

Google Voice and speech APIs are used by the application software to perform voice recognition, which is why internet connectivity is a must for this project. Put simply, what happens here is speech-to-text conversion: the result is the text corresponding to the speech picked up by the USB microphone.

Required Items:

Raspberry Pi 3
USB Microphone
Internet Access (using Ethernet cable or WiFi)

The Raspberry Pi has no audio input, so it will not support a microphone on the audio jack. We therefore recommend a USB microphone or a USB webcam with a built-in mic.

Power up your Pi with all the peripherals connected (ensure the Pi has internet access). First, check the microphone connected to it. Open LXTerminal and enter the command alsamixer. The mixer interface opens up. Press F6 to select the microphone from the list of sound cards, press F4 to show the capture controls, and use the up/down arrow keys to adjust the recording volume.

Before going further, let's test the microphone by recording a sound and playing it back. Enter the following command in LXTerminal:

arecord -D plughw:1,0 test.wav

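Speak into the microphone, then press Ctrl+C to stop. The recording is saved as test.wav. Here plughw:1,0 refers to card 1, device 0; if your microphone is on a different card, run arecord -l to list the capture devices.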

To play this file, enter aplay test.wav in the terminal.
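The record-and-play check can also be scripted. The following is a minimal Python sketch, not part of the original tutorial: it only wraps the same arecord and aplay commands, and the 5-second duration, the 16 kHz 16-bit mono format and the helper names are assumptions chosen for illustration. The device string plughw:1,0 is the one used above.

```python
import subprocess

DEVICE = "plughw:1,0"   # capture device from the tutorial (card 1, device 0)
WAV_FILE = "test.wav"


def build_record_command(seconds=5, device=DEVICE, path=WAV_FILE):
    """Return the arecord argument list for a fixed-length mono recording."""
    return [
        "arecord",
        "-D", device,        # capture device
        "-d", str(seconds),  # stop automatically after this many seconds
        "-f", "S16_LE",      # 16-bit signed little-endian samples
        "-r", "16000",       # 16 kHz sample rate
        "-c", "1",           # mono
        path,
    ]


def build_play_command(path=WAV_FILE):
    """Return the aplay argument list for the recorded file."""
    return ["aplay", path]


def record_and_play(seconds=5):
    """Record from the microphone, then play the result back (needs the ALSA tools)."""
    subprocess.run(build_record_command(seconds), check=True)
    subprocess.run(build_play_command(), check=True)


# Print the commands so they can be checked before touching the hardware.
print(" ".join(build_record_command()))
print(" ".join(build_play_command()))
```

On the Pi, calling record_and_play() runs both commands in sequence. With a fixed duration there is no need to stop the recording by hand.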

Installing the Voice Recognition Software

Download the software from this link. Extract all the contents to the home folder and open LXTerminal.

Type in cd PiAUISuite/Install to change the directory.

Then type sudo ./InstallAUISuite.sh.

During installation, several confirmation questions will pop up. Read each question carefully and press y or n to select the preferred option.

Once the installation is complete, reboot the Pi. Open LXTerminal and type sudo voicecommand to check whether the application is working. After entering the command, say ‘hello’ into your microphone. If it is working, the text ‘hello’ will be printed on the terminal.

Editing the configuration file

The voice command input is converted to text using the Google voice API. By configuring the application files accordingly, we can enable the Pi to act upon our voice commands. In this example, we control an LED connected to the Pi, which can be switched on or off by the user's voice command. For this we need two programs, one to switch the LED on and the other to switch it off. Click here to get the details on how to blink an LED using a Python script. Save the programs as test12.py and test13.py.
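The two LED scripts are not reproduced in this tutorial. Below is a minimal sketch of what test12.py might look like, assuming the RPi.GPIO library and an LED wired through a resistor to BCM pin 18. The pin number and the set_led helper are hypothetical, so adjust them to your wiring. test13.py would be identical except that it calls set_led(False).

```python
# test12.py: switch the LED on (sketch; the pin number is an assumption)
LED_PIN = 18  # hypothetical BCM pin number, change it to match your wiring

try:
    import RPi.GPIO as GPIO
except (ImportError, RuntimeError):
    GPIO = None  # not running on a Raspberry Pi, fall back to a dry run


def set_led(on, pin=LED_PIN):
    """Drive the LED pin high or low. Returns False when no GPIO is available."""
    if GPIO is None:
        print("GPIO not available, would set pin %d to %s" % (pin, "HIGH" if on else "LOW"))
        return False
    GPIO.setwarnings(False)    # the pin is reused by both scripts
    GPIO.setmode(GPIO.BCM)     # use Broadcom pin numbering
    GPIO.setup(pin, GPIO.OUT)
    GPIO.output(pin, GPIO.HIGH if on else GPIO.LOW)
    # No GPIO.cleanup() here, so the pin keeps its state after the script exits.
    return True


set_led(True)
```

Off the Pi, the script only prints what it would do, which allows a dry run before wiring anything.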

When we say ‘LED ON’, the test12.py program should run and switch the LED on. When we say ‘LED OFF’, test13.py should run and switch the LED off. To do this, edit the configuration file as follows. Type in:

sudo voicecommand -e

A new screen will open.

Press the Enter key to continue and the configuration file will open in the editor. Enter the commands in the editor as shown in the figure below:
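The figure with the configuration entries is not reproduced here. Judging by the phrase==command form of the entry a reader posted in the comments below (say hello==cd /home/pi/ && aplay hello.wav), the LED entries presumably look like the two lines in the following sketch. The Python code only illustrates how a recognized phrase can be looked up in such entries. It is not PiAUISuite's actual implementation, and the exact commands, the case-insensitive matching and the function names are assumptions.

```python
# Assumed entries for the LED example, one "phrase==command" pair per line.
SAMPLE_CONFIG = """\
LED ON==cd /home/pi/ && python test12.py
LED OFF==cd /home/pi/ && python test13.py
"""


def parse_config(text):
    """Map each spoken phrase (lower-cased) to its shell command."""
    commands = {}
    for line in text.splitlines():
        line = line.strip()
        if "==" not in line:
            continue  # skip blank lines and anything that is not phrase==command
        phrase, command = line.split("==", 1)
        commands[phrase.strip().lower()] = command.strip()
    return commands


def find_command(recognized_text, commands):
    """Return the command for the recognized text, or None if nothing matches."""
    return commands.get(recognized_text.strip().lower())


commands = parse_config(SAMPLE_CONFIG)
print(find_command("led on", commands))
print(find_command("open the door", commands))
```

The lookup here is an exact match on the whole phrase, which keeps the sketch simple. The real software may match more loosely.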

Save the file by pressing Ctrl+X and then entering y. ‘LED ON’ will be the voice command: when we speak it into the microphone, the software compares it with the entries in the config file. If an entry matches, the corresponding action is carried out. In our case, it will go to the pi directory and execute the Python code test12.py. To run the software in continuous mode, use the command sudo voicecommand -c.

Though we have demonstrated all of this with a simple LED switching procedure, you might have realized the potential it carries for high-end applications. That's up to you to explore! If you do, let us know about your project; we would be happy to hear from you 🙂

28 Responses to "Voice recognition using Raspberry Pi 3"

logeais says:

January 10, 2017 at 12:53 am

Hello,
I think your tutorial is great.

Veera says:

February 5, 2017 at 6:07 pm

When I use sudo voicecommand -c, it says found audio, but the action is not performed.

My configuration:

say hello==cd /home/pi/ && aplay hello.wav

My intention is to play the recorded audio upon receiving “say hello”.

Below is the terminal log during the activity. Please, someone help.

pi@raspberrypi:~ $ sudo voicecommand -c
Opening config file…
running in continuous mode
keyword duration is 2 and duration is 3
Found audio
Recording WAVE ‘stdin’ : Unsigned 8 bit, Rate 16000 Hz, Mono
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 15149 0 14 100 15135 4 4793 0:00:03 0:00:03 –:–:– 4795
No translation
Found audio
Recording WAVE ‘stdin’ : Unsigned 8 bit, Rate 16000 Hz, Mono
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 15190 0 14 100 15176 3 3609 0:00:04 0:00:04 –:–:– 3912
No translation
Found audio
Recording WAVE ‘stdin’ : Unsigned 8 bit, Rate 16000 Hz, Mono
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 15294 0 14 100 15280 6 6735 0:00:02 0:00:02 –:–:– 6737
No translation
Found audio
Recording WAVE ‘stdin’ : Unsigned 8 bit, Rate 16000 Hz, Mono
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 15244 0 14 100 15230 2 2441 0:00:06 0:00:06 –:–:– 4088
No translation
Found audio
Recording WAVE ‘stdin’ : Unsigned 8 bit, Rate 16000 Hz, Mono
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 16830 0 14 100 16816 5 7153 0:00:02 0:00:02 –:–:– 7152
No translation
Found audio
Recording WAVE ‘stdin’ : Unsigned 8 bit, Rate 16000 Hz, Mono
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
0 0 0 0 0 0 0 0 –:–:– –:–:– –:–:– 0
[6]+ Stopped sudo voicecommand -c
pi@raspberrypi:~ $ sudo voicecommand -c

1. Mahesh says:
  
  February 7, 2017 at 6:28 pm
  
  Hi Veera, I hope you have a proper internet connection on the RPi module. If not, please provide one and check again.
  
  1. Veera says:
    
    February 11, 2017 at 11:45 am
    
    Hi mahesh, i have net connection, i could update packages on pi, what might be the other reasons?
    
    Thanks in advance
    
ndinev says:

February 27, 2017 at 3:56 am

I am getting “No translation found”
This is the same as Veera reported above.

@Mahesh: Could you elaborate more on the configuration, such as user names for the voice API? It is not clear what additional accounts, on which systems, are needed for this system to work.

Thanks

1. Mahesh says:
  
  March 7, 2017 at 11:00 am
  
  Hi ndinev,
  there could be two causes for that result:
  either a poor response from your microphone,
  or no proper internet connection.
  
shubham says:

February 28, 2017 at 5:13 pm

Could you please tell which model of Raspberry Pi 3 do we need for the project?

1. Mahesh says:
  
  March 7, 2017 at 11:15 am
  
  Hai shubham..
  we can use Raspberry PI 2 or 3
  
mohd hanif yanto says:

March 1, 2017 at 9:54 pm

Hello, can I use offline voice recognition? Should I use Sphinx or other software to recognize the voice command?

1. Mahesh says:
  
  March 7, 2017 at 11:14 am
  
  Hi mohd hanif yanto,
  Yes, you can, but I think it needs more time to process.
  
  1. mohd hanif yanto says:
    
    March 10, 2017 at 2:25 am
    
    I tried arecord -D plughw:1,0 test.wav but I get “overrun!!! (at least 483.484 ms long)”. What should I do? I am using a chroma razer 7.1 chroma as the capture hardware device, but it says: device 0: USB Audio [USB Audio], Subdevices: 1/1, Subdevice #0: subdevice #0
    
kartik says:

March 6, 2017 at 8:02 pm

I am having a problem similar to the one @veera described! Can anybody please help?

1. Mahesh says:
  
  March 7, 2017 at 11:00 am
  
  Hi kartik,
  there could be two causes for that result:
  either a poor response from your microphone,
  or no proper internet connection.
  
X3r0 says:

March 30, 2017 at 11:25 am

I just happened across this forum and decided to share my knowledge that I’ve gathered from researching this frustrating issue.

Apparently, from what I’ve read at least, Google has changed its accessibility to its voice recognition API, thus rendering this software unable to process requests. I’ve seen some workarounds involving sphinx or other offline STTs; however, nothing really definitive for using Google’s more powerful engine. It seems that this is a relatively new issue, and hopefully the author of the software can figure an alternative out. I’d make suggestions, but I’m relatively new to the whole voice recognition concept so I’m more in the learning stages instead of instruction stages.

Just thought I’d bring a little closure to the thread, albeit no solution :(.

puja says:

April 21, 2017 at 10:46 am

Can we use a mobile headset microphone instead of a USB microphone?

1. Mahesh says:
  
  April 21, 2017 at 11:25 am
  
  Hi puja,
  No, there is no built-in mic input on the Raspberry Pi.
  
2. Sandeep says:
  
  May 28, 2017 at 8:40 am
  
  Yes, I’m using a Logitech headset, but I have issues with “No translation”.
  
  1. Mahesh says:
    
    May 30, 2017 at 10:14 am
    
    Hi sandeep,
    Please make sure you have a proper internet connection. For recording, only a USB microphone device can be used with the Raspberry Pi.
    
AIMAI says:

April 26, 2017 at 9:32 pm

I am getting “No translation”!
What is the solution?

AIMAI says:

April 27, 2017 at 12:53 pm

Found audio
Recording WAVE ‘stdin’ : Unsigned 8 bit, Rate 16000 Hz, Mono
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 16323 0 14 100 16309 5 6593 0:00:02 0:00:02 –:–:– 6592
No translation

(what solution)?

1. Mahesh says:
  
  April 28, 2017 at 10:54 am
  
  hai AIMAI
  I think there is a problem in your internet connection
  
khaled says:

August 26, 2017 at 2:50 am

Hello Mahesh,
does this still work on the latest NOOBS software for the RPi 3?

1. Mahesh says:
  
  September 9, 2017 at 11:42 am
  
  Hi khaled,
  It is better to use Raspbian Jessie rather than NOOBS.
  
Awais Ahmed says:

December 21, 2017 at 5:02 am

Hello Mahesh!
When I run sudo voicecommand, it gives the following error:
error while loading shared libraries: libboost_regex.so.1.49.0: cannot open shared object file: No such file or directory

What is the solution to this problem?
Many thanks in advance 🙂

1. Mahesh says:
  
  January 2, 2018 at 11:10 am
  
  Hai Awais
  please try this command.
  sudo apt-get install libboost libboost-regex libboost-dev libboost-regex-dev
  
  Use latest raspbian jessie and make the installation as described in this page…
  
Luke says:

December 25, 2017 at 7:30 pm

Hello,
I have a question about the installation process: whenever I type cd PiAUISuite/Install, I get /PiAUISuite/Install S as output. After that I can’t run sudo ./InstallAUISuite.sh because it says: command not found.
Can someone help me with my problem?
Thanks, Luke.

1. Mahesh says:
  
  January 2, 2018 at 11:07 am
  
  Hai Luke
  please check the installation directory
  
