
Wukong-Robot is a simple, flexible and elegant multi-language voice conversation robot/smart speaker project, which aims to help Chinese Makers and Hackers quickly build personalized smart speakers. The project features support for brain-computer wake-up and integrates a variety of speech recognition and synthesis technologies. This article will guide you on how to build Wukong-Robot in your development environment and realize the functions of a smart voice speaker.
Environmental Requirements
Before you begin, make sure your system meets the following requirements:
- Python version: Python 3.7 and above is required.
- Operating system: Applicable to Linux system.
Step 1: Install Wukong-Robot
1.1 Clone the Wukong-Robot repository
Execute the following command in the terminal to clone the Wukong-Robot GitHub repository:
git clone https://github.com/wzpan/wukong-robot.git
cd wukong-robot
1.2 Install the required audio library
Install the required audio library according to your system environment:
Linux :
sudo apt-get update -y
sudo apt-get install portaudio19-dev python-pyaudio python3-pyaudio sox pulseaudio libsox-fmt-all ffmpeg
pip3 install pyaudio
1.3 Install dependent Python libraries
Install the dependent libraries required for the project:
pip3 install -r requirements.txt
1.4 Compile offline wake-up engine
Wukong-Robot supports multiple offline wake-up engines, such as Snowboy and Porcupine. In order to use the offline wake-up engine, you need to compile the _snowboydetect.so file.
First, make sure your system has swig installed:
Linux :
cd $HOME
wget https://wzpan-1253537070.cos.ap-guangzhou.myqcloud.com/misc/swig-3.0.10.tar.gz
tar xvf swig-3.0.10.tar.gz
cd swig-3.0.10
sudo apt-get install -y libpcre3 libpcre3-dev
./configure --prefix=/usr --without-clisp --without-maximum-compile-warnings
make
sudo make install
Next, build Snowboy:
cd $HOME
wget https://wzpan-1253537070.cos.ap-guangzhou.myqcloud.com/wukong/snowboy.tar.bz2
tar -xvjf snowboy.tar.bz2
cd snowboy/swig/Python3
make
cp _snowboydetect.so ../..
Step 2: Install third-party skill plugins
Wukong-Robot provides a rich plugin library that can expand the robot’s functions. Install the skill plugin library wukong-contrib:
mkdir $HOME/.wukong
cd $HOME/.wukong
git clone http://github.com/wzpan/wukong-contrib.git contrib
pip3 install -r contrib/requirements.txt
Step 3: First run configuration
When you run Wukong-Robot for the first time, the system will automatically generate the configuration file config.yml in the ~/.wukong directory. You can start Wukong-Robot with the following command:
python3 wukong.py
Please make sure the items in the configuration file are configured correctly so that Wukong-Robot can work properly.
Features
Wukong-Robot provides a series of powerful features, including:
- Modular design: Speech recognition, synthesis and conversational robot functions are highly modular and easy to expand.
- Support for multiple Chinese languages: Integrates multiple speech recognition and synthesis technologies, including Baidu, iFlytek, etc.
- Conversational robot: Supports local and online conversational robot access, and provides intelligent conversation functions.
- Offline wake-up: Supports multiple wake-up methods, including voice commands and shake.
- Smart home control: Supports linkage with multiple smart home protocols to achieve voice control.
- Open API: Provides open API interface and supports function expansion.
Conclusion
Through this tutorial, you have successfully built the Wukong-Robot smart voice speaker. This project provides developers with rich functions and flexible configuration options, which can easily realize personalized smart speakers. You can expand the functions as needed to create a smart voice assistant that meets your needs. I hope you have fun in the development process!