desktop/simon: Added (speech recognition software)

Signed-off-by: Robby Workman <rworkman@slackbuilds.org>
author: Roberto Metere <roberto@metere.it> 2011-12-18 00:24:32 -0600
committer: Robby Workman <rworkman@slackbuilds.org> 2011-12-18 00:24:32 -0600
commit: 2edd4d3645c7387a68ad8b200328092389aab4f2 (patch)
tree: cd9a41d2cdb218ea48fcb98998a0a65da892820e /desktop/simon/README.setup
parent: b77ef0870456f80022a8d92108c8887d0a170e4f (diff)
download: slackbuilds-2edd4d3645c7387a68ad8b200328092389aab4f2.tar.gz
1 files changed, 49 insertions, 0 deletions
diff --git a/desktop/simon/README.setup b/desktop/simon/README.setup
new file mode 100644
index 0000000000..b14f2d7f3d
--- /dev/null
+++ b/desktop/simon/README.setup
@@ -0,0 +1,49 @@
+You may want to install the Hidden Markov Model Toolkit (HTK) which is 
+covered by a license which does not permit free distribution.  However, 
+you need HTK if you want to train your acoustic model.  You can obtain
+HTK from here (but only after registering): http://htk.eng.cam.ac.uk/
+
+If you are creating solutions which will be used by more than one user, or
+simply don't have the time to train the system, you can use static base models.
+Static models are used as-is and are not modified by simon in any way.
+Because of this, it is important that the selected base model matches your
+voice as closely as possible.
+
+Even if you use a static model, you NEED to get an acoustic model from the web.
+You can download some prebuilt models at http://www.voxforge.org/
+
+BEGINNER GUIDE:
+If you are a beginner and you don't know exactly how a speech recognition works,
+but want just to enable this "cool feature", you may want to follow these steps
+(static model), in order to make simon operative (English).
+
+This is to help you to your first approach to this program, next you will
+be able to customize more and more!
+
+ 0. Browse acoustic models from
+    http://www.repository.voxforge1.org/downloads/Nightly_Builds/current/
+    Download "HTK_AcousticModel-2010-12-16_16kHz_16bit_MFCC_O_D.tgz"
+ 1. Uncompress the model where you want.
+ 2. Run "ksimond" (not from root). You need to have the daemon simond running.
+ 3. Configure "simond". (ksimond -> configuration -> simond). Add a username and a
+    password which are going to be used by simon.
+ 4. Run "simon". An assistant will appear. Click "Next" once to jump to "Scenarios"
+    section of the assistant.
+ 5. Get some scenario. You need at least one, download a scenario in English.
+ 6. Configure base model. Choose "Static model" type. From the uncompressed acoustic
+    model of step 1 choose:
+    - "hmmdefs" file for HMM definition
+    - "tiedlist" file for Tiedlist
+    - "macros" file for Macros
+    - "stats" file for Stats
+    Click "Ok", then "Next".
+ 7. "Server" and "Sound devices" sections configuration depends on what hardware
+    and software you're going to use.
+    You can safely just press "Next" to leave them unchanged.
+ 8. Adjust the volume of your microphone (or any input device you're going to use)
+    I suggest you to get the rumor at few percentage (3%-4%) and to get
+    "Volume correct" while speaking (I boosted my microphone for that)
+ 9. Optionally perform a training of speechable texts of your scenario to put your
+    voice in training data for a better recognition.
+10. Speak!
+
author	Roberto Metere <roberto@metere.it>	2011-12-18 00:24:32 -0600
committer	Robby Workman <rworkman@slackbuilds.org>	2011-12-18 00:24:32 -0600
commit	2edd4d3645c7387a68ad8b200328092389aab4f2 (patch)
tree	cd9a41d2cdb218ea48fcb98998a0a65da892820e /desktop/simon/README.setup
parent	b77ef0870456f80022a8d92108c8887d0a170e4f (diff)
download	slackbuilds-2edd4d3645c7387a68ad8b200328092389aab4f2.tar.gz