VoxForge was set up to collect speech audio files to create a GPL Speech Corpus for use with Free and Open Source Speech Recognition Engines (on Linux and Windows).

The transcribed speech will be 'compiled' into acoustic models for use with Open Source speech recognition engines such as Julius, ISIP, Sphinx, and HTK (note that HTK has distribution restrictions).

Why Do We Need Free GPL Speech Audio?

Most acoustic models used by 'Open Source' Speech Recognition engines are closed source: they do not give you access to the speech audio and transcriptions (the 'source') used to create the acoustic model.

This is because Open Source projects are required to purchase speech corpora that have restrictive licensing (i.e. they are not permitted to distribute the 'source' speech audio, but they are permitted to distribute the 'compiled' acoustic model). There is no free speech corpus in a form that can readily be used to create Acoustic Models for Speech Recognition Engines.

How Can You Help?

Record yourself reading some text, and upload your recordings to VoxForge using one of the following methods:

Where Can You Try It Out Now?

Acoustic models, created using donated speech, for the Julius Speech Recognition Engine:

© 2005-2009 VoxForge; Legal: Terms and Conditions
