Difference between revisions of "Software"
From rosp
(→Automatic speech recognition) |
(→Automatic speech recognition) |
||
Line 2: | Line 2: | ||
== [[Automatic speech recognition]] == | == [[Automatic speech recognition]] == | ||
+ | |||
+ | '''Kaldi''' | ||
+ | |||
+ | Available at sourceforge [http://kaldi.sourceforge.net/ here] | ||
+ | |||
+ | '''CMUSphinx''' | ||
+ | |||
+ | Available at sourceforge [http://cmusphinx.sourceforge.net/ here] | ||
'''Hidden Markov Model Toolkit (HTK)''' | '''Hidden Markov Model Toolkit (HTK)''' | ||
− | + | Toolbox in C available for Windows, Linux and OSX from the Cambridge University [http://htk.eng.cam.ac.uk/ here] (you need to register to download) | |
− | |||
− | |||
− | + | ''Resources related to robustness'' | |
− | * | + | *Scripts available for various robust ASR Corpora, see the [[Datasets#Automatic_speech_recognition|Datasets section]] |
− | * | + | *The [http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html voicebox] MATLAB toolbox allows writing and reading feature vectors in htk format, thus allowing build custom robust front ends. |
*Patches to perform Uncertainty Decoding and Modified Imputation available [http://www.astudillo.com/ramon/research/stft-up/ here] | *Patches to perform Uncertainty Decoding and Modified Imputation available [http://www.astudillo.com/ramon/research/stft-up/ here] |
Revision as of 21:30, 25 August 2013
This page provides software grouped by application.
Contents
Automatic speech recognition
Kaldi
Available at sourceforge here
CMUSphinx
Available at sourceforge here
Hidden Markov Model Toolkit (HTK)
Toolbox in C available for Windows, Linux and OSX from the Cambridge University here (you need to register to download)
Resources related to robustness
- Scripts available for various robust ASR Corpora, see the Datasets section
- The voicebox MATLAB toolbox allows writing and reading feature vectors in htk format, thus allowing build custom robust front ends.
- Patches to perform Uncertainty Decoding and Modified Imputation available here
Speaker identification and verification
Speech enhancement and separation
Other applications
Contribute software
To contribute new software, please
- create an account and login
- go to the wiki page above corresponding to your application; if it does not exist yet, you may create it
- click on the "Edit" link at the top of the page and add a new section for your software (software is ordered by year of the latest version)
- click on the "Save page" link at the bottom of the page to save your modifications
Please make sure to provide the following information:
- name of the software and year of the latest version
- authors, institution, contact information
- link to the software, ideally including a short demo, and to the external libraries needed
- short description (functionalities, inputs and outputs, programming language, operating system, license, etc) and link to a paper/report describing the software, if any
- whether running on well-known baselines (Aurora-2, Aurora-4, Switchboard, CHiME, etc) is included or requires wrapping by the user
In order to save storage space, please do not upload the software on this wiki, but link it as much as possible from a public repository (e.g., bitbucket, github, sourceforge) or from a stable URL on the website of your institution. If this is not possible, please contact the resources sharing working group.