Revision as of 22:42, 25 August 2013 by Ramon.astudillo (talk | contribs) (Automatic speech recognition)

Software

From rosp

This page provides software grouped by application.

Automatic speech recognition

Kaldi

Available at sourceforge here


CMUSphinx

Available at sourceforge here


Hidden Markov Model Toolkit (HTK)

Toolbox in C available for Windows, Linux and OSX from the Cambridge University here (you need to register to download)

Resources related to robustness

  • The voicebox MATLAB toolbox allows writing and reading feature vectors in HTK format, thus making possible custom robust front ends.
  • Patches to perform Uncertainty Decoding and Modified Imputation available here

Speaker identification and verification

Speech enhancement and separation

Other applications

Contribute software

To contribute new software, please

  • create an account and login
  • go to the wiki page above corresponding to your application; if it does not exist yet, you may create it
  • click on the "Edit" link at the top of the page and add a new section for your software (software is ordered by year of the latest version)
  • click on the "Save page" link at the bottom of the page to save your modifications

Please make sure to provide the following information:

  • name of the software and year of the latest version
  • authors, institution, contact information
  • link to the software, ideally including a short demo, and to the external libraries needed
  • short description (functionalities, inputs and outputs, programming language, operating system, license, etc) and link to a paper/report describing the software, if any
  • whether running on well-known baselines (Aurora-2, Aurora-4, Switchboard, CHiME, etc) is included or requires wrapping by the user

In order to save storage space, please do not upload the software on this wiki, but link it as much as possible from a public repository (e.g., bitbucket, github, sourceforge) or from a stable URL on the website of your institution. If this is not possible, please contact the resources sharing working group.