Difference between revisions of "Software"

From rosp
(Automatic speech recognition)
(Automatic speech recognition)
Line 2: Line 2:
  
 
== [[Automatic speech recognition]] ==
 
== [[Automatic speech recognition]] ==
 +
 +
'''Kaldi'''
 +
 +
Available at sourceforge [http://kaldi.sourceforge.net/ here]
 +
 +
'''CMUSphinx'''
 +
 +
Available at sourceforge [http://cmusphinx.sourceforge.net/ here]
  
 
'''Hidden Markov Model Toolkit (HTK)'''
 
'''Hidden Markov Model Toolkit (HTK)'''
  
Available from the Cambridge University [http://htk.eng.cam.ac.uk/ here] (you need to register to download)
+
Toolbox in C available for Windows, Linux and OSX from the Cambridge University [http://htk.eng.cam.ac.uk/ here] (you need to register to download)
 
 
''Resources''
 
  
*Scripts available for various Corpora, see the [[Datasets#Automatic_speech_recognition|Datasets section]]
+
''Resources related to robustness''
  
*Various official extensions [http://htk.eng.cam.ac.uk/extensions/index.shtml here].
+
*Scripts available for various robust ASR Corpora, see the [[Datasets#Automatic_speech_recognition|Datasets section]]
  
*In particular, the [http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html voicebox] MATLAB toolbox allows writing and reading feature vectors in htk format, thus allowing to replace the whole front-end.
+
*The [http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html voicebox] MATLAB toolbox allows writing and reading feature vectors in htk format, thus allowing build custom robust front ends.
  
 
*Patches to perform Uncertainty Decoding and Modified Imputation available [http://www.astudillo.com/ramon/research/stft-up/ here]
 
*Patches to perform Uncertainty Decoding and Modified Imputation available [http://www.astudillo.com/ramon/research/stft-up/ here]

Revision as of 22:30, 25 August 2013

This page provides software grouped by application.

Automatic speech recognition

Kaldi

Available at sourceforge here

CMUSphinx

Available at sourceforge here

Hidden Markov Model Toolkit (HTK)

Toolbox in C available for Windows, Linux and OSX from the Cambridge University here (you need to register to download)

Resources related to robustness

  • The voicebox MATLAB toolbox allows writing and reading feature vectors in htk format, thus allowing build custom robust front ends.
  • Patches to perform Uncertainty Decoding and Modified Imputation available here

Speaker identification and verification

Speech enhancement and separation

Other applications

Contribute software

To contribute new software, please

  • create an account and login
  • go to the wiki page above corresponding to your application; if it does not exist yet, you may create it
  • click on the "Edit" link at the top of the page and add a new section for your software (software is ordered by year of the latest version)
  • click on the "Save page" link at the bottom of the page to save your modifications

Please make sure to provide the following information:

  • name of the software and year of the latest version
  • authors, institution, contact information
  • link to the software, ideally including a short demo, and to the external libraries needed
  • short description (functionalities, inputs and outputs, programming language, operating system, license, etc) and link to a paper/report describing the software, if any
  • whether running on well-known baselines (Aurora-2, Aurora-4, Switchboard, CHiME, etc) is included or requires wrapping by the user

In order to save storage space, please do not upload the software on this wiki, but link it as much as possible from a public repository (e.g., bitbucket, github, sourceforge) or from a stable URL on the website of your institution. If this is not possible, please contact the resources sharing working group.