public marks

PUBLIC MARKS from kmaclean with tag engine

16 February 2007 15:15

Julius Open-Source Large Vocabulary Speech Recognition Engine

Julius is an open source speech recognition engine. Julius is a two-pass large vocabulary continuous speech recognition (LVCSR) software decoder. It can perform almost real-time decoding on most current PCs in 20k word dictation task. Major search techniques are fully incorporated. It is also modularized carefully to be independent from model structures, and various HMM types are supported such as shared-state triphones and tied-mixture models, with any number of mixtures, states, or phones. Standard formats are adopted to cope with other free modeling toolkit. The main platform is Linux and other Unix workstations, and also works on Windows. Julius is open source and distributed with a revised BSD style license. Julius adopts acoustic models in HTK ascii format, pronunciation dictionary in HTK-like format, and word 3-gram language models in ARPA standard format (forward 2-gram and reverse 3-gram as trained from corpus with reversed word order). Although Julius is only distributed with Japanese models, the VoxForge project (www.voxforge.org) is working on creating English Acoustic Models for use with the Julius Speech Recognition Engine.

kmaclean's TAGS related to tag engine

HTK +   isip +   Julius +   recognition +   speech +   sphinx +   voxforge +