Skip site navigation (1)Skip section navigation (2)

FreeBSD Manual Pages


home | help

       pocketsphinx_batch - Run	speech recognition in batch mode

       pocketsphinx_batch -hmm hmmdir -dict dictfile [ options ]...

       Run  speech recognition over a list of utterances in batchmode.	A list
       of arguments follows:

	      name for audio input (platform-specific)

	      Size of audio file header	in bytes (headers are ignored)

       -adcin Input is raw audio data

       -agc   Automatic	gain  control  for  c0	('max',	 'emax',  'noise',  or

	      Initial threshold	for automatic gain control

	      Do phoneme recognition

       -alpha Preemphasis parameter

	      Print back trace of recognition results

       -beam  Beam  width  applied  to	every frame in Viterbi search (smaller
	      values mean wider	beam)

	      Run bestpath (Dijkstra) search over word lattice (3rd pass)

	      Language model probability weight	for bestpath search

	      Cache senone scores from first pass search

	      Input is cepstral	files, output is log spectral files

	      files directory (prefixed	to filespecs in	control	file)

	      Input files extension (prefixed to filespecs in control file)

	      Number of	components in the input	feature	vector

       -cmn   Cepstral	mean  normalization  scheme  ('current',  'prior',  or

	      Initial  values (comma-separated)	for cepstral mean when 'prior'
	      is used

	      Compute all senone scores	in every frame	(can  be  faster  when
	      there are	many senones)

       -ctl   file listing utterances to be processed

	      No. of utterances	to be processed	(after skipping	-ctloffset en-

	      Do every Nth line	in the control file

	      No. of utterances	at the beginning of -ctl file to be skipped

       -dict  pronunciation dictionary (lexicon) input file

	      Add 1/2-bit noise

	      Use double bandwidth filters (same center	freq)

	      Frame GMM	computation downsampling ratio

	      FB Type of mel_scale or log_linear

       -fdict word pronunciation dictionary input file

       -feat  Feature stream type, depends on the acoustic model

	      Filler word transition penalty

       -frate Frame rate

       -fsg   state grammar

	      Force backtrace from FSG final state

	      finite state grammar control file

	      Use alternative pronunciations for FSG

	      (FSG Mode	(Mode 2) only) Insert filler words at each state.

       -fwd3g Use trigrams in first pass search

	      Run forward flat-lexicon search over word	lattice	(2nd pass)

	      Beam width applied to every frame	in second-pass flat search

	      Minimum number of	end frames for a word to be searched  in  fwd-
	      flat search

	      Language	model  probability  weight for flat lexicon (2nd pass)

	      Window of	frames in lattice to search  for  successor  words  in
	      fwdflat search

	      Beam width applied to word exits in second-pass flat search

	      Run forward lexicon-tree search (1st pass)

       -hmm   containing acoustic model	files.

       -hyp   output file name

	      output with segmentation file name

	      Endianness  of  input data, big or little, ignored if NIST or MS

	      Maximum number of	Gaussians per leaf node	in kd-Trees

	      Maximum depth of kd-Trees	to use

	      file for Gaussian	selection

	      Lattice size

	      Length of	sin-curve for liftering, or 0 for no liftering.

       -live  Get input	from audio hardware

       -lm    trigram language model input file

       -lmctl a	set of language	model

       The -hmm	and -dict arguments are	always required.  Either -lm  or  -fsg
       is  required, depending on whether you are using	a statistical language
       model or	a finite-state grammar.	 To do batchmode recognition, you will
       need  to	 specify a control file, using -ctl This is a simple text file
       containing one entry per	line.  Each entry is the name of an input file
       relative	 to  the -cepdir directory, and	without	the filename extension
       (which is given in the -cepext argument).

       If you are using	acoustic feature files as input	(see sphinx_fe(1)  for
       information  on	how to generate	these),	you can	also specify a subpart
       of a file, using	the following format:


       Written by numerous people at CMU from 1994 onwards.  This manual  page
       by David	Huggins-Daines <>

       Copyright (C) 1994-2007 Carnegie	Mellon University.  See	the file COPY-
       ING included with this package for more information.

       pocketsphinx_continuous(1), sphinx_fe(1).

				  2007-08-27		 POCKETSPHINX_BATCH(1)


Want to link to this manual page? Use this URL:

home | help