Simple voice activity detection. More...

#include <pocketsphinx/prim_type.h>
#include <pocketsphinx/export.h>

Macros
#define	PS_VAD_DEFAULT_SAMPLE_RATE 16000

#define	PS_VAD_DEFAULT_FRAME_LENGTH 0.03

#define	ps_vad_frame_length(vad) ((double)ps_vad_frame_size(vad) / ps_vad_sample_rate(vad))

Enumerations
enum	ps_vad_mode_e { PS_VAD_LOOSE = 0, PS_VAD_MEDIUM_LOOSE = 1, PS_VAD_MEDIUM_STRICT = 2, PS_VAD_STRICT = 3 }
	Voice activity detection "aggressiveness" levels. More...

enum	ps_vad_class_e { PS_VAD_ERROR = -1, PS_VAD_NOT_SPEECH = 0, PS_VAD_SPEECH = 1 }
	Classification of input frames returned by ps_vad_classify(). More...

Detailed Description

Simple voice activity detection.

Because doxygen is Bad Software, the actual documentation can only exist in ps_vad_t. Sorry about that.

Macro Definition Documentation

#define PS_VAD_DEFAULT_SAMPLE_RATE 16000

Default sampling rate for voice activity detector

#define PS_VAD_DEFAULT_FRAME_LENGTH 0.03

Default frame length for voice activity detector

#define ps_vad_frame_length ( vad ) ((double)ps_vad_frame_size(vad) / ps_vad_sample_rate(vad))

Get the actual length of a frame in seconds.

This may differ from the value requested in ps_vad_set_input_params().

Voice activity detection "aggressiveness" levels.

Classification of input frames returned by ps_vad_classify().