git.parisson.com Git - timeside.git/commit

author	David Doukhan <david.doukhan@gmail.com>
	Fri, 26 Sep 2014 13:19:58 +0000 (15:19 +0200)
committer	David Doukhan <david.doukhan@gmail.com>
	Fri, 26 Sep 2014 13:19:58 +0000 (15:19 +0200)
commit	236e27ca727b7efec09138273b6db897be863dc5
tree	8b060b765b9a3cff586e995301983ccee54c0e00	tree \| snapshot
parent	9c7bc4bad7248a3c4f4c8b1df6d5751ca8903b05	commit \| diff

Improvements to the speech activity detection model.

The analyser outputs 2 new estimates
* frame-level estimate corresponding to the raw speech/non speech estimate, filtered with dilatation and erosion procedures, remove some outliers
* speech-non speech segments, inferred from the dilatation erosion/procedure

The analyser can be initialised with 3 new options:
* size of the window used for the dilatation and erosion procedures
* threshold for the decision of speech VS non speech
* bounds for the raw estimates (usefull for plotting)