I have a long audio file that was created by concatenating many short files. I would like to detect silence between the speech segments (just a threshold is enough for my purposes) and replace them by absolute zeros such that there is no background "noise". It is important for me to retain the length of the recording.
I know that sox can detect silence at the beginning and end of a file and I can use silence
, reverse
, pad
etc. to remove the samples and fill in the zeros. Is there a way to do it everywhere in the file, not just start+end?
UPD: this is probably a pretty complicated way to ask if there are tools for voice activity detection for Linux
sox noisered
? It's not the replacement tool for which you're searching, but it might help if nothing else appears. – Chris Davies Mar 27 '15 at 13:29