I am trying to use (not implement VAD algorithm) voice activity detection to get timestamps for a given audio but facing hard time doing so.
What I am trying to achieve ?
Find an offline library for voice activity detection
What I have ?
I have an audio file (mp3)
What I am trying to get ?
Given an audio file (mp3), script/library should return the indexes/timestamps of human voice.
What I tried ?
Used https://github.com/kdavis-mozilla/vad.js and tested VAD with audio stream but it is returning whether it detects voice or not, but no way to get timestamps of when the script detects human voice.
Example ?
Script should take an audio file with 1 minute duration where human voice starts from 00:00:05 to 00:00:20 and 00:00:39 to 00:00:45 and 00:00:50 to 00:00:60 and return timestamps as output like
[[00:00:05,00:00:20],[00:00:39,00:00:45],[00:00:50,00:00:60]]
Can you suggest me a library or Git repository for my scenario ?
I have seen similar question which was asked 10 years ago but I'm not sure if the answers are still relevant
I also found this repository
– CMH12 Jan 13 '23 at 13:23