Roy Shilkrot
|
5227a437b6
|
VAD based segmentation (#97)
* refactor: Add whisper_buffer to transcription_filter_data struct
* refactor: Add sentence_psum_accept_thresh to transcription_filter_data struct
* refactor: Update buffer size and overlap size in whisper-processing.cpp
* refactor: Add audio-file-utils.cpp for audio file handling
* refactor: Add external model option to translation settings
* refactor: Add support for input tokenization style in translation settings
|
2024-05-16 15:07:00 -04:00 |
|