* Add accessor for VAD window size in samples
* Feed buffered audio data to VAD in proper window sizes
* Wake whisper thread whenever audio is received
* Update silero VAD to v5
* Only reset VAD state between chunks of activity
* refactor: Update whisper model path and enable hipBLAS acceleration
* refactor: Update whisper model path and enable hipBLAS acceleration
* refactor: Update whisper model path and enable hipBLAS acceleration
* refactor: Update whisper model path and enable hipBLAS acceleration
* refactor: Update whisper model path and enable hipBLAS acceleration
* refactor: Update whisper model path and enable CoreML acceleration
* refactor: Enable partial transcription with a latency of 1000ms
* refactor: Update CMakePresets.json and buildspec.json
- Remove the "QT_VERSION" variable from CMakePresets.json for all platforms
- Update the "version" of "obs-studio" and "prebuilt" dependencies in buildspec.json
- Update the "version" of "qt6" dependency in buildspec.json
- Update the "version" of the project to "0.3.3" in buildspec.json
- Update the "version" of the project to "0.3.3" in CMakePresets.json
- Remove unused code in whisper-processing.cpp
* refactor: Add -Wno-error=deprecated-declarations option to compilerconfig.cmake
* refactor: Update language codes in translation module
* refactor: Add boolean flag for whisper model loaded status
* refactor: Improve handling of whisper model paths in transcription filter
* refactor: Update whisper model path and add flag for model loaded status
* refactor: Add transcription-filter-properties.cpp for managing filter properties
* refactor: Add translation_monitor to transcription filter
- Add translation_monitor to the transcription filter data structure
- Initialize and stop the translation_monitor in the transcription_filter_update function
- Update the send_caption_to_source function to use the translation_monitor for sending translated captions
- Clear the translation_monitor when disabling buffered output in the transcription_filter_update function
* refactor: Simplify UI and improve error handling in transcription filter
* refactor: Update translation option in transcription filter
- Update the translation option in the transcription filter to use a more concise label
- Remove unnecessary code related to file output in the transcription filter
- Improve the handling of whisper model paths in the transcription filter
- Set the default language to "auto" in the transcription filter properties
* refactor: Improve error handling in model-downloader.cpp and transcription-filter-callbacks.cpp
* refactor: Improve error handling in model-downloader.cpp and transcription-filter-callbacks.cpp
* refactor: Update model-downloader.cpp to use obs_module_config_path for retrieving the config folder path
- Replace the usage of obs_module_get_config_path with obs_module_config_path to retrieve the config folder path in model-downloader.cpp
- Add a check for a null config_folder and log an info message if it is null
- Convert the config_folder string to a wstring on Windows using MultiByteToWideChar
- Update the log messages to provide more descriptive information about the config models folder and the model folder existence in the config folder
- Use the updated config_folder_str in the std::filesystem::absolute function call
* Trigger Build
* refactor: Update model-downloader.cpp to use obs_module_config_path for retrieving the config folder path
* refactor: Fix bug in transcription filter callbacks
- Add a condition to check for null timestamps before saving the sentence to srt in the send_sentence_to_file function
- Remove unnecessary code in the set_text_callback function that checks for empty text after suppression
- Update the whisper_loop function to clear the current subtitle if the minimum subtitle duration has passed
- Update the file output option in the transcription filter to use the new "Save to File" label instead of "Text File output"
- Add a new boolean flag "save_to_file" in the transcription filter data structure to track the file output setting
- Update the code in transcription-filter-callbacks.cpp and transcription-filter.cpp to use the new flag for file output logic
- Update the properties and UI in transcription-filter-properties.cpp to reflect the changes
* refactor: Add filter-replace-dialog.cpp for filter and replace functionality
* refactor: Improve filter-replace-dialog.cpp for filter and replace functionality
* refactor: Update buffer_output_type translations in locale files
* refactor: Update buffer_num_chars_per_line translation in locale files
* refactor: Remove unused code related to buffer output type selection
* refactor: Update TokenBufferThread to use TokenBufferString for caption building
* refactor: Update TokenBufferThread to use TokenBufferString for caption building
* refactor: Improve remove_leading_trailing_nonalpha function in transcription-utils.cpp
* refactor: Set whisper language to English in transcription filter properties
* refactor: Update whispercpp dependency to version 0.0.3
* refactor: Add buffered output parameters for transcription filter
* refactor: Remove unused parameter in set_source_signals function
* refactor: Fix character splitting bug in TokenBufferThread
* refactor: Update buffer size and overlap size in whisper-processing.cpp
* refactor: Remove unused parameter in set_source_signals function
* refactor: Fix floating point precision issue in whisper-processing.cpp
* refactor: Improve remove_leading_trailing_nonalpha function in transcription-utils.cpp
* refactor: Update VAD threshold in transcription filter
* refactor: Update VAD threshold parameter name in silero-vad-onnx.h
* refactor: Update VAD threshold parameter name in silero-vad-onnx.h
* refactor: Update lock_guard parameter name in TokenBufferThread
Largely based on @umireon's [AUR build script](https://github.com/occ-ai/obs-localvocal/issues/62#issuecomment-1910708241), these steps worked for me on a
Gentoo Linux system as well, so I figured it might be helpful to add them to
the README.
I feel like some of this could be merged with the Ubuntu section above (which
might also in itself be applicable to Debian as well as Ubuntu), but for now
this should at least help people
* Update translation-utils.h, transcription-filter.h, whisper-model-utils.h, model-find-utils.h, and model-downloader.h
* Update create_context function to include ct2ModelFolder parameter
* fix: add fix_utf8 flag to transcription_filter_data struct
* Update create_context function to include ct2ModelFolder parameter
* Update read_text_from_file function to include join_sentences parameter
* fix: Update VadIterator::reset_states to include reset_hc parameter
* Update create_context function to include whisper_sampling_method parameter
* Update tests README with additional configuration options
* feat: Add function to find file in folder by regex expression
* refactor: Improve text conditioning logic in transcription-filter.cpp
* refactor: Improve text conditioning logic in transcription-filter.cpp
* chore: Update ctranslate2 dependency to version 1.2.0
* refactor: Improve text conditioning logic in transcription-filter.cpp
* chore: Update cmake BuildCTranslate2.cmake to disable -Wno-comma warning
* refactor: Update translation context in whisper-processing.cpp and translation-utils.cpp
* Update buffer size and overlap size in whisper-processing.h and default buffer size in msec in transcription-filter.cpp
* Update audio processing timestamp calculation in whisper-processing.cpp
* Update OBS plugin installation instructions for Linux
* Fix typo in update_whisper_model function name
* Update CTranslate2 and cpu_features dependencies
* Update CTranslate2 and cpu_features dependencies
* Update dependencies and fix special tokens handling
* Add BUILD_BYPRODUCTS to CMake build command
* Update version to 0.2.5 in buildspec.json
* Update version to 0.2.4 in buildspec.json
* Update special token handling in whisper-processing.cpp
* Update special token handling in whisper-processing.cpp
* Update buffer size and overlap size in whisper-processing.h and default buffer size in msec in transcription-filter.cpp
* Update buffer size and overlap size in whisper-processing.h and default buffer size in msec in transcription-filter.cpp
* Update suppress_sentences in en-US.ini and transcription-filter-data.h
* Update suppress_sentences and fix whitespace in transcription-filter-data.h, whisper-processing.h, transcription-utils.cpp, and transcription-filter.h
* Update whisper-processing.cpp and whisper-utils.cpp files
* Update findStartOfOverlap function signature to use int instead of size_t
* Update Whispercpp_Build_GIT_TAG to use commit 7395c70a748753e3800b63e3422a2b558a097c80 in BuildWhispercpp.cmake
* Update buffer size and overlap size in whisper-processing.h and default buffer size in msec in transcription-filter.cpp
* Update unused parameter in transcription-filter-properties function
* Update log level and add suppress_sentences feature in transcription-filter.cpp and whisper-processing.cpp
* Add translation output feature in en-US.ini and transcription-filter-data.h
* Add DTW token timestamps and buffered output feature
* trigger rebuild
* Refactor remove_leading_trailing_nonalpha function to improve readability and performance
* Refactor is_lead_byte and is_trail_byte macros for improved readability and maintainability
* Refactor is_lead_byte and is_trail_byte macros for improved readability and maintainability
* trigger build
* Add Silero VAD model and integrate it into the transcription filter
* Fix Silero VAD model path and enable n_threads
* Update translation strings for multiple locales
* Update Onnxruntime library linking and fix compiler warning
* Fix variable naming and type casting in Silero VAD implementation
* Update Silero VAD model path and enable n_threads