* Enable CoreML and allow fallback to CPU on MacOS ARM64
* Disable CoreML support on MacOS ARM64
* Fix build configuration for MacOS
* Update macOS build configuration based on MACOS_ARCH environment variable
* Update BuildWhispercpp.cmake to disable FMA instructions on non-Apple platforms
* Add cuBLAS support to build and package actions
* Update Cublas versions in Windows build and packaging scripts
* Update CUDA_TOOLKIT_ROOT_DIR environment variable
* Add sub-packages and non-cuda-sub-packages options to CUDA toolkit setup
* Update CUDA sub-packages in build-project.yaml
* Add "visual_studio_integration" to sub-packages in CUDA build workflow
* Fix typo in build-project.yaml
* Fix typo in CUDA build method
* Update sub-packages in CUDA toolkit installation
* Remove unnecessary CUDA sub-packages and method