Fixed support for whisper.cpp on older CPUs and issues in speaker identification.
Windows and macOS installation files for the release are available on SourceForge.
Buzz for Linux is available on Flathub and Snap.
Note for Windows
To install...
Changelog
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
Fixes Flatpak builds for the previous release, 1.4.1.
Windows and macOS installation files for the release are available on SourceForge
Adds speaker identification for transcriptions, video support in the transcription viewer, improvements to the transcription table, support for over 1,000 of the world's languages via MMS models, and a separate window for showing live transcripts on a projector.
Release details:
This release fixes wheel build issues from the previous release, 1.3.2.
Windows and macOS installation files for the release are available on SourceForge.
Buzz for Linux is available on Flathub and [Snap](https://snapcraft.i...
This release introduces Vulkan GPU support for whisper.cpp, making it significantly faster even on laptops. Real-time transcription is possible even with large models on computers whose video cards have ~5GB of memory. There is now an option to separate voice tracks before the audio is transcribed, which can improve transcript accuracy for audio with background noise or music. Faster whisper was updated to th...