PixMo Collection: A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 9 items • Updated about 21 hours ago • 59
ibm-granite/granite-vision-3.1-2b-preview Image-Text-to-Text • Updated about 3 hours ago • 3.12k • 42
Post: We did it. Kokoro TTS (v1.0) can now run 100% locally in your browser w/ WebGPU acceleration. Real-time text-to-speech without a server. Generate 10 seconds of speech in ~1 second for $0. What will you build? webml-community/kokoro-webgpu
The most difficult part was getting the model running in the first place, but the next steps are simple:
- Implement sentence splitting, allowing for streamed responses
- Multilingual support (only phonemization left)
Who wants to help?
distil-whisper/distil-small.en Automatic Speech Recognition • Updated Mar 25, 2024 • 59.7k • 93
Running on CPU Upgrade • 611 • Open ASR Leaderboard • Request evaluation results for a speech model