ibm-granite/granite-speech-4.1-2b Automatic Speech Recognition • 2B • Updated 16 days ago • 399k • 145
VibeVoice Collection Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/ • 8 items • Updated Mar 2 • 247
POINTS-Reader: Distillation-Free Adaptation of Vision-Language Models for Document Conversion Paper • 2509.01215 • Published Sep 1, 2025 • 52
Video ReCap: Recursive Captioning of Hour-Long Videos Paper • 2402.13250 • Published Feb 20, 2024 • 27
VideoPrism: A Foundational Visual Encoder for Video Understanding Paper • 2402.13217 • Published Feb 20, 2024 • 41
ChuanhuChatGPT 🐯 • Running on CPU Upgrade • Agents • 922 • Chat with AI models and manage conversation history