CityRAG: Stepping Into a City via Spatially-Grounded Video Generation Paper • 2604.19741 • Published Apr 21 • 17
AnyRecon: Arbitrary-View 3D Reconstruction with Video Diffusion Model Paper • 2604.19747 • Published Apr 21 • 40
view article Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift • Apr 2 • 910
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16 Text Generation • 124B • Updated Apr 29 • 1.19M • • 392
Alibaba-Apsara/Superior-Reasoning-SFT-gpt-oss-120b-Logprob Viewer • Updated Jan 15 • 435k • 4.52k • 63
KV-Embedding: Training-free Text Embedding via Internal KV Re-routing in Decoder-only LLMs Paper • 2601.01046 • Published Jan 3 • 14
MediaTek-Research/Breeze-ASR-25 Automatic Speech Recognition • 2B • Updated Jul 8, 2025 • 13.8k • 130