eepos

eepos

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

The FID Lottery: Quantifying Hidden Randomness in Generative-Model Evaluation

liked a model 3 days ago

tarruda/DeepSeek-V4-Flash-GGUF

upvoted a paper 5 days ago

Qwen-Image-2.0-RL Technical Report

View all activity

Organizations

None yet

upvoted a paper 1 day ago

The FID Lottery: Quantifying Hidden Randomness in Generative-Model Evaluation

Paper • 2606.20536 • Published 17 days ago • 12

upvoted a paper 5 days ago

Qwen-Image-2.0-RL Technical Report

Paper • 2606.27608 • Published 10 days ago • 48

upvoted an article 25 days ago

Article

Introducing North Mini Code: Cohere’s First Model For Developers

CohereLabs

•

25 days ago

• 79

upvoted an article 30 days ago

Article

Fine-tune FLUX.2 [klein] with a LoRA under 60 minutes

black-forest-labs

•

30 days ago

• 25

upvoted a paper about 2 months ago

Asymmetric Flow Models

Paper • 2605.12964 • Published May 13 • 22

upvoted a collection about 2 months ago

Gemma 4

15 items • Updated 24 days ago • 1.01k

upvoted a collection 2 months ago

Qwen3.6

4 items • Updated Apr 22 • 427

upvoted a collection 3 months ago

MiniMax-M2

https://arxiv.org/abs/2605.26494 • 4 items • Updated May 27 • 29

upvoted a collection 4 months ago

Qwen3.5

Qwen3.5 is Qwen's new model family including Qwen3.5 Small: 0.8B, 2B, 4B, 9B and Qwen3.5 Medium: 35B-A3B, 27B, 122B-A10B and 397B-A17B. • 25 items • Updated 19 days ago • 161

upvoted an article 4 months ago

Article

GGML and llama.cpp join HF to ensure the long-term progress of Local AI

+4

ggerganov, ngxson, allozaur, lysandre, victor, julien-c

•

Feb 20

• 507

upvoted a paper 5 months ago

SLA2: Sparse-Linear Attention with Learnable Routing and QAT

Paper • 2602.12675 • Published Feb 13 • 59

upvoted 2 collections 5 months ago

Hibiki-Zero

Streaming speech translation without the need for word-level alignments • 4 items • Updated May 9 • 4

Qwen3.5

21 items • Updated Mar 9 • 1.71k

upvoted 4 collections 6 months ago

TranslateGemma

3 items • Updated Mar 12 • 245

Text-To-Speech

https://kyutai.org/next/tts • 6 items • Updated Mar 2 • 27

FLUX.2

Our second generation of FLUX • 21 items • Updated Apr 6 • 253

CASA

CASA: Cross-Attention over Self-Attention for Efficient Vision-Language Fusion on long-context streaming inputs • 6 items • Updated Mar 9 • 8

upvoted an article 7 months ago

Article

New in llama.cpp: Model Management

ggml-org

•

Dec 11, 2025

• 137

upvoted 2 collections 7 months ago

Mistral Large 3

A state-of-the-art, open-weight, general-purpose multimodal model with a granular Mixture-of-Experts architecture. • 4 items • Updated Dec 2, 2025 • 100

Qwen-Image

14 items • Updated Dec 31, 2025 • 115