Victoria Jones

isaacperez2

12 18

AI & ML interests

None yet

Recent Activity

upvoted a paper 29 days ago

VisualThink-VLA: Visual Intermediate Reasoning for Effective and Low-Latency Vision-Language-Action Policies

upvoted a paper 29 days ago

The Flip Side of RLHF: On-Policy Feedback for Reward Model Self-Supervised Improvement

liked a model about 1 month ago

Muapi/makima-chainsaw-man-flux-lora

Organizations

None yet

upvoted 2 papers 29 days ago

VisualThink-VLA: Visual Intermediate Reasoning for Effective and Low-Latency Vision-Language-Action Policies

Paper • 2605.30011 • Published May 28 • 10

The Flip Side of RLHF: On-Policy Feedback for Reward Model Self-Supervised Improvement

Paper • 2605.30888 • Published May 29 • 10

liked 3 models about 1 month ago

liked a dataset about 1 month ago

openbmb/Ultra-FineWeb-L3

Viewer • Updated May 28 • 1.06B • 71.7k • 306

liked a model about 1 month ago

tencent/Hy-MT2-1.8B

Translation • 2B • Updated May 26 • 23.5k • 1.14k

upvoted 2 papers about 1 month ago

DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards

Paper • 2605.21467 • Published May 20 • 207

The Unlearnability Phenomenon in RLVR for Language Models

Paper • 2605.16787 • Published May 16 • 6

liked a dataset about 2 months ago

gretelai/synthetic_text_to_sql

Viewer • Updated Dec 16, 2025 • 106k • 3.3k • 670

liked a model about 2 months ago

carbonx/buddy-desktop

Updated May 11 • 1

liked a dataset about 2 months ago

BAAI/Infinity-Instruct

Viewer • Updated Dec 4, 2025 • 21.9M • 3.29k • 732

upvoted a paper about 2 months ago

From Context to Skills: Can Language Models Learn from Context Skillfully?

Paper • 2604.27660 • Published May 3 • 171

liked a dataset 2 months ago

diaoweiqing/record-test_20260501_195420

Viewer • Updated May 1 • 13.4k • 26 • 1

upvoted a paper 2 months ago

Recursive Multi-Agent Systems

Paper • 2604.25917 • Published Apr 28 • 287

liked a model 2 months ago

gcfrts/mg_154b_rag_full_lora

Updated Apr 25 • 1

upvoted a paper 2 months ago

LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model

Paper • 2604.20796 • Published Apr 22 • 244

liked 2 models 2 months ago

inclusionAI/LLaDA2.0-Uni

Any-to-Any • 16B • Updated May 27 • 7.65k • 248

openbmb/VoxCPM2

Text-to-Speech • 2B • Updated Apr 16 • 636k • 1.45k

upvoted a paper 3 months ago

RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time

Paper • 2604.11626 • Published Apr 13 • 103
