Woof woof

WoofWoof

18 15

AI & ML interests

None yet

Recent Activity

upvoted a paper 18 days ago

GeoBrowse: A Geolocation Benchmark for Agentic Tool Use with Expert-Annotated Reasoning Traces

upvoted a paper 18 days ago

RedAct: Redacting Agent Capability Traces for Procedural Skill Protection

upvoted a paper 23 days ago

Towards On-Policy Data Evolution for Visual-Native Multimodal Deep Search Agents

View all activity

Organizations

None yet

upvoted 2 papers 18 days ago

GeoBrowse: A Geolocation Benchmark for Agentic Tool Use with Expert-Annotated Reasoning Traces

Paper • 2604.04017 • Published Apr 5 • 8

RedAct: Redacting Agent Capability Traces for Procedural Skill Protection

Paper • 2606.10813 • Published 22 days ago • 23

upvoted a paper 23 days ago

Towards On-Policy Data Evolution for Visual-Native Multimodal Deep Search Agents

Paper • 2605.10832 • Published May 11 • 22

upvoted a paper 26 days ago

AdaPlanBench: Evaluating Adaptive Planning in Large Language Model Agents under World and User Constraints

Paper • 2606.05622 • Published 28 days ago • 44

liked a dataset 26 days ago

JiayuJeff/AdaPlanBench

Updated 27 days ago • 129 • 3

liked a dataset about 1 month ago

LordUky/EMCompress

Viewer • Updated May 23 • 2.75k • 110 • 2

liked 3 datasets 3 months ago

upvoted 2 papers 4 months ago

XSkill: Continual Learning from Experience and Skills in Multimodal Agents

Paper • 2603.12056 • Published Mar 12 • 34

AgentVista: Evaluating Multimodal Agents in Ultra-Challenging Realistic Visual Scenarios

Paper • 2602.23166 • Published Feb 26 • 45

liked a dataset 4 months ago

Warrieryes/AgentVista

Viewer • Updated Mar 6 • 209 • 159 • 4

upvoted a paper 5 months ago

Dancing in Chains: Strategic Persuasion in Academic Rebuttal via Theory of Mind

Paper • 2601.15715 • Published Jan 22 • 14

liked 2 datasets 5 months ago

RebuttalAgent/RebuttalBench

Preview • Updated Nov 9, 2025 • 21 • 2

RebuttalAgent/Comments_200K

Preview • Updated Nov 10, 2025 • 55 • 2

liked 2 models 5 months ago

RebuttalAgent/RebuttalAgent

8B • Updated Jan 28 • 5 • 2

RebuttalAgent/Rebuttal-RM

8B • Updated Jan 28 • 4 • 2

upvoted 2 papers 7 months ago

MMGR: Multi-Modal Generative Reasoning

Paper • 2512.14691 • Published Dec 16, 2025 • 121

Scaling Environments for LLM Agents in the Era of Learning from Interaction: A Survey

Paper • 2511.09586 • Published Nov 12, 2025 • 2

upvoted a paper 8 months ago

CostBench: Evaluating Multi-Turn Cost-Optimal Planning and Adaptation in Dynamic Environments for LLM Tool-Use Agents

Paper • 2511.02734 • Published Nov 4, 2025 • 23

Woof woof

AI & ML interests

Recent Activity

Organizations

WoofWoof's activity