OpenSubject: Leveraging Video-Derived Identity and Diversity Priors for Subject-driven Image Generation and Manipulation Paper • 2512.08294 • Published Dec 9, 2025 • 18
RPiAE: A Representation-Pivoted Autoencoder Enhancing Both Image Generation and Editing Paper • 2603.19206 • Published Mar 19 • 1
Uni-Edit: Intelligent Editing Is A General Task For Unified Model Tuning Paper • 2605.21487 • Published May 20 • 23
Video-MME-Logical: A Controlled Diagnostic Benchmark for Video Temporal-Logical Reasoning Paper • 2606.27828 • Published 7 days ago • 23
4DThinker: Thinking with 4D Imagery for Dynamic Spatial Understanding Paper • 2605.05997 • Published May 7 • 18
Unify-Agent: A Unified Multimodal Agent for World-Grounded Image Synthesis Paper • 2603.29620 • Published Mar 31 • 49
Gen-Searcher: Reinforcing Agentic Search for Image Generation Paper • 2603.28767 • Published Mar 30 • 58