EEVEE: Towards Test-time Prompt Learning in the Real World for Self-Improving Agents Paper • 2606.11182 • Published 3 days ago • 18
Argus: Vision-Centric Reasoning with Grounded Chain-of-Thought Paper • 2505.23766 • Published May 29, 2025
AgentDistill: Training-Free Agent Distillation with Generalizable MCP Boxes Paper • 2506.14728 • Published Jun 17, 2025
MMedAgent: Learning to Use Medical Tools with Multi-modal Agent Paper • 2407.02483 • Published Jul 2, 2024 • 1
A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence Paper • 2507.21046 • Published Jul 28, 2025 • 85
UltraImage: Rethinking Resolution Extrapolation in Image Diffusion Transformers Paper • 2512.04504 • Published Dec 4, 2025 • 18
Alita-G: Self-Evolving Generative Agent for Agent Generation Paper • 2510.23601 • Published Oct 27, 2025
SegDINO3D: 3D Instance Segmentation Empowered by Both Image-Level and Object-Level 2D Features Paper • 2509.16098 • Published Sep 19, 2025
A Survey on Mechanistic Interpretability for Multi-Modal Foundation Models Paper • 2502.17516 • Published Feb 22, 2025
Avenir-Web: Human-Experience-Imitating Multimodal Web Agents with Mixture of Grounding Experts Paper • 2602.02468 • Published Feb 2 • 2
DN-DETR: Accelerate DETR Training by Introducing Query DeNoising Paper • 2203.01305 • Published Mar 2, 2022
UI-Mem: Self-Evolving Experience Memory for Online Reinforcement Learning in Mobile GUI Agents Paper • 2602.05832 • Published Feb 5
MemEye: A Visual-Centric Evaluation Framework for Multimodal Agent Memory Paper • 2605.15128 • Published 29 days ago • 62
CubeBench: Diagnosing Interactive, Long-Horizon Spatial Reasoning Under Partial Observations Paper • 2512.23328 • Published Jan 1
LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding Paper • 2605.27365 • Published 17 days ago • 139