Shilong Liu

ShilongLiu

https://www.lsl.zone

SlongLiu

AI & ML interests

Computer vision. Machine learning. Agents

Organizations

None yet

authored a paper 1 day ago

EEVEE: Towards Test-time Prompt Learning in the Real World for Self-Improving Agents

Paper • 2606.11182 • Published 3 days ago • 18

upvoted a paper 2 days ago

EEVEE: Towards Test-time Prompt Learning in the Real World for Self-Improving Agents

Paper • 2606.11182 • Published 3 days ago • 18

submitted a paper to Daily Papers 2 days ago

EEVEE: Towards Test-time Prompt Learning in the Real World for Self-Improving Agents

Paper • 2606.11182 • Published 3 days ago • 18

authored 15 papers 16 days ago

SegDINO3D: 3D Instance Segmentation Empowered by Both Image-Level and Object-Level 2D Features

Paper • 2509.16098 • Published Sep 19, 2025

A Survey on Mechanistic Interpretability for Multi-Modal Foundation Models

Paper • 2502.17516 • Published Feb 22, 2025

Avenir-Web: Human-Experience-Imitating Multimodal Web Agents with Mixture of Grounding Experts

Paper • 2602.02468 • Published Feb 2 • 2

MiLDEdit: Reasoning-Based Multi-Layer Design Document Editing

Paper • 2601.04589 • Published Jan 8

DN-DETR: Accelerate DETR Training by Introducing Query DeNoising

Paper • 2203.01305 • Published Mar 2, 2022

UI-Mem: Self-Evolving Experience Memory for Online Reinforcement Learning in Mobile GUI Agents

Paper • 2602.05832 • Published Feb 5

MemEye: A Visual-Centric Evaluation Framework for Multimodal Agent Memory

Paper • 2605.15128 • Published 29 days ago • 62

CubeBench: Diagnosing Interactive, Long-Horizon Spatial Reasoning Under Partial Observations

Paper • 2512.23328 • Published Jan 1

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

Paper • 2605.27365 • Published 17 days ago • 139

upvoted a paper 16 days ago

LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding

Paper • 2605.27365 • Published 17 days ago • 139

liked a model 16 days ago

nvidia/LocateAnything-3B

Image-Text-to-Text • 4B • Updated 5 minutes ago • 149k • 1.9k
