Jeff

JiayuJeff

6 26 5

JiayuJeff

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

Trimming the Long-Tail of Visual World Modeling Evaluation

upvoted a paper 6 days ago

GBC: Gradient-Based Connections for Optimizing Multi-Agent Systems

updated a dataset 7 days ago

JiayuJeff/PlanBench-XL

View all activity

Organizations

None yet

upvoted a paper 5 days ago

Trimming the Long-Tail of Visual World Modeling Evaluation

Paper • 2606.24256 • Published 13 days ago • 41

upvoted a paper 6 days ago

GBC: Gradient-Based Connections for Optimizing Multi-Agent Systems

Paper • 2606.28187 • Published 10 days ago • 13

updated a dataset 7 days ago

JiayuJeff/PlanBench-XL

Viewer • Updated 7 days ago • 327 • 130 • 4

authored a paper 10 days ago

BioInsight: Multi-Agent Orchestration for Interactive Biomedical Knowledge Discovery

Paper • 2606.20997 • Published 17 days ago • 8

upvoted a paper 11 days ago

BioInsight: Multi-Agent Orchestration for Interactive Biomedical Knowledge Discovery

Paper • 2606.20997 • Published 17 days ago • 8

upvoted a paper 12 days ago

PlanBench-XL: Evaluating Long-Horizon Planning of LLM Tool-Use Agents in Large-Scale Tool Ecosystems

Paper • 2606.22388 • Published 15 days ago • 96

commented a paper 12 days ago

PlanBench-XL: Evaluating Long-Horizon Planning of LLM Tool-Use Agents in Large-Scale Tool Ecosystems

Paper • 2606.22388 • Published 15 days ago • 96 •

authored a paper 12 days ago

PlanBench-XL: Evaluating Long-Horizon Planning of LLM Tool-Use Agents in Large-Scale Tool Ecosystems

Paper • 2606.22388 • Published 15 days ago • 96

upvoted a collection 12 days ago

awesome-agentic-benchmarks

Collection

3 items • Updated 12 days ago • 4

upvoted a paper 12 days ago

GeoBrowse: A Geolocation Benchmark for Agentic Tool Use with Expert-Annotated Reasoning Traces

Paper • 2604.04017 • Published Apr 5 • 8

submitted a paper to Daily Papers 12 days ago

PlanBench-XL: Evaluating Long-Horizon Planning of LLM Tool-Use Agents in Large-Scale Tool Ecosystems

Paper • 2606.22388 • Published 15 days ago • 96

liked a dataset 15 days ago

JiayuJeff/PlanBench-XL

Viewer • Updated 7 days ago • 327 • 130 • 4

published a dataset 15 days ago

JiayuJeff/PlanBench-XL

Viewer • Updated 7 days ago • 327 • 130 • 4

commented a paper 30 days ago

AdaPlanBench: Evaluating Adaptive Planning in Large Language Model Agents under World and User Constraints

Paper • 2606.05622 • Published Jun 4 • 44 •

upvoted a paper 30 days ago

Brick-Composer: Using MLLMs for Assembly with Diverse Bricks

Paper • 2606.05445 • Published Jun 3 • 8

upvoted a paper about 1 month ago

AdaPlanBench: Evaluating Adaptive Planning in Large Language Model Agents under World and User Constraints

Paper • 2606.05622 • Published Jun 4 • 44

submitted a paper to Daily Papers about 1 month ago

AdaPlanBench: Evaluating Adaptive Planning in Large Language Model Agents under World and User Constraints

Paper • 2606.05622 • Published Jun 4 • 44

updated a dataset about 1 month ago

JiayuJeff/AdaPlanBench

Updated about 1 month ago • 106 • 3

liked a dataset about 1 month ago

JiayuJeff/AdaPlanBench

Updated about 1 month ago • 106 • 3

published a dataset about 1 month ago

JiayuJeff/AdaPlanBench

Updated about 1 month ago • 106 • 3

Jeff

AI & ML interests

Recent Activity

Organizations

JiayuJeff's activity