Tag
1 articles
A new benchmark checks whether agent memory can retain web-environment experience, not just user history, and improve long-term task recall.