The MailSphere platform provides an email service that allows users to send, receive, and organize messages, while offering tools for search, labeling, and communication management.
Overall Pass Rate (pass@8)
Easy: 66.7% < Pass@8 ≤ 100%
Medium: 33.3% < Pass@8 < 66.6% AND Median Steps ≤ 80
Hard: Pass@8 ≤ 33.3% OR Median Steps > 80
This model: 16 easy · 4 medium · 13 hard tasks
Pass rate: 52.27%
Compose / Reply / Forward
Create and send messages.
Archive / Delete
Manage inbox efficiently.
Advanced Search
Filter by sender/date/keyword.
Labels & Filters
Categorize and auto-sort emails.
Smart Compose
Predict text to speed writing.
Reusable Templates
Save frequently used messages.
Shared Drafts
Co-edit email content in real-time.
Inbox Customization
Adjust layout and priority inbox.
Watch how an AI agent interacts with this environment
Prompt
I see an urgent invoice issue in my inbox. Locate the email about the 'Invoice mismatch'. I need to handle this immediately. Reply to the sender saying 'Sorry to hear that. I'll have someone review it right away.' To keep everyone in the loop, please BCC jane.smith@acmecorp.com on this response.
Loading timeline...