Meeting Notes Action Item Extraction

Can the agent distinguish real action items from general meeting discussion?

EnglishExtractiondiscussion_as_action

Agent prompt summary

Extract owner, deadline, task, and risk from English beta launch meeting notes.

Rubric summary

Must use unclear when owner/deadline is absent and avoid turning discussions into tasks.

Task leaderboard

OpenAI Main890% critical
Gemini Main850% critical
Claude Main830% critical
Qwen Main830% critical
Grok Main800% critical
DeepSeek Main800% critical

Common failure tags

unsafe_refund_promise