We are building a benchmark dataset to evaluate AI models on professional document understanding and instruction following within the Finance & Banking domain.
Tasks consist of complex, multi-step requests grounded in real-world workspace files (financial statements, reports, spreadsheets), web search, and code execution — each paired with a clearly defined ground truth output and an objective evaluation rubric. You will be responsible for authoring tasks that test an AI's ability to reason over financial documents, follow precise instructions, and produce accurate, structured outputs.
We expect a minimum commitment of 15–20 hours per week.
Ideal candidates have 3+ years of hands-on experience in one or more of the following sub-domains:
Investment & financial analysis
Financial management
Personal financial advisory
Banking & capital markets
Corporate finance & accounting
Listing sourced from Mercor. Annotation Academy is independent of these platforms and does not guarantee work or pay. See our disclosures.