**About The Job**

**Mercor** connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include **Benchmark**, **General Catalyst**, **Peter Thiel**, **Adam D'Angelo**, **Larry Summers**, and **Jack Dorsey**.
**Position:** Mathematics AI Evaluator
**Type:** **Full-time or Part-time Contract Work**
**Compensation:** **$73/hour**
**Location:** **USA, UK, Canada, EU**
**Role Responsibilities**
• Write and refine prompts to guide model behavior in mathematical contexts.
• Evaluate LLM-generated responses to mathematics-related queries for correctness, rigor, and logical coherence.
• Verify mathematical claims, derivations, and proofs using domain expertise.
• Conduct fact-checking using authoritative public sources and domain knowledge.
• Annotate model responses by identifying strengths, areas of improvement, and factual or conceptual inaccuracies.
• Ensure model responses align with expected conversational behavior and system guidelines.
**Qualifications**
**Must-Have**
• PhD in Mathematics or a closely related field.
• Demonstrated experience in Probability & Statistics.
• Significant experience using large language models (LLMs).
• Excellent writing skills for explaining complex mathematical concepts.
• Strong attention to detail with the ability to notice subtle issues.
• Experience reviewing or editing technical or academic writing.
**Preferred**
• Prior experience with RLHF, model evaluation, or data annotation work.
• Experience teaching, mentoring, or explaining mathematical concepts to non-expert audiences.
• Familiarity with evaluation rubrics, benchmarks, or structured review frameworks.
**Application Process (Takes 20–30 mins to complete)**
• Upload resume
• AI interview based on your resume
• Submit form
**Resources & Support**
• For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome/welcome
• For any help or support, reach out to: support@mercor.com
*PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.*