Data Scientist Job at Mercor, Remote

  • Mercor
  • Remote

Job Description

This description summarizes our understanding of the original job posting.

Role Description

We're seeking a data-driven analyst to conduct comprehensive failure analysis on AI agent performance across finance-sector tasks. You'll identify patterns, root causes, and systemic issues in our evaluation framework by analyzing task performance across multiple dimensions (task types, file types, criteria, etc.).

  • Statistical Failure Analysis: Identify patterns in AI agent failures across task components (prompts, rubrics, templates, file types, tags)
  • Root Cause Analysis: Determine whether failures stem from task design, rubric clarity, file complexity, or agent limitations
  • Dimension Analysis: Analyze performance variations across finance sub-domains, file types, and task categories
  • Reporting & Visualization: Create dashboards and reports highlighting failure clusters, edge cases, and improvement opportunities
  • Quality Framework: Recommend improvements to task design, rubric structure, and evaluation criteria based on statistical findings
  • Stakeholder Communication: Present insights to data labeling experts and technical teams

Qualifications

  • Statistical Expertise: Strong foundation in statistical analysis, hypothesis testing, and pattern recognition
  • Programming: Proficiency in Python (pandas, scipy, matplotlib/seaborn) or R for data analysis
  • Data Analysis: Experience with exploratory data analysis and creating actionable insights from complex datasets
  • AI/ML Familiarity: Understanding of LLM evaluation methods and quality metrics
  • Tools: Comfortable working with Excel, data visualization tools (Tableau/Looker), and SQL

Requirements

  • Experience with AI/ML model evaluation or quality assurance
  • Background in finance or willingness to learn finance domain concepts
  • Experience with multi-dimensional failure analysis
  • Familiarity with benchmark datasets and evaluation frameworks
  • 2-4 years of relevant experience

Job Tags

Remote job
