-
Crossing Hurdles

Technical Expert | Remote

Crossing Hurdles
Canada · Contract · Associate

Position: PhD Rater

Type: Part-Time

Compensation: $50–$100/hour

Location: Remote

Commitment: 30+ hours/week (primarily weekdays)


Role Responsibilities

  • Design challenging, real-world STEM benchmark problems in domains such as data science, machine learning, finance, and software engineering.
  • Implement tasks within an agentic development environment using Python.
  • Create reproducible problem setups with clear specifications and executable tests.
  • Evaluate and analyze AI model behavior, including reasoning traces and agent workflows.
  • Diagnose reasoning failures, logic gaps, and problem-solving limitations in AI systems.
  • Contribute to improving benchmark quality and evaluation frameworks for frontier AI models.


Requirements

  • Active or recently graduated PhD.
  • Deep expertise in data science, machine learning, finance, and/or Python-based software development.
  • Strong research background in advanced STEM topics.
  • Ability to commit reliably for 30+ hours per week.
  • Demonstrated technical output such as high-quality open-source contributions or research work.
  • Ability to analyze agent behavior traces and diagnose failures beyond surface-level errors.


Application Process

  • Upload resume
  • Interview
  • Submit form

Key Skills

Ranked by relevance

ai machine learning python
Login to Apply
Posted
Mar 08, 2026
Type
Contract
Level
Associate
Location
Canada

Industries

Higher Education Research Services

Categories

Information Technology Research

Related Jobs

3 roles aligned with this opportunity

View all jobs
View Job Details
CIBC
Related

Consultant, Data and AI Analytics

2026-06-06

Full-time
Not Applicable
Canada
Banking
Information Technology
View Job Details
University of Helsinki
Related

Doctoral Researcher in Algorithmic Bioinformatics

2026-06-04

Full-time
Not Applicable
Finland
Higher Education
Research
View Job Details
EPFL
Related

Postdoctoral Researcher in Multimodal Human Sensing and Advanced Behavioral Data Analysis

2026-05-26

Full-time
Not Applicable
Switzerland
Higher Education
Research