YO IT Consulting

PhD Rater - Remote

Finland · Full-time · Not Applicable

Seeking experienced researchers and technical experts to support a frontier-model evaluation project focused on agentic workflows. You will design and validate challenging benchmark tasks in data science, machine learning, finance, and coding to help identify reasoning and problem-solving gaps in advanced STEM models. The role involves building real-world tasks with executable tests and analyzing model or agent behavior.

Key Responsibilities

  • Design challenging, real-world STEM problems
  • Implement each task within an agentic development environment using Python

Contract and Payment Terms

  • You will be engaged as an independent contractor.
  • This is a fully remote role that can be completed on your own schedule.
  • Projects can be extended, shortened, or concluded early depending on needs and performance.
  • Payments are made weekly via Stripe or Wise, based on services rendered.

Key Skills

  • Machine learning

Posted: Mar 31, 2026
Type: Full-time
Level: Not Applicable
Location: Finland

Industries

Software Development

Categories

Research Analyst, Information Technology
