Join Mindrift as a Freelance Agent Evaluation Engineer, where you'll assess how AI coding agents perform real-world developer tasks. You'll create challenging scenarios and build realistic developer environments to test these models.
AI coding agents - how well a model handles real-world developer tasks. You'll create challenging tasks and evaluation... criteria within realistic simulated environments: Build realistic developer environments - a virtual company with codebase