Join Mindrift as a Freelance Agent Evaluation Engineer, where you'll assess AI coding agents in realistic developer environments. You'll create tasks and criteria that challenge these models to perform like real developers.
Innovative and tech-focused, with a strong emphasis on AI and development.
AI coding agents - how well a model handles real-world <b>developer</b> tasks. You'll create challenging tasks and evaluation... criteria within realistic simulated environments: Build realistic <b>developer</b> environments - a virtual company with codebase
You'll be taken to the original listing on jobviewtrack.com to apply.