As a Freelance Agent Evaluation Engineer at Mindrift, you'll focus on assessing AI coding agents by creating realistic developer tasks and environments. This role blends creativity with technical skills to simulate real-world coding challenges.
Innovative and tech-focused, with a strong emphasis on AI and development.
AI coding agents - how well a model handles real-world developer tasks. You'll create challenging tasks and evaluation... criteria within realistic simulated environments: Build realistic developer environments - a virtual company with codebase