As an AI Inference Optimization Engineer, you'll focus on building and enhancing AI model-serving pipelines. This role is perfect for someone with a solid understanding of AI technologies and a passion for optimizing performance.
Anicalls is looking for an AI Inference Optimization Engineer to join their team in Johannesburg. In this role, you will design and build high-performance pipelines for serving AI models in enterprise applications. Your work will involve collaborating with business, data, and engineering teams to create scalable and efficient AI solutions that meet the needs of the organization.
On a daily basis, you will be responsible for developing AI inference and model-serving pipelines, ensuring they are optimized for performance and resource utilization. You will also implement monitoring and alerting systems to maintain the reliability of AI services. Security and governance are key aspects of this role, so you will need to ensure that all standards are adhered to throughout the AI lifecycle.
This position is ideal for someone with a minimum of 1-2 years of hands-on experience in AI inference, model serving, or MLOps. A strong understanding of scalable AI serving architectures is essential. If you are passionate about AI and enjoy working in a collaborative environment, this role could be a great fit for you.
Key responsibilities include: • Designing and developing high-performance AI inference pipelines • Building scalable AI serving infrastructure • Implementing observability and monitoring solutions • Ensuring compliance with security and governance standards
If you are ready to take on the challenge of optimizing AI solutions in a dynamic setting, we encourage you to apply for this exciting opportunity.
You'll be taken to the original listing on PNet to apply.