As an AI Inference Optimization Engineer, you'll focus on building and optimizing AI model-serving pipelines. This role is perfect for someone with a solid understanding of AI technologies and a passion for improving performance and efficiency.
Anicalls is looking for an AI Inference Optimization Engineer to enhance their AI capabilities by designing and building high-performance model-serving pipelines. In this role, you will collaborate with various teams, including business, data, and engineering, to create scalable and secure AI solutions that meet enterprise needs. Your focus will be on optimizing inference performance and resource utilization while ensuring deployment efficiency.
Your daily responsibilities will include: • Designing and developing AI inference and model-serving pipelines • Building scalable and reliable AI serving infrastructure • Implementing observability, monitoring, and alerting solutions for AI services • Ensuring adherence to security, reliability, and governance standards throughout the AI lifecycle
The ideal candidate should have a minimum of 1-2 years of hands-on experience in AI inference, model serving, or MLOps. A strong understanding of scalable AI serving architectures is essential. This role suits someone who is detail-oriented, enjoys problem-solving, and is eager to work in a collaborative environment focused on innovation and technology.
You'll be taken to the original listing on PNet to apply.