Join Speechify as a Software Engineer focused on data infrastructure and acquisition. You'll work with a talented team to enhance data collection processes that support AI model training, making reading more accessible for millions.
Speechify is on a mission to eliminate reading barriers for learners worldwide. Its text-to-speech products, used by over 50 million people, turn various reading materials into audio to improve comprehension and retention. The company is fully remote, with a diverse team of experts from leading tech firms and academic institutions.
As a Software Engineer on the Data team, you will focus on enhancing the data collection processes that support AI model training. You will be responsible for finding new audio data sources and integrating them into the ingestion pipeline. Your work will involve operating and extending cloud infrastructure on Google Cloud Platform (GCP) and collaborating with AI scientists to improve data quality and efficiency.
Key responsibilities include:
• Identifying and sourcing audio data for ingestion
• Managing and optimizing cloud infrastructure
• Collaborating with the AI team to develop a dataset roadmap
• Adapting to changing priorities and managing multiple tasks
The ideal candidate will have a strong background in software development, with at least 5 years of experience. Proficiency in bash and Python scripting, along with experience in Docker and cloud services, is essential. Familiarity with web crawlers and large-scale data processing is a plus. Strong communication skills are also important, as you'll be working closely with various team members to achieve common goals.
This role is perfect for someone who thrives in a fast-paced, entrepreneurial environment and is passionate about making a positive impact through technology. If you're ready to contribute to a transformative product that supports individuals with learning differences, Speechify could be the right fit for you.