Senior Distributed Systems Engineer (Machine Learning)
Movable Ink is a software company that provides marketers with technology and expert services to create unique customer experiences.
Job Description
As Senior Distributed Systems Engineer (Machine Learning), you will play an instrumental role in advancing the services and systems comprising our Machine Learning Platform. Your technical skills will be leveraged to drive the development and deployment of high quality, scalable AI solutions. This is an opportunity to work end-to-end on a large-scale machine-learning system that touches millions of customers, and a chance to continuously learn and help improve our solution as the field evolves.
This role will be reporting to the Director of Engineering (AI).
Responsibilities
- Develop scalable, highly available, and fault tolerant services powering our Machine Learning platform according to industry standards for performance, monitoring, orchestration, and testing.
- Design, deploy, and maintain Machine Learning services such as a feature store, experimentation platform, model endpoint management/blue-green deployment, vector databases, etc.
- Collaborate with product and engineering stakeholders to empathetically understand and define requirements for complex systems, and develop complex projects from conception into rigorous technical specifications with a clear path to production.
- Minimize risk across platform/system deployments, features, and processes.
- Foster close collaboration with AI research teams to ensure that their innovations are effectively integrated into the product development process.
- Build systems that deliver measurable and tangible business value.
Requirements
- 4+ years software engineering experience
- Experience architecting, building, and maintaining production distributed systems at scale
- Exemplary software engineering skills (design, unit testing, git, code review, CI/CD)
- Proficiency with Python
- Experience with large-scale data processing frameworks (we use SQL, PySpark, Kafka)
- Experience with cloud computing platforms (we use Google Cloud Platform (GCP))
- Experience with modern cloud technologies (we use Kubernetes, Terraform, etc)
- Experience implementing performant microservices (we use gRPC)
- Proficient in database management, including designing database schema, crafting efficient queries, performing basic DBA tasks, and knowledgeable regarding common databases relevant to Python development
- Enjoys collaborating with AI researchers, product managers, and other engineering teams
- A desire to always be learning and contributing to a collaborative environment
As Senior Distributed Systems Engineer (Machine Learning), you will play an instrumental role in advancing the services and systems comprising our Machine Learning Platform. Your technical skills will be leveraged to drive the development and deployment of high quality, scalable AI solutions. This is an opportunity to work end-to-end on a large-scale machine-learning system that touches millions of customers, and a chance to continuously learn and help improve our solution as the field evolves.
This role will be reporting to the Director of Engineering (AI).
Responsibilities
- Develop scalable, highly available, and fault tolerant services powering our Machine Learning platform according to industry standards for performance, monitoring, orchestration, and testing.
- Design, deploy, and maintain Machine Learning services such as a feature store, experimentation platform, model endpoint management/blue-green deployment, vector databases, etc.
- Collaborate with product and engineering stakeholders to empathetically understand and define requirements for complex systems, and develop complex projects from conception into rigorous technical specifications with a clear path to production.
- Minimize risk across platform/system deployments, features, and processes.
- Foster close collaboration with AI research teams to ensure that their innovations are effectively integrated into the product development process.
- Build systems that deliver measurable and tangible business value.
Requirements
- 4+ years software engineering experience
- Experience architecting, building, and maintaining production distributed systems at scale
- Exemplary software engineering skills (design, unit testing, git, code review, CI/CD)
- Proficiency with Python
- Experience with large-scale data processing frameworks (we use SQL, PySpark, Kafka)
- Experience with cloud computing platforms (we use Google Cloud Platform (GCP))
- Experience with modern cloud technologies (we use Kubernetes, Terraform, etc)
- Experience implementing performant microservices (we use gRPC)
- Proficient in database management, including designing database schema, crafting efficient queries, performing basic DBA tasks, and knowledgeable regarding common databases relevant to Python development
- Enjoys collaborating with AI researchers, product managers, and other engineering teams
- A desire to always be learning and contributing to a collaborative environment