Principal Infrastructure Engineer
Movable Ink is a software company that provides marketers with technology and expert services to create unique customer experiences.
Job Description
As a Principal Infrastructure Engineer on the Movable Ink DaVinci Platform, you will play a pivotal role in shaping the direction and execution of our infrastructure strategy. You will be responsible for driving key initiatives that enhance the scalability, reliability, and security of our infrastructure platform and the systems and services that run on it. Your leadership and deep technical expertise will be critical in ensuring our infrastructure meets the growing demands of our business and enables the continued improvement of our developer productivity. You will collaborate closely with Engineering Managers, the SRE team, VP of Engineering and other key stakeholders to deliver high-quality, observable, and scalable solutions.
This is a great opportunity to establish the foundational core infrastructure for a very fast growing and fast moving part of Movable Ink.
The role will be reporting to the Site Reliability Engineering Manager.
Responsibilities:
- Set the technical vision for the Da Vinci Platform core infrastructure and lead its roadmap.
- Shape and refine the Incident Management framework for DaVinci in collaboration with the SRE team, ensuring efficient and effective response to incidents.
- Lead the migration of all service infrastructure to Infrastructure as Code (IaC) using Terraform, establishing best practices and ensuring consistent, repeatable deployments.
- Own the Telemetry & Monitoring platforms on DaVinci, ensuring comprehensive observability and proactive issue detection.
- Cloud Cost Optimization: Lead cloud cost optimization efforts on GCP, planning and implementing strategies to maximize value and minimize waste.
- Infrastructure Security Audits: Own infrastructure security audits and compliance, ensuring adherence to industry standards and best practices.
- Global Datacenter Architecture: Own the global datacenter architecture & topology for DaVinci, ensuring a robust and scalable infrastructure.
Qualifications:
- 10+ years of technical hands-on experience in cloud infrastructure, network engineering or software engineering running distributed systems at scale.
- 5+ years of operational experience running distributed systems in GCP.
- Degree in computer science or equivalent experience.
- Proven track record of leading complex, high-impact projects across distributed systems, with a focus on scalability, reliability, and security.
- Extensive hands-on experience with AWS or GCP as an individual contributor, with a strong focus on GCP.
- Deep understanding of modern infrastructure management tools and processes, including Terraform, Kubernetes, and ArgoCD.
- Hands-on experience in operating Kubernetes clusters and Kubernetes based workloads including troubleshooting deployments and GKE configuration.
- Professional experience working with programming languages such as Python, Go, or similar.
- Expertise in managing and optimizing telemetry, monitoring, and secrets management platforms.
- Experience with disaster recovery planning, cloud cost optimization, and network topology design.
- Strong experience in infrastructure security audits, compliance, and global datacenter architecture.
- Excellent problem-solving and communication skills, with the ability to collaborate effectively with cross-functional teams including product managers, engineering leadership, software engineers, and site reliability engineers.
- Experience or exposure to GCP data and AI services such as DataProc, BigQuery and Vertex are a plus.
The base pay range for this position is $220,000 - $240,000 USD/year. The base pay offered may vary depending on job-related knowledge, skills, and experience. Stock options and other incentive pay may be provided as part of the compensation package, in addition to a full range of medical, financial, and/or other benefits, depending on the position ultimately offered.
As a Principal Infrastructure Engineer on the Movable Ink DaVinci Platform, you will play a pivotal role in shaping the direction and execution of our infrastructure strategy. You will be responsible for driving key initiatives that enhance the scalability, reliability, and security of our infrastructure platform and the systems and services that run on it. Your leadership and deep technical expertise will be critical in ensuring our infrastructure meets the growing demands of our business and enables the continued improvement of our developer productivity. You will collaborate closely with Engineering Managers, the SRE team, VP of Engineering and other key stakeholders to deliver high-quality, observable, and scalable solutions.
This is a great opportunity to establish the foundational core infrastructure for a very fast growing and fast moving part of Movable Ink.
The role will be reporting to the Site Reliability Engineering Manager.
Responsibilities:
- Set the technical vision for the Da Vinci Platform core infrastructure and lead its roadmap.
- Shape and refine the Incident Management framework for DaVinci in collaboration with the SRE team, ensuring efficient and effective response to incidents.
- Lead the migration of all service infrastructure to Infrastructure as Code (IaC) using Terraform, establishing best practices and ensuring consistent, repeatable deployments.
- Own the Telemetry & Monitoring platforms on DaVinci, ensuring comprehensive observability and proactive issue detection.
- Cloud Cost Optimization: Lead cloud cost optimization efforts on GCP, planning and implementing strategies to maximize value and minimize waste.
- Infrastructure Security Audits: Own infrastructure security audits and compliance, ensuring adherence to industry standards and best practices.
- Global Datacenter Architecture: Own the global datacenter architecture & topology for DaVinci, ensuring a robust and scalable infrastructure.
Qualifications:
- 10+ years of technical hands-on experience in cloud infrastructure, network engineering or software engineering running distributed systems at scale.
- 5+ years of operational experience running distributed systems in GCP.
- Degree in computer science or equivalent experience.
- Proven track record of leading complex, high-impact projects across distributed systems, with a focus on scalability, reliability, and security.
- Extensive hands-on experience with AWS or GCP as an individual contributor, with a strong focus on GCP.
- Deep understanding of modern infrastructure management tools and processes, including Terraform, Kubernetes, and ArgoCD.
- Hands-on experience in operating Kubernetes clusters and Kubernetes based workloads including troubleshooting deployments and GKE configuration.
- Professional experience working with programming languages such as Python, Go, or similar.
- Expertise in managing and optimizing telemetry, monitoring, and secrets management platforms.
- Experience with disaster recovery planning, cloud cost optimization, and network topology design.
- Strong experience in infrastructure security audits, compliance, and global datacenter architecture.
- Excellent problem-solving and communication skills, with the ability to collaborate effectively with cross-functional teams including product managers, engineering leadership, software engineers, and site reliability engineers.
- Experience or exposure to GCP data and AI services such as DataProc, BigQuery and Vertex are a plus.
The base pay range for this position is $220,000 - $240,000 USD/year. The base pay offered may vary depending on job-related knowledge, skills, and experience. Stock options and other incentive pay may be provided as part of the compensation package, in addition to a full range of medical, financial, and/or other benefits, depending on the position ultimately offered.