Senior Site Reliability Engineer - Hybrid
Boomi is the platform for intelligent connectivity and automation. Connect everyone to everything, anywhere.
Job Description
How You’ll Make An Impact
=========================
As a Senior Site Reliability Engineer, you will be responsible for developing sophisticated systems and software based on the customer’s business goals, needs and general business environment. You will work with product management, other engineering teams, customer success and support on developing cutting edge new product features and enhancements across various areas of Boomi offerings.
What You’ll Do
--------------
- Participate actively in detecting, remediating and reporting on Production incidents, ensuring the SLAs/ SLOs are defined and met.
- Participate in on-call rotation to ensure coverage for planned/unplanned events.
- Engage with other Engineering organizations to implement processes, identify improvements, and drive consistent results.
- Working with your SRE and Engineering counterparts for driving DR exercises, Game days, training and other response readiness efforts.
- Collaborate with Service Engineering organizations to build and automate tooling, implement best practices on Observability and manage the Boomi services in production and consistently achieve our market leading SLA.
- Improving the scalability and reliability of Boomi’s systems in production.
- Automate the provisioning and maintenance of Boomi’s infrastructure.
- Work independently with a minimal level of guidance from technical leadership.
- Mentor other Boomi engineers, including design collaboration and code reviews.
The Experience You Bring
------------------------
- Passionate about SRE, DevOps, Automation and infrastructure platforms. Expert in developing Ansible playbooks and automation for Infrastructure as code using Terraform and Cloud Formation Templates.
- Expert in defining, measuring, and improving Reliability Metrics (SLO/SLI/ Error budgets).
- Strong in implementing observability practices (Monitoring, Logging, Distributed Tracing etc.) preferably using Splunk and New Relic. Experience should not be limited to using the dashboards, but creating them from scratch.
- Experience in conducting and automating DR exercise in AWS cloud thus validating RPOs and RTOs.
- Strong understanding and working experience with AWS components.
- Ability to design and implement API’s for use by internal teams.
Bonus Points If You Have
- 5–8 years of related experience in the software engineering industry, with experience supporting large scale software systems in production.
- Certified in Cloud (AWS/Azure/GCP), experience in using services such as computers, containers and databases.
- Experience in Ansible/Terraform and Python.
- A grasp of Cloud Native concepts, containerization best practices and security awareness in Cloud will be a strong plus.
- Experience in Observability, creating dashboards for SLA/SLI/SLO.
Location
Conshohocken, PA - Hybrid
Aren’t sure if you’re a match? We know that impostor syndrome and the confidence gap can prevent us from meeting spectacular candidates — so don’t hesitate to apply; you could be the perfect fit!
Compensation and Benefits
Boomi is committed to fair and equitable compensation practices. An overview of our benefits can be found here.
#LI-ES1
About Boomi and What Makes Us Special
Are you ready to work at a fast-growing company where you can make a difference? Boomi aims to make the world a better place by connecting everyone to everything, anywhere. Our award-winning, intelligent integration and automation platform helps organizations power the future of business. At Boomi, you’ll work with world-class people and industry-leading technology. We hire trailblazers with an entrepreneurial spirit who can solve challenging problems, make a real impact, and want to be part of building something big. If this sounds like a good fit for you, check out boomi.com or visit our Boomi Careers page to learn more.
How You’ll Make An Impact
As a Senior Site Reliability Engineer, you will be responsible for developing sophisticated systems and software based on the customer’s business goals, needs and general business environment. You will work with product management, other engineering teams, customer success and support on developing cutting edge new product features and enhancements across various areas of Boomi offerings.
What You’ll Do
- Participate actively in detecting, remediating and reporting on Production incidents, ensuring the SLAs/ SLOs are defined and met.
- Participate in on-call rotation to ensure coverage for planned/unplanned events.
- Engage with other Engineering organizations to implement processes, identify improvements, and drive consistent results.
- Working with your SRE and Engineering counterparts for driving DR exercises, Game days, training and other response readiness efforts.
- Collaborate with Service Engineering organizations to build and automate tooling, implement best practices on Observability and manage the Boomi services in production and consistently achieve our market leading SLA.
- Improving the scalability and reliability of Boomi’s systems in production.
- Automate the provisioning and maintenance of Boomi’s infrastructure.
- Work independently with a minimal level of guidance from technical leadership.
- Mentor other Boomi engineers, including design collaboration and code reviews.
The Experience You Bring
- Passionate about SRE, DevOps, Automation and infrastructure platforms. Expert in developing Ansible playbooks and automation for Infrastructure as code using Terraform and Cloud Formation Templates.
- Expert in defining, measuring, and improving Reliability Metrics (SLO/SLI/ Error budgets).
- Strong in implementing observability practices (Monitoring, Logging, Distributed Tracing etc.) preferably using Splunk and New Relic. Experience should not be limited to using the dashboards, but creating them from scratch.
- Experience in conducting and automating DR exercise in AWS cloud thus validating RPOs and RTOs.
- Strong understanding and working experience with AWS components.
- Ability to design and implement API’s for use by internal teams.
Bonus Points If You Have
- 5–8 years of related experience in the software engineering industry, with experience supporting large scale software systems in production.
- Certified in Cloud (AWS/Azure/GCP), experience in using services such as computers, containers and databases.
- Experience in Ansible/Terraform and Python.
- A grasp of Cloud Native concepts, containerization best practices and security awareness in Cloud will be a strong plus.
- Experience in Observability, creating dashboards for SLA/SLI/SLO.
Location
Conshohocken, PA - Hybrid
Aren’t sure if you’re a match? We know that impostor syndrome and the confidence gap can prevent us from meeting spectacular candidates — so don’t hesitate to apply; you could be the perfect fit!
Compensation and Benefits
Boomi is committed to fair and equitable compensation practices. An overview of our benefits can be found here.
#LI-ES1
Be Bold. Be You. Be Boomi. We take pride in our culture and core values and are committed to being a place where everyone can be their true, authentic self. Our team members are our most valuable resources, and we look for and encourage diversity in backgrounds, thoughts, life experiences, knowledge, and capabilities.
All employment decisions are based on business needs, job requirements, and individual qualifications.
Boomi strives to create an inclusive and accessible environment for candidates and employees. If you need accommodation during the application or interview process, please submit a request to talent@boomi.com. This inbox is strictly for accommodations, please do not send resumes or general inquiries.