Senior Site Reliability Engineer - Enterprise Technology

At Hudson River Trading (HRT) we are mathematicians, computer scientists, statisticians, physicists and engineers. We research and develop automated trading algorithms using advanced mathematical techniques. We have built one of the world's most sophisticated computing environments, and our researchers are at the forefront of innovation in the world of algorithmic trading.
Job Description
Hudson River Trading (HRT) is looking for a Senior Site Reliability Engineer to join our growing Enterprise Technology group. The SRE team sits within Enterprise Technology and is responsible for operating and optimizing corporate productivity & collaboration infrastructure for the entire firm, both on-prem and in the cloud.
As one of Enterprise Technology’s first SREs, you will help to establish and grow our site reliability engineering practice in addition to ensuring the availability and reliability of systems within our stack.
This role requires a deep Linux operating system and application administration skill set, proficiency in Python, and solid experience with configuration management/IaC. Successful candidates should also have exceptional organizational, communication, and project management skills, as well as the ability to troubleshoot complex technical issues.
Responsibilities
- Manage on-premise containerized web services, and a multitude of bridge services, integrations and batch processes that interconnect the elements of our productivity ecosystem
- Proactively eliminate sources of operational work. Engineering not firefighting
- Automate and troubleshoot a broad range of technical infrastructure both on-prem and in the cloud
- Develop and implement monitoring solutions to ensure high system uptime and reliability
- Enable transparency and high development velocity within the firm while maintaining a high bar for security. Find ways to reduce user friction, and make sure HRTers have access to the tools and data they need when they need it
- Break down complexity, iterate, and communicate progress to a wide variety of leads and stakeholders
Qualifications
- 5+ years of experience in site reliability engineering or related disciplines
- Proficiency with Python
- Experience managing and monitoring containerized infrastructure
- Experience working with CI/CD tools such as Jenkins, GitHub Actions, or ArgoCD
- Expert experience with IaC and configuration management tools such as Terraform, SaltStack, Chef, Puppet, or Ansible
Annual base salary range of $150,000 to $250,000. Pay (base and bonus) may vary depending on job-related skills and experience. A sign-on and discretionary performance bonus may be provided as part of the total compensation package, in addition to company-paid medical and/or other benefits.
Hudson River Trading (HRT) is looking for a Senior Site Reliability Engineer to join our growing Enterprise Technology group. The SRE team sits within Enterprise Technology and is responsible for operating and optimizing corporate productivity & collaboration infrastructure for the entire firm, both on-prem and in the cloud.
As one of Enterprise Technology’s first SREs, you will help to establish and grow our site reliability engineering practice in addition to ensuring the availability and reliability of systems within our stack.
This role requires a deep Linux operating system and application administration skill set, proficiency in Python, and solid experience with configuration management/IaC. Successful candidates should also have exceptional organizational, communication, and project management skills, as well as the ability to troubleshoot complex technical issues.
Responsibilities
- Manage on-premise containerized web services, and a multitude of bridge services, integrations and batch processes that interconnect the elements of our productivity ecosystem
- Proactively eliminate sources of operational work. Engineering not firefighting
- Automate and troubleshoot a broad range of technical infrastructure both on-prem and in the cloud
- Develop and implement monitoring solutions to ensure high system uptime and reliability
- Enable transparency and high development velocity within the firm while maintaining a high bar for security. Find ways to reduce user friction, and make sure HRTers have access to the tools and data they need when they need it
- Break down complexity, iterate, and communicate progress to a wide variety of leads and stakeholders
Qualifications
- 5+ years of experience in site reliability engineering or related disciplines
- Proficiency with Python
- Experience managing and monitoring containerized infrastructure
- Experience working with CI/CD tools such as Jenkins, GitHub Actions, or ArgoCD
- Expert experience with IaC and configuration management tools such as Terraform, SaltStack, Chef, Puppet, or Ansible
Annual base salary range of $150,000 to $250,000. Pay (base and bonus) may vary depending on job-related skills and experience. A sign-on and discretionary performance bonus may be provided as part of the total compensation package, in addition to company-paid medical and/or other benefits.