Posted on 
Sep 26, 2024

Senior Infrastructure Engineer, Metal Dev

Roseland
Mid-Senior ICs
Engineering, IT
CoreWeave
CoreWeave
CoreWeave
Private
101-250
Software, Security & Developer Tools

CoreWeave is a specialized cloud provider focused on GPU accelerated use cases including VFX, AI/ML, Batch Processing and Real Time Experiences. We support countless AI/ML services in the text to image, NLP and broader AI/ML space, reducing client’s infrastructure management requirements with our Kubernetes based serverless GPU cloud offerings.

Job Description

About this Role:

CoreWeave is seeking a highly skilled and motivated Infrastructure Engineer to join our Hardware Engineering Development team (METALDEV), reporting to the Director of Compute Architecture. In this role, you will play a crucial part in the development of the services that automate and test our server infrastructure. You will collaborate closely with cross-functional teams, external vendors, and stakeholders to ensure the successful delivery of highly performant and reliable hardware solutions.

Responsibilities:

  • Develop and maintain Go and Python server management services
  • Collaborate with upstream communities, including Go and Redfish based projects
  • Document hardware automation workflows and processes
  • Create CI/CD pipelines for server hardware compliance tests
  • Develop and maintain hardware/firmware management services
  • Automate all aspects of the server hardware lifecycle
  • Serve as the senior point of contact for hardware escalation and troubleshooting
  • Collaborate with cross-functional teams to define hardware requirements, specifications, and system architecture
  • Create and maintain accurate documentation of hardware designs, specifications, test procedures, and results
  • Analyze and optimize the performance of hardware systems, identify bottlenecks, and propose improvements for enhanced efficiency
  • Establish processes for internal hardware testing, deployment, and performance optimization

Requirements:

  • Must have at least 5 years of profession experience:
  • Proficiency with Go and Python
  • Previous experience deploying containerized applications using Kubernetes
  • Excellent documentation skills and attention to detail
  • Strong analytical and problem-solving abilities

Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $175,000 - $210,000. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience.

About this Role:

CoreWeave is seeking a highly skilled and motivated Infrastructure Engineer to join our Hardware Engineering Development team (METALDEV), reporting to the Director of Compute Architecture. In this role, you will play a crucial part in the development of the services that automate and test our server infrastructure. You will collaborate closely with cross-functional teams, external vendors, and stakeholders to ensure the successful delivery of highly performant and reliable hardware solutions.

Responsibilities:

  • Develop and maintain Go and Python server management services
  • Collaborate with upstream communities, including Go and Redfish based projects
  • Document hardware automation workflows and processes
  • Create CI/CD pipelines for server hardware compliance tests
  • Develop and maintain hardware/firmware management services
  • Automate all aspects of the server hardware lifecycle
  • Serve as the senior point of contact for hardware escalation and troubleshooting
  • Collaborate with cross-functional teams to define hardware requirements, specifications, and system architecture
  • Create and maintain accurate documentation of hardware designs, specifications, test procedures, and results
  • Analyze and optimize the performance of hardware systems, identify bottlenecks, and propose improvements for enhanced efficiency
  • Establish processes for internal hardware testing, deployment, and performance optimization

Requirements:

  • Must have at least 5 years of profession experience:
  • Proficiency with Go and Python
  • Previous experience deploying containerized applications using Kubernetes
  • Excellent documentation skills and attention to detail
  • Strong analytical and problem-solving abilities

Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $175,000 - $210,000. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience.

Receive Tech Ladies'
newest jobs in your inbox,
every week.

Join Tech Ladies for full-access to the job board, member-only events, and more!

If you're already a member, we haven't forgotten you. We promise. It's a new system. If you fill out the form once, it'll remember you going forward. Apologies for the inconvenience.

Roseland
Roseland
No items found.
Engineering
Engineering
IT
IT
In-Person
In-Person