Posted on 
Nov 18, 2024

Technical Lead / Principal Engineer

Roseland
Engineering
CoreWeave
CoreWeave
CoreWeave
Private
101-250
Software, Security & Developer Tools

CoreWeave is a specialized cloud provider focused on GPU accelerated use cases including VFX, AI/ML, Batch Processing and Real Time Experiences. We support countless AI/ML services in the text to image, NLP and broader AI/ML space, reducing client’s infrastructure management requirements with our Kubernetes based serverless GPU cloud offerings.

Job Description

Position Overview:

CoreWeave is seeking an experienced and visionary Principal Engineer to lead our Hardware Engineering Dev team to build the next generation of scalable infrastructure services. Reporting to the Engineering Manager for Hardware Engineering Dev, you will take a leadership role in developing highly performant, reliable systems that drive CoreWeave's hardware inventory and automation capabilities. You will work closely with cross-functional teams, including product management, architecture, and operations, to ensure our GPU offerings meet the needs of our clients. Your expertise will guide the technical direction of our products and services, which will drive advancements in GPU computing.

The ideal candidate has a strong background in building cloud infrastructure, a deep understanding of Kubernetes and upstream services, and experience building infrastructure services using Golang and Python. They should also be knowledgeable about hardware provisioning and cloud operations.

In this role, you will play a crucial part in designing, architecting, and developing the services that automate and test CoreWeave’s server infrastructure. You will provide technical mentorship while making sound technical decisions. You will build innovative solutions to improve and automate current processes for greater efficiency.

The ideal candidate has a strong background in building cloud infrastructure, a deep understanding of Kubernetes and upstream services, and experience building infrastructure services using Golang and Python. They should also be knowledgeable about hardware provisioning and cloud operations.

In this role, you will play a crucial part in designing, architecting, and developing the services that automate and test CoreWeave’s server infrastructure. You will provide technical mentorship while making sound technical decisions. You will build innovative solutions to improve and automate current processes for greater efficiency.

Key Responsibilities:

  • Lead Architecture & Development: Architect, Design, and develop robust services primarily in Golang, focusing on gRPC and RESTful APIs to support CoreWeave’s hardware inventory and management systems.
  • Technical Leadership: Provide strategic technical leadership, mentoring, and guidance to a high-performing engineering team. Foster a culture of innovation, collaboration, and excellence that inspires your peers and drives company-wide technical achievements.
  • Innovate and Automate: Create new tools and solutions that automate hardware provisioning, testing, and cloud operations processes, driving efficiency and reducing manual overhead.
  • Cross-Functional Collaboration: Work closely with cross-functional teams, including our Fleet Reliability Engineering, Hardware Engineering, Kubernetes engineering, and Data Platform Engineering along with external vendors and upstream open source communities to ensure cohesive and reliable integration across services.
  • Decision-Making and Guidance: Lead decision-making on key technical approaches, considering both current needs and long-term strategic growth for our infrastructure services.
  • Research & Development: Stay on the cutting edge of GPU and cloud technologies. Proactively integrate emerging advancements to ensure our products maintain a competitive edge in a rapidly evolving market.

Qualifications:

  • Education: Bachelor's degree in Computer Science, Engineering, or a related field (Master’s or PhD is a plus), or equivalent industry experience.
  • Proven Experience: 10+ years in software engineering, with 4+ years in a technical lead or senior role, designing and building cloud infrastructure solutions.
  • Technical Expertise:
  • Hardware & Cloud Operations Knowledge: Strong understanding of hardware provisioning, cloud operations, and the unique demands of managing infrastructure at scale.
  • Distributed Databases: Experience with distributed database principles and data schemas at large production scale.
  • API Development: Proficient in building and scaling gRPC and RESTful APIs, with a track record of implementing high-performing and resilient services.
  • Leadership & Communication: Proven ability to lead and influence cross-functional teams, delivering technical direction while ensuring clear and effective communication.
  • Problem-Solving: A natural problem-solver who thrives on tackling complex, large-scale engineering challenges with creativity, persistence, and precision.
  • Growth Mindset: Passionate about learning, experimenting with new technologies, and sharing knowledge with colleagues to elevate the entire engineering team.

Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $185,000-$200,000. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience.

 

Position Overview:

CoreWeave is seeking an experienced and visionary Principal Engineer to lead our Hardware Engineering Dev team to build the next generation of scalable infrastructure services. Reporting to the Engineering Manager for Hardware Engineering Dev, you will take a leadership role in developing highly performant, reliable systems that drive CoreWeave's hardware inventory and automation capabilities. You will work closely with cross-functional teams, including product management, architecture, and operations, to ensure our GPU offerings meet the needs of our clients. Your expertise will guide the technical direction of our products and services, which will drive advancements in GPU computing.

The ideal candidate has a strong background in building cloud infrastructure, a deep understanding of Kubernetes and upstream services, and experience building infrastructure services using Golang and Python. They should also be knowledgeable about hardware provisioning and cloud operations.

In this role, you will play a crucial part in designing, architecting, and developing the services that automate and test CoreWeave’s server infrastructure. You will provide technical mentorship while making sound technical decisions. You will build innovative solutions to improve and automate current processes for greater efficiency.

The ideal candidate has a strong background in building cloud infrastructure, a deep understanding of Kubernetes and upstream services, and experience building infrastructure services using Golang and Python. They should also be knowledgeable about hardware provisioning and cloud operations.

In this role, you will play a crucial part in designing, architecting, and developing the services that automate and test CoreWeave’s server infrastructure. You will provide technical mentorship while making sound technical decisions. You will build innovative solutions to improve and automate current processes for greater efficiency.

Key Responsibilities:

  • Lead Architecture & Development: Architect, Design, and develop robust services primarily in Golang, focusing on gRPC and RESTful APIs to support CoreWeave’s hardware inventory and management systems.
  • Technical Leadership: Provide strategic technical leadership, mentoring, and guidance to a high-performing engineering team. Foster a culture of innovation, collaboration, and excellence that inspires your peers and drives company-wide technical achievements.
  • Innovate and Automate: Create new tools and solutions that automate hardware provisioning, testing, and cloud operations processes, driving efficiency and reducing manual overhead.
  • Cross-Functional Collaboration: Work closely with cross-functional teams, including our Fleet Reliability Engineering, Hardware Engineering, Kubernetes engineering, and Data Platform Engineering along with external vendors and upstream open source communities to ensure cohesive and reliable integration across services. 
  • Decision-Making and Guidance: Lead decision-making on key technical approaches, considering both current needs and long-term strategic growth for our infrastructure services.
  • Research & Development: Stay on the cutting edge of GPU and cloud technologies. Proactively integrate emerging advancements to ensure our products maintain a competitive edge in a rapidly evolving market.

Qualifications:

  • Education: Bachelor's degree in Computer Science, Engineering, or a related field (Master’s or PhD is a plus), or equivalent industry experience. 
  • Proven Experience: 10+ years in software engineering, with 4+ years in a technical lead or senior role, designing and building cloud infrastructure solutions.
  • Technical Expertise
  • Hardware & Cloud Operations Knowledge: Strong understanding of hardware provisioning, cloud operations, and the unique demands of managing infrastructure at scale.
  • Distributed Databases: Experience with distributed database principles and data schemas at large production scale.
  • API Development: Proficient in building and scaling gRPC and RESTful APIs, with a track record of implementing high-performing and resilient services.
  • Leadership & Communication: Proven ability to lead and influence cross-functional teams, delivering technical direction while ensuring clear and effective communication.
  • Problem-Solving: A natural problem-solver who thrives on tackling complex, large-scale engineering challenges with creativity, persistence, and precision.
  • Growth Mindset: Passionate about learning, experimenting with new technologies, and sharing knowledge with colleagues to elevate the entire engineering team.

Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $185,000-$200,000. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience.

 

Receive Tech Ladies'
newest jobs in your inbox,
every week.

Join Tech Ladies for full-access to the job board, member-only events, and more!

If you're already a member, we haven't forgotten you. We promise. It's a new system. If you fill out the form once, it'll remember you going forward. Apologies for the inconvenience.

Roseland
Roseland
No items found.
Engineering
Engineering
In-Person
In-Person