Posted on 
Nov 8, 2024

Principal Engineer

Roseland
Mid-Senior ICs
Engineering
CoreWeave
CoreWeave
CoreWeave
Private
101-250
Software, Security & Developer Tools

CoreWeave is a specialized cloud provider focused on GPU accelerated use cases including VFX, AI/ML, Batch Processing and Real Time Experiences. We support countless AI/ML services in the text to image, NLP and broader AI/ML space, reducing client’s infrastructure management requirements with our Kubernetes based serverless GPU cloud offerings.

Job Description

CoreWeave is the AI Hyperscaler™, delivering a cloud platform of cutting edge services powering the next wave of AI. The company’s technology provides enterprises and leading AI labs with the most performant, efficient and resilient solutions for accelerated computing. Since 2017, CoreWeave has operated a growing footprint of data centers covering every region of the US and across Europe. CoreWeave was ranked as one of the TIME100 most influential companies of 2024.

As the leader in the industry, we thrive in an environment where adaptability and resilience are key. Our culture offers career-defining opportunities for those who excel amid change and challenge. If you’re someone who thrives in a dynamic environment, enjoys solving complex problems, and is eager to make a significant impact, CoreWeave is the place for you. Join us, and be part of a team solving some of the most exciting challenges in the industry. 

As a Principal Engineer at CoreWeave, you will be responsible for leading technical strategy, architectural decisions, and the development of advanced features that power our GPU-accelerated cloud services. You will work closely with senior leadership, engineering teams, and product teams to define and execute on the vision for CoreWeave's next-generation infrastructure. This is an opportunity to have a direct and significant impact on the architecture of a high-growth, cutting-edge cloud platform used by leading companies in machine learning, VFX, and other compute-intensive industries.

What You'll Do:

  • Architect and Design: Lead the design and architecture of scalable, GPU-accelerated cloud solutions. Work with cross-functional teams to translate business requirements into technical solutions.
  • Innovate and Optimize: Drive technical innovation by identifying and implementing new technologies and methodologies that will enhance CoreWeave’s platform. Continuously optimizing for scalability.
  • Technical Leadership: Provide mentorship and guidance to engineering teams, setting high standards for code quality, system design, and operational excellence. Lead code reviews, foster collaboration, and encourage best practices across teams.
  • Cross-Team Collaboration: Work closely with stakeholders across Infrastructure, Security, Product, and Operation teams to ensure seamless integration of new features and services into the CoreWeave platform. Contribute to strategic decisions that influence the company's technical direction.
  • Solve Complex Problems: Solve complex technical challenges related to distributed systems, cloud infrastructure, GPU workloads, Kubernetes, networking, security, and hardware automation.
  • Thought Leadership: Stay ahead of industry trends in cloud computing, AI, Kubernetes, and other relevant domains. Represent CoreWeave as a thought leader at conferences, industry events, and in technical communities.

Who You Are:

  • Experience: 10+ years of experience in engineering, with at least 5+ years in leadership.
  • Strong Technical Background: Expertise in designing and building large-scale distributed systems, with deep knowledge of containerization, and orchestration.
  • Programming Skills: Proficiency in languages such as Go, Python, C++, Rust or similar.
  • Architecture & Scalability: Deep understanding of cloud-native architectures, and the challenges of scaling compute-intensive infrastructure.
  • Leadership & Mentorship: Proven ability to lead teams involving multiple projects, making high-level technical decisions. Experience mentoring and growing individuals across all skill levels.
  • Problem Solving & Innovation: A track record of initiating change by solving complex, high-impact engineering problems. Driving innovation in cloud infrastructure or other relevant technologies.
  • Communication: Demonstrated experience engaging with stakeholders at all levels of the organization. Experience presenting technical ideas to non-technical audiences.

Nice to Have:

  • Experience with the Kubernetes ecosystem, including custom controllers, control planes, and operator development.
  • Experience with operational excellence and scalability, removing engineering friction and enhancing developer productivity across all layers of infrastructure.
  • Build and operate resilient, high-performance computing networks for AI workloads.
  • Design, develop, and maintain scalable applications and APIs.
  • Experience with cloud based storage solutions.
  • Building, designing, operating large scale server fleet automation and systems.
  • Experience securing internal and external infrastructure while continuously assessing and improving security measures across CoreWeave’s services.
  • Experience with datacenter hardware and technologies.
  • Problem-solving with a passion for innovation and collaboration.
  • Work in a fast-paced environment, balancing multiple priorities.

Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $275,000-$330,000. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience.

 

CoreWeave is the AI Hyperscaler™, delivering a cloud platform of cutting edge services powering the next wave of AI. The company’s technology provides enterprises and leading AI labs with the most performant, efficient and resilient solutions for accelerated computing. Since 2017, CoreWeave has operated a growing footprint of data centers covering every region of the US and across Europe. CoreWeave was ranked as one of the TIME100 most influential companies of 2024.

As the leader in the industry, we thrive in an environment where adaptability and resilience are key. Our culture offers career-defining opportunities for those who excel amid change and challenge. If you’re someone who thrives in a dynamic environment, enjoys solving complex problems, and is eager to make a significant impact, CoreWeave is the place for you. Join us, and be part of a team solving some of the most exciting challenges in the industry. 

As a Principal Engineer at CoreWeave, you will be responsible for leading technical strategy, architectural decisions, and the development of advanced features that power our GPU-accelerated cloud services. You will work closely with senior leadership, engineering teams, and product teams to define and execute on the vision for CoreWeave's next-generation infrastructure. This is an opportunity to have a direct and significant impact on the architecture of a high-growth, cutting-edge cloud platform used by leading companies in machine learning, VFX, and other compute-intensive industries.

What You'll Do:

  • Architect and Design: Lead the design and architecture of scalable, GPU-accelerated cloud solutions. Work with cross-functional teams to translate business requirements into technical solutions.
  • Innovate and Optimize: Drive technical innovation by identifying and implementing new technologies and methodologies that will enhance CoreWeave’s platform. Continuously optimizing for scalability. 
  • Technical Leadership: Provide mentorship and guidance to engineering teams, setting high standards for code quality, system design, and operational excellence. Lead code reviews, foster collaboration, and encourage best practices across teams.
  • Cross-Team Collaboration: Work closely with stakeholders across Infrastructure, Security, Product, and Operation teams to ensure seamless integration of new features and services into the CoreWeave platform. Contribute to strategic decisions that influence the company's technical direction.
  • Solve Complex Problems: Solve complex technical challenges related to distributed systems, cloud infrastructure, GPU workloads, Kubernetes, networking, security, and hardware automation.
  • Thought Leadership: Stay ahead of industry trends in cloud computing, AI, Kubernetes, and other relevant domains. Represent CoreWeave as a thought leader at conferences, industry events, and in technical communities.

Who You Are:

  • Experience: 10+ years of experience in engineering, with at least 5+ years in leadership. 
  • Strong Technical Background: Expertise in designing and building large-scale distributed systems, with deep knowledge of containerization, and orchestration.
  • Programming Skills: Proficiency in languages such as Go, Python, C++, Rust or similar. 
  • Architecture & Scalability: Deep understanding of cloud-native architectures, and the challenges of scaling compute-intensive infrastructure.
  • Leadership & Mentorship: Proven ability to lead teams involving multiple projects, making high-level technical decisions. Experience mentoring and growing individuals across all skill levels.
  • Problem Solving & Innovation: A track record of initiating change by solving complex, high-impact engineering problems. Driving innovation in cloud infrastructure or other relevant technologies.
  • Communication: Demonstrated experience engaging with stakeholders at all levels of the organization. Experience presenting technical ideas to non-technical audiences.

Nice to Have:

  • Experience with the Kubernetes ecosystem, including custom controllers, control planes, and operator development.
  • Experience with operational excellence and scalability, removing engineering friction and enhancing developer productivity across all layers of infrastructure.
  • Build and operate resilient, high-performance computing networks for AI workloads.
  • Design, develop, and maintain scalable applications and APIs.
  • Experience with cloud based storage solutions.
  • Building, designing, operating large scale server fleet automation and systems.
  • Experience securing internal and external infrastructure while continuously assessing and improving security measures across CoreWeave’s services.
  • Experience with datacenter hardware and technologies.
  • Problem-solving with a passion for innovation and collaboration.
  • Work in a fast-paced environment, balancing multiple priorities.

Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $275,000-$330,000. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience.

 

Receive Tech Ladies'
newest jobs in your inbox,
every week.

Join Tech Ladies for full-access to the job board, member-only events, and more!

If you're already a member, we haven't forgotten you. We promise. It's a new system. If you fill out the form once, it'll remember you going forward. Apologies for the inconvenience.

Roseland
Roseland
No items found.
Engineering
Engineering
In-Person
In-Person