
Principal Software Engineer

Microsoft
United States, Texas, Irving
7000 State Highway 161
Oct 28, 2025
Overview
Imagine being at the forefront of transformative cloud technology. The Azure Kubernetes Service (AKS) team is pioneering the management of Kubernetes clusters at hyperscale, building efficient, safe, and scalable tools to manage millions of servers that power AKS.
As a Principal Software Engineer on the AKS Platform Infrastructure team, you'll dive deep into automated infrastructure management and server orchestration at a scale few companies ever reach. You'll be responsible for building and maintaining the compute infrastructure that powers AKS, enabling it to be the most performant and reliable managed Kubernetes service in the world.
This is a unique opportunity to deepen your career experience, develop deep hyperscale cloud infrastructure expertise, and shape the future of Kubernetes at Microsoft.
Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees, we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Responsibilities
Create and maintain tools that manage hundreds of thousands of virtual machines powering Azure Kubernetes Service.
Expand AKS's global footprint by automating buildouts in new regions and sovereign clouds.
Coordinate region-agnostic buildout architecture, design, and execution across multiple AKS and Azure teams.
Automate the build and release stack to enable engineers to manage dozens of microservices safely, efficiently, and in compliance with standards.
Build tools, automation, and safety mechanisms to prevent infrastructure problems from becoming production incidents.
Act as a Designated Responsible Individual (DRI), participating in on-call rotations to monitor system health and restore services during incidents.
Balance pragmatism with vision and creativity; deliver continuous improvements to the team's process and codebase.
Collaborate across teams to deliver scalable, resilient, and secure infrastructure solutions.
Embody our culture and values.