The Compute team is looking to hire a Senior Software Engineer who thrives at the intersection of infrastructure and software development. This team's challenges break into two domains, which we consider platform engineering and cluster engineering.

Platform Engineering: Higher-level orchestration of both compute capacity and workload primitives to support our multi-cloud, multi-region deployments. Current focus areas include:
- Software automation that creates, manages, and destroys clusters in our fleet.
- APIs and controllers that support multi-cluster deployment and scheduling mechanics.
- Core SDKs that enable controller development in the larger organization.
- Software that codifies out-of-cluster ancillary concerns such as network configurations and managed services.

Cluster Engineering: Intra-cluster engineering problems that involve balancing performance, efficiency, and stability. Current focus areas include:
- Detecting node-level performance characteristics and making availability decisions based on the data.
- Schedulers that support more efficient packing of resources, along with reactive rescheduling as compute availability changes.
- Kubernetes controllers that offer APIs in the cluster and perform reconciliation to reach a desired state.
- Cluster upgrades, covering both mechanical process concerns and automation.

As a member of the Compute team, your work will span these two domains, which are rich with challenging infrastructure and software engineering problems. Your work will directly impact hundreds of millions of users around the world.
Join us and help build the future of Reddit!

In your day-to-day, you can expect to:
- Work collaboratively with a team of software engineers to create and maintain the foundational platform for running Reddit's infrastructure.
- Deliver software to improve the availability, scalability, latency, and efficiency of Reddit's Compute Platform.
- Contribute feedback on the technical and strategic direction of the compute platform.
- Automate critical aspects of the development process, such as service creation and management, as well as critical infrastructure operations.
- Share on-call responsibilities with the Compute team.

You have:
- 4+ years of experience developing internet-scale software, preferably in the context of infrastructure.
- Language proficiency in Go.
- Experience developing on top of Kubernetes or similar distributed systems.
- Kubernetes controller or operator development experience is a huge plus.
- Proficiency operating Linux, with a solid understanding of cgroups, namespaces, and other multi-tenancy primitives.
- Strong troubleshooting capabilities for both systems and software.
- Experience engineering large systems, tracking work, and being a self-starter on projects.
- Excellent communication skills to collaborate with a service-oriented team and company.

Benefits:
- Comprehensive Healthcare Benefits
- 401k Matching
- Workspace benefits for your home office
- Personal & Professional development funds
- Family Planning Support
- Flexible Vacation (please use it!) & Reddit Global Wellness Days
- 4+ months paid Parental Leave
- Paid Volunteer time off