Senior Manager - Site Reliability Engineering, Datacenter Hardware and IaaS
Posted 2025-04-06Description:
 GEICO is seeking an experienced Senior Manager with a passion for building high performance, low-latency platforms, and applications.
 You will build and manage a team of engineers with a deep focus on delivering enterprise-wide product to operate in a highly performant and efficient way.
 The ideal candidate has deep technical expertise to improve application performance, capacity benchmarking, improve availability and reliability, design and evolve cloud infrastructure and architecture.
 A Senior Manager will lead strategy and execution of a technical roadmap that will increase the velocity of delivering products and unlock new engineering capabilities.
Requirements:  Strong knowledge in modern at-scale datacenter architectures.  Experience with OCP hardware and related technologies (eg. OpenBMC, Redfish), bonus for knowledge in low level driver development.  Focus on leveraging infrastructure as code as a primary means of control.  Building CI/CD chains for datacenter operations  Experience in building IaaS systems based on OpenStack  Knowledge of cloud computing technologies and concepts (SaaS, PaaS, IaaS, etc)  Working knowledge of object-oriented development, Gang of Four (GOF) Design Patterns, Microservices, Dependency Injection with IOC containers, and both frontend and backend unit testing  Proven ability to concentrate and demonstrate a capacity for learning technical concepts and adapting to new technologies quickly  Strong Cloud (AWS, GCP, Azure etc.) platform knowledge  Proficiency in Project Management and work item management tools such as Azure DevOps and Portfolio  Strong foundation in algorithms, data structures, and core computer science concepts  Experience in existing Operational Portals such as Azure Portal  Fluency with Python, Golang, JSON, and RESTful Web Services  Experience with application monitoring tools and performance assessments  Experience in PowerShell Scripting  Constructing, interpreting, and applying metrics to your work and decision making, able to use those metrics to identify correlation between drivers and results, and using that information to drive prioritization and action  Strong understanding of Site Reliability Engineering and DevOps principles  Strong technical acumen in Cloud Architecture, Performance Benchmarking, and Capacity planning  Expert in Container orchestration (e.g., Kubernetes), container runtimes and optimization  Experience with driving cultural change in technical excellence, quality, and efficiency  Experience managing and growing technical leaders and teams  In-depth knowledge of CS data structures and algorithms
Benefits:
 Premier Medical, Dental and Vision Insurance with no waiting period**
 Paid Vacation, Sick and Parental Leave
 401(k) Plan
 Tuition Reimbursement
 Paid Training and Licensures
Apply Job!