Staff System Engineer (Remote)

Posted 2025-10-26
Remote, USA Full Time Immediate Start
Position Purpose: The Staff Systems Engineer is responsible for leading a team of engineers designing, building, and supporting The Home Depot's technical infrastructure of hardware and system software that drives the success of Home Depot and our customers. As a Staff Systems Engineer you will be part of a dynamic team with engineers of all experience levels who help each other build and grow technical and leadership skills while creating, deploying, and supporting production infrastructure. Staff Systems Engineers contribute to foundational infrastructure as code elements that can be reused as well as architectural diagrams and other related documentation. Staff Systems Engineers participates in the selection and lead the implementation of physical and virtual infrastructure to meet evolving enterprise and product team needs. As a Staff Systems Engineer, you will be a core player that participates and leads multiple efforts simultaneously. You are expected to build and grow the skillsets of more junior Engineers on the team. 1.)An ideal candidate will drive innovation and lead efforts related to Hybrid Cloud, Stream Processing Pipeline, and DevOps/SRE Enhancements. 2.) An ideal candidate will work directly with customers to gather requirements and ensure maximum business value, supervise and mentor junior/senior level engineers, and assist with strategy and planning, in addition to hands-on engineering activities. 3.) An ideal candidate will have a minimum 3-5 years of professional experience with all items in the “Required Skillsets” and a strong level of experience with a majority of the listed “Preferred” skillsets. Required Skillset: · Kubernetes · Golang Development · Apache Kafka · Cloud Technology · DevOps/SRE Practices Key Responsibilities: • 25% Delivery and Execution - Leads configuration, debugging, and support for information technology solutions; Leads field and corporate rollouts of technology; Leads the stand up of necessary system software, hardware, and equipment (physical or virtual) to meet changing infrastructure needs; Creates and optimizes specifications for technology solutions; Produces and manages purchase requests for hardware and software; Leads development of test suites (functional, destructive, etc) to enable successful rapid deployment of infrastructure as code to production • 15% Learning - Keeps abreast of innovations and industry trends as well as changes to internal systems and determines how they impacts tools, training, and support necessary to keep systems up, running, and secure; Participates in and contributes to learning activities around modern systems engineering core practices (communities of practice); Proactively views articles, tutorials, and videos to learn about new technologies and best practices being used within other technology organizations • 30% Planning and Analysis - Researches and analyzes business trends and behavioral data to identify strategic opportunities for improvements and new initiatives; Leads the evaluation, development, and recommendation of specific strategic technology to provide cost-effective solutions that meet THD requirements; Researches and designs best fit infrastructure, network, database, cloud, AI, and security architectures for products; Proactively creates and maintains infrastructure as code and AI models for continuous improvement; Participates in strategic project planning and management across multiple efforts; Develops formal training courses • 30% Support and Enablement - Collaborates with product and project teams to understand needs and enable them with infrastructure; Supports technology architecture design review efforts for project and product teams; Leverages tooling and custom applications to monitor the operational status of applications, infrastructure, networks, databases, and security; optimizes and tunes performance as appropriate; Drives root cause analysis, debugging, support, and post-mortem analysis for security incidents and service interruptions; Maintains, upgrades, and supports existing systems and infrastructure to ensure operational stability; Acts as a vendor liaison, owning resourcing, issue management, and documentation; Leads the production of in-house documentation around solutions; Provides application support for software running in production; Acts as a mentor to more junior Systems Engineers; Drives converting KB articles into AI models; Drives changes to analytic models used to analyze performance Direct Manager/Direct Reports: • This position typically reports to Systems Engineer Manager or Sr Manager • This position typically has 0 Direct Reports Travel Requirements: • No travel required. Physical Requirements: • Most of the time is spent sitting in a comfortable position and there is frequent opportunity to move about. On rare occasions there may be a need to move or lift light articles. Working Conditions: • Located in a comfortable indoor area. Any unpleasant conditions would be infrequent and not objectionable. Minimum Qualifications: • Must be eighteen years of age or older. • Must be legally permitted to work in the United States. Preferred Qualifications: • 3-6 years of relevant work experience • Golang Development: Concurrency, Unit Testing, Profiling/Benchmarking, Dependency Management, • Google Cloud Platform: GKE, BigQuery, PubSub, Cloud Storage/Networking • Microservice Architecture: Docker, Envoy, Istio Service Mesh, Kubernetes CRDs and Custom Operators/Controllers, Mesosphere • DevOps/SRE: Ansible, Terraform, Spinnaker, Bash Scripting, Concourse CI, Github/GitOps, Jenkins, Prometheus/AlertManager, Grafana, PagerDuty • Professional or educational experience in multiple Information Technology disciplines • Proficiency in working as part of a collaborative, cross-functional, modern engineering team • Proficiency in troubleshooting and remediation within multiple Information technology disciplines • Proficiency with debuggers, runtime analysis, library systems, compiled programming, and software update tools • Proficiency in system and environment analysis, design, and optimization • Exposure to developing technical roadmaps including work estimation, refactoring, and modernizing legacy systems • Experience with object oriented programming languages (preferably Java), distributed computing environments, and code reviews • Experience with system security design and management • Experience with disaster recovery planning and engineering • Proficiency in operating system commands and utilities as well as scripting • Proficiency working with cloud platforms such as GCP and Azure • Proficiency in supporting a 24x7 retail operation • Proficiency with version control systems • Proficiency with CI/CD toolchain • Proficiency with production system designs including Infrastructure as Code, High Availability, and Performance monitoring • Experience with Site Reliability Engineering (SRE) Minimum Education: • The knowledge, skills and abilities typically acquired through the completion of a bachelor's degree program or equivalent degree in a field of study related to the job. Preferred Education: • No additional education Minimum Years of Work Experience: • 3 Preferred Years of Work Experience: • No additional years of experience Minimum Leadership Experience: • None Preferred Leadership Experience: • None Certifications: • None Competencies: • Action Oriented • Being Resilient • Business Insights • Global Perspective • Manages Ambiguity • Nimble Learning • Self-Development • Collaborates • Cultivates Innovation • Optimizes Work Processes • Situational Adaptability • Communicates Effectively • Drives Results • Interpersonal Savvy Benefits offered include health care benefits, 401K, ESPP, paid time off, and success sharing bonus. For a full list of the various benefits The Home Depot offers, visit https://careers.homedepot.com/our-benefits. Apply tot his job Apply To this Job
Back to Job Board