SITE RELIABILITY ENGINEER (SRE)Own your opportunity. Make your impactAs a Site Reliability Engineer (SRE) supporting the CIO Infrastructure Services (CIS) program, you will help maintain the reliability, scalability, and performance of enterprise infrastructure services deployed across more than 250 global sites. You will engineer and optimize systems, automate operational workflows, strengthen monitoring capabilities, and ensure the stability and resilience of mission critical‑ environments.You will partner closely with Engineering, Operations, Tech Refresh, Cybersecurity, and Data Center teams to ensure seamless integration of new capabilities into a high availability production environment, helping the Defense Intelligence Enterprise remain secure, connected, and ‑mission ready‑.HOW A SITE RELIABILITY ENGINEER WILL MAKE AN IMPACTEnsure the reliability, availability, and performance of enterprise IT systems across global environmentsDevelop automation solutions that reduce manual effort, streamline operational tasks, and improve system resiliencyBuild and maintain monitoring, alerting, and observability capabilities supporting 24/7/365 enterprise operationsPerform root cause analysis (RCA), corrective action planning, and long-term‑ problem remediation for infrastructure issuesPartner with engineering teams to validate, test, and integrate new systems, upgrades, baselines, and enhancements into productionImprove system performance through configuration tuning, capacity planning, and optimization of compute, storage, network, and virtualized environmentsDevelop and maintain infrastructure-as-code, scripts, and operational automation to support consistent and repeatable deploymentsSupport enterprise incident response, including triage, escalation, and service restoration for high visibility‑ eventsMaintain operational documentation including SOPs, runbooks, baselines, dashboards, and architectural diagramsEnsure compliance with ITIL/ITSM processes—including Incident, Problem, Change, and Configuration ManagementStrengthen the enterprise security posture by supporting patching, vulnerability remediation, and RMF related‑ configuration updatesCoordinate with global operations teams to ensure service continuity, readiness, and adherence to SLAs and KPIsLeverage analytics, metrics, and monitoring data to identify performance trends and drive continuous service improvement initiativesWHAT YOU'LL NEED TO SUCCEEDRequired:CLEARANCE: Active TS/SCI a favorable PolygraphEDUCATION: Bachelor's degree in computer science, engineering, IT, or related technical field(Additional experience may substitute for degree)8+ years of experience in site reliability engineering, systems engineering, enterprise operations, or DevOps rolesHands‑on experience with automation tools (PowerShell, Python, Ansible, Terraform, etc.)Strong experience supporting enterprise infrastructure domains including server compute, storage, virtualization, networking, and monitoringExperience with enterprise monitoring platforms (e.g., SolarWinds, SCOM, Splunk, Nagios, ELK)Strong understanding of ITIL/ITSM workflows and operational governance processesDemonstrated ability to troubleshoot complex technical issues across distributed enterprise environmentsStrong communication and collaboration skills working across multidisciplinary technical teams Excellent communication and stakeholder engagement skillsUS citizenship requiredLOCATION: OnsitePreferred:ITIL v4 Foundations certificationExperience supporting the client, DoDIIS, or Intelligence Community environmentsFamiliarity with CMMC, NIST 800‑53, policies, and RMF processesExperience with ServiceNow/Service Central and automated ticketing workflowsExperience supporting hybrid cloud, virtual desktop infrastructure (VDI), or hyperconverged platformsGDIT IS YOUR PLACEAt GDIT, the mission is our purpose, and our people are at the center of everything we do.Growth: AI-powered career tool that identifies career steps and learning opportunitiesSupport: An internal mobility team focused on helping you achieve your career goalsRewards: Comprehensive benefits and wellness packages, 401K with company match, and competitive pay and paid time offCommunity: Award-winning culture of innovation and a military-friendly workplaceOWN YOUR OPPORTUNITYExplore an enterprise IT career at GDIT and you'll find endless opportunities to grow alongside colleagues who share your desire to drive operations forward.#CIS#J-18808-Ljbffr
SITE RELIABILITY ENGINEER (SRE)Own your opportunity. Make your impactAs a Site Reliability Engineer (SRE) supporting the CIO Infrastructure Services (CIS) program, you will help maintain the reliability, scalability, and performance of enterprise infrastructure services deployed across more than 250 global sites. You will engineer and optimize systems, automate operational workflows, strengthen monitoring capabilities, and ensure the stability and resilience of mission critical‑ environments.You will partner closely with Engineering, Operations, Tech Refresh, Cybersecurity, and Data Center teams to ensure seamless integration of new capabilities into a high availability production environment, helping the Defense Intelligence Enterprise remain secure, connected, and ‑mission ready‑.HOW A SITE RELIABILITY ENGINEER WILL MAKE AN IMPACTEnsure the reliability, availability, and performance of enterprise IT systems across global environmentsDevelop automation solutions that reduce manual effort, streamline operational tasks, and improve system resiliencyBuild and maintain monitoring, alerting, and observability capabilities supporting 24/7/365 enterprise operationsPerform root cause analysis (RCA), corrective action planning, and long-term‑ problem remediation for infrastructure issuesPartner with engineering teams to validate, test, and integrate new systems, upgrades, baselines, and enhancements into productionImprove system performance through configuration tuning, capacity planning, and optimization of compute, storage, network, and virtualized environmentsDevelop and maintain infrastructure-as-code, scripts, and operational automation to support consistent and repeatable deploymentsSupport enterprise incident response, including triage, escalation, and service restoration for high visibility‑ eventsMaintain operational documentation including SOPs, runbooks, baselines, dashboards, and architectural diagramsEnsure compliance with ITIL/ITSM processes—including Incident, Problem, Change, and Configuration ManagementStrengthen the enterprise security posture by supporting patching, vulnerability remediation, and RMF related‑ configuration updatesCoordinate with global operations teams to ensure service continuity, readiness, and adherence to SLAs and KPIsLeverage analytics, metrics, and monitoring data to identify performance trends and drive continuous service improvement initiativesWHAT YOU'LL NEED TO SUCCEEDRequired:CLEARANCE: Active TS/SCI a favorable PolygraphEDUCATION: Bachelor's degree in computer science, engineering, IT, or related technical field(Additional experience may substitute for degree)8+ years of experience in site reliability engineering, systems engineering, enterprise operations, or DevOps rolesHands‑on experience with automation tools (PowerShell, Python, Ansible, Terraform, etc.)Strong experience supporting enterprise infrastructure domains including server compute, storage, virtualization, networking, and monitoringExperience with enterprise monitoring platforms (e.g., SolarWinds, SCOM, Splunk, Nagios, ELK)Strong understanding of ITIL/ITSM workflows and operational governance processesDemonstrated ability to troubleshoot complex technical issues across distributed enterprise environmentsStrong communication and collaboration skills working across multidisciplinary technical teams Excellent communication and stakeholder engagement skillsUS citizenship requiredLOCATION: OnsitePreferred:ITIL v4 Foundations certificationExperience supporting the client, DoDIIS, or Intelligence Community environmentsFamiliarity with CMMC, NIST 800‑53, policies, and RMF processesExperience with ServiceNow/Service Central and automated ticketing workflowsExperience supporting hybrid cloud, virtual desktop infrastructure (VDI), or hyperconverged platformsGDIT IS YOUR PLACEAt GDIT, the mission is our purpose, and our people are at the center of everything we do.Growth: AI-powered career tool that identifies career steps and learning opportunitiesSupport: An internal mobility team focused on helping you achieve your career goalsRewards: Comprehensive benefits and wellness packages, 401K with company match, and competitive pay and paid time offCommunity: Award-winning culture of innovation and a military-friendly workplaceOWN YOUR OPPORTUNITYExplore an enterprise IT career at GDIT and you'll find endless opportunities to grow alongside colleagues who share your desire to drive operations forward.#CIS#J-18808-Ljbffr
Government Careers
Government jobs offer stability, competitive benefits, and the chance to make a meaningful impact on your community and country.
Whether you’re starting your career or seeking new opportunities, these roles provide pathways for growth, security, and service.
Explore positions across a wide range of fields and take the first step toward a rewarding future in public service.
MORE JOBS
-
Entry-Level Customs and Border Protection Officer (GS-5/7)
- Herndon, Virginia
- U.S. Customs and Border Protection
- Jun 20, 2026
-
Traffic Control Flagger
- Wilkes Barre, Pennsylvania
- Flagger Force
- Jun 20, 2026
-
Customs and Border Protection Officer (CBPO) - Entry Level New Hire Sign-On and Retention Incentives
- Vallejo, California
- U.S. Customs and Border Protection
- Jun 20, 2026
-
Platform Systems Engineer TS/SCI Clearance
- Colorado Springs, Colorado
- Phase2 Technology
- Jun 20, 2026
-
Dispatcher
- Commerce City, Colorado
- The Parking Spot
- Jun 20, 2026
-
SIGINT Analyst
- Tampa, Florida
- Core One
- Jun 20, 2026