Description
The NMCI Service Management Integration and Transport (SMIT) group at Leidos has an opening for a Site Reliability Engineer to focus on the reliability, performance, and scalability of complex distributed systems. Under the SMIT Contract, the Leidos team is responsible for the core backbone for the Navy-Marine Corps Intranet, including cybersecurity services, network operations, network engineering, service desk, seat support services, and data transport.
The SRE will also develop and execute tests focused on system resilience, performance under load, and failure scenarios. They will work in tandem with other Site Reliability Engineers (SREs) and development teams to create automated testing frameworks that simulate real-world conditions and validate system behavior under normal and stress conditions, ensuring our services are resilient and meet established service level objectives (SLOs). Your work will contribute to the development of robust and scalable services that operate reliably in production.
Your responsibilities will include maintaining complex computer systems by writing code to automate software releases, monitor systems, and detect and fix problems before users even know there is an issue. You will use these skills to improve site performance and overall reliability.
The SRE role is responsible for supporting, migrating, automating, and optimizing software development and deployment processes and infrastructure as code, and for contributing to the overall maturity of the Site Reliability Engineering program.
Primary Responsibilities
- Work alongside the development and operations teams to ensure speedy and reliable software deployments, monitor systems, and improve overall reliability of the platform. In addition, as you discover and document system bugs, you have the motivation to go off and fix them yourself.
- Develop features using the AI coding tool and repository of scripts to automate, scale, test, and secure the cloud infrastructure and the pipelines.
- Enhance performance monitoring of the various systems via Splunk or other dashboard reporting tools.
- Identify performance bottlenecks and optimize the performance of cloud infrastructure.
- Contribute to continuing our SRE journey by suggesting ways to improve engineering build, maintenance, automation, and reliability across the platform with SRE/DevOps tools and Infrastructure-as-Code.
- Develop and code high-quality pipeline automation workflows, both inside and outside the cloud platform, that are appropriate for business and technology strategies.
- Develop and execute test strategies that simulate real-world failure scenarios, including network disruptions, hardware failures, and system overloads.
- Create, script, and run performance tests to measure system behavior under varying levels of load and traffic. Identify bottlenecks, performance degradation, and areas for optimization.
- Design, implement, and maintain automated test suites for infrastructure and application components. Ensure that testing is integrated into the CI/CD pipeline to validate system reliability with every release.
- Build automated systems for continuous performance testing, stress testing, and load testing.
- Work closely with SREs, developers, and operations teams to define reliability goals and develop appropriate testing strategies to validate those goals.
- Ensure that new services and features undergo thorough testing for performance, reliability, and failure recovery before deployment to production.
- Validate that monitoring, logging, and alerting mechanisms are functioning correctly by testing systems under failure conditions.
- Ensure that Service Level Indicators (SLIs) and Service Level Objectives (SLOs) are accurately measured and tracked through automated testing frameworks.
- Resolve most conflicts between timeline, budget, and scope independently but intuitively raise sophisticated or consequential issues to senior management.
Basic Qualifications
- Typically requires a Bachelor’s degree; however, 4 – 8 years of prior relevant experience may be considered in lieu of a degree.
- Must have an active DoD Secret security clearance and be able to maintain it.
- Minimum of DoD 8570.01 IAT Level II Certification required prior to onboarding; certification must be maintained while supporting the SMIT Contract.
- Must be able to support program execution in classified environments and access SIPRNet from an NMCI location on short notice (local travel).
- 5+ years’ experience configuring Cisco routers, switches, and network appliances.
- 5+ years’ experience with routing protocols (e.g., OSPF, EIGRP, BGP).
- 5+ years’ experience with L2 switching (e.g., VLANs, spanning tree, VTP).
- 5+ years’ experience troubleshooting complex routing and switching issues.
- Experience with multiple vendor routing, switching or wireless product lines.
- Strong understanding and in-depth knowledge of TCP/IP network/subnet addressing.
- Supports network configuration/asset management activities, manages configuration drift, and accurately creates or modifies network documentation to reflect the as-is and/or to-be environment.
- Ability to work independently or in a team environment to resolve technical issues in a dynamic environment.
- Experience with automated script design, coding, debugging, and maintenance (using Bash, Python, etc.) preferred.
- Experience in CI/CD toolsets (e.g., Jenkins, GitLab).
- Experience with Containerization (Docker) and Container Orchestration (Kubernetes).
- Good command of Linux/Unix and command line knowledge.
- Experience in application administration, configuration, and integration.
- Familiarity with agile development methodologies.
- Skilled and disciplined to work with a distributed team.
- Ability to work in a highly collaborative, forward-thinking, and innovation-driven environment.
- Knowledge of Agile and DevSecOps/SRE concepts and best practices, with a desire to grow that knowledge.
- Hands-on experience with Atlassian products (Jira, Confluence, Bitbucket, etc.).
- Experience creating Jira and/or Azure DevOps workflows, projects, and custom configurations.
- Experience administering/maintaining the SRE platform via Ansible playbooks (e.g., upgrading Jenkins).
- Experience automating tasks with scripting languages like PowerShell or Python.
- Experience integrating with and maintaining various third-party CI/CD tools like Jenkins and GitLab.
- Experience with PaaS using Red Hat OpenShift/Kubernetes and Docker containers.
- Experience with commercial cloud infrastructure deployment environments such as AWS and Azure.
- Experience with automated provisioning and configuration tools like Terraform, CloudFormation, Chef, Puppet, Ansible, or similar technologies.
- Working knowledge of the Risk Management Framework (RMF) and DISA STIGs.
Preferred Qualifications:
- Previous work experience providing support to the NGEN-NMCI program.
- Experience with Infrastructure as Code (IaC) tools such as Terraform, Ansible, or CloudFormation for automating test environments.
AppMod
If you're looking for comfort, keep scrolling. At Leidos, we outthink, outbuild, and outpace the status quo, because the mission demands it. We're not hiring followers. We're recruiting the ones who disrupt, provoke, and refuse to fail. Step 10 is ancient history. We're already at step 30, and moving faster than anyone else dares.
Original Posting:
March 10, 2026
For U.S. Positions: While subject to change based on business needs, Leidos reasonably anticipates that this job requisition will remain open for at least 3 days with an anticipated close date of no earlier than 3 days after the original posting date as listed above.
Pay Range:
$87,100.00 - $157,450.00
The Leidos pay range for this job level is a general guideline only and not a guarantee of compensation or salary. Additional factors considered in extending an offer include (but are not limited to) responsibilities of the job, education, experience, knowledge, skills, and abilities, as well as internal equity, alignment with market data, applicable bargaining agreement (if any), or other law.
About Leidos
Leidos is an industry and technology leader serving government and commercial customers with smarter, more efficient digital and mission innovations. Headquartered in Reston, Virginia, with 47,000 global employees, Leidos reported annual revenues of approximately $16.7 billion for the fiscal year ended January 3, 2025. For more information, visit www.Leidos.com.
Pay and Benefits
Pay and benefits are fundamental to any career decision. That's why we craft compensation packages that reflect the importance of the work we do for our customers. Employment benefits include competitive compensation, Health and Wellness programs, Income Protection, Paid Leave and Retirement. More details are available at www.leidos.com/careers/pay-benefits.
Securing Your Data
Beware of fake employment opportunities using Leidos’ name. Leidos will never ask you to provide payment-related information during any part of the employment application process (i.e., ask you for money), nor will Leidos ever advance money as part of the hiring process (i.e., send you a check or money order before doing any work). Further, Leidos will only communicate with you through emails that are generated by the Leidos.com automated system – never from free commercial services (e.g., Gmail, Yahoo, Hotmail) or via WhatsApp, Telegram, etc. If you received an email purporting to be from Leidos that asks for payment-related information or any other personal information (e.g., about you or your previous employer), and you are concerned about its legitimacy, please make us aware immediately by emailing us at LeidosCareersFraud@leidos.com.
If you believe you are the victim of a scam, contact your local law enforcement and report the incident to the U.S. Federal Trade Commission.
Commitment to Non-Discrimination
All qualified applicants will receive consideration for employment without regard to sex, race, ethnicity, age, national origin, citizenship, religion, physical or mental disability, medical condition, genetic information, pregnancy, family structure, marital status, ancestry, domestic partner status, sexual orientation, gender identity or expression, veteran or military status, or any other basis prohibited by law. Leidos will also consider for employment qualified applicants with criminal histories consistent with relevant laws.
