Job Description
The Sr. SRE will be responsible for the reliability, scalability, and performance of systems supporting classified government projects in an air-gapped deployment. This role leverages advanced monitoring and DevOps tools to ensure uptime and compliance in a disconnected environment.

Key Responsibilities
- Design and maintain highly reliable systems using RKE2, Kubernetes, Ingress, Kong, Artifactory, and Sonar.
- Implement observability solutions with Prometheus, Grafana, Splunk, and Elastic to monitor system health in an air-gapped setting.
- Ensure compliance and performance optimization across multi-tenant deployments.
- Conduct code quality analysis and security assessments using Sonar.
- Collaborate with the Lead and Infra/Security Specialists to resolve incidents and improve system resilience.
- Develop and maintain documentation for system configurations and recovery procedures in a classified environment.

Required Skills and Qualifications
- Expertise in RKE2, Kubernetes, Ingress, Kong, Artif...