Demonstrated expertise and a track record in architecting, implementing, and supporting enterprise-grade (highly available, scalable, high-performance, and secure) infrastructure powering complex business solutions.
Extensive Linux expertise (8+ years): proven ability to diagnose, troubleshoot, and resolve complex issues across various distributions, ensuring optimal system performance, stability, and security, along with experience in site reliability engineering and performance tuning.
Extensive practical experience with managed Kubernetes offerings such as GKE, EKS, and AKS, and a working knowledge of the Kubernetes ecosystem and tooling (e.g., Helm, Docker, Istio).
8+ years of hands-on experience with cloud computing, including infrastructure, virtualization, containerization, networking, storage, and platforms; or equivalent experience with traditional enterprise data center technologies, including virtualization, RDBMS, storage appliances, and private networks.
Proficiency in at least two of the following: Bash, Go, Python, Ruby, Node.js, Rust, Java.
Exceptional hands-on troubleshooting and problem-solving skills, with a consistent ability to diagnose complex technical issues, identify root causes, and implement effective solutions in a timely manner. This includes a proven track record of analyzing system failures, collaborating with cross-functional teams to devise workarounds, and developing long-term preventive measures that improve operational efficiency and minimize downtime.
Comprehensive understanding and application of security best practices, including data encryption, access control, secure coding, network security, vulnerability management, incident response, and compliance. Ability to proactively identify risks, implement preventive measures, and respond effectively to incidents.
Experience building and managing IT infrastructure architecture and operations teams and their processes.
Knowledge of operating in high-security government cloud environments, including the controls that apply in IL5 and IL6 environments.