We are seeking an experienced Site Reliability Engineer to join our team and help maintain and improve our AWS cloud infrastructure. The ideal candidate will have a strong background in cloud computing, automation, and DevOps practices, as well as an active security clearance.
## Requirements:
• Active Top Secret (TS) security clearance
• Bachelor’s degree in Computer Science, Information Technology, or a related field
• 5+ years of experience as an AWS SRE/DevOps Engineer
• Strong proficiency in at least one programming language (e.g., Python, Go, Java)
• Extensive experience with AWS services (EC2, S3, RDS, VPC, CloudWatch)
• Expertise in infrastructure-as-code tools (e.g., Terraform, CloudFormation)
• Proficiency with containerization and orchestration technologies (Docker, Kubernetes)
• Experience with monitoring and observability tools (e.g., CloudWatch, Grafana, Prometheus)
• Strong understanding of networking concepts and security best practices
## Responsibilities:
• Design, implement, and maintain scalable and reliable AWS cloud infrastructure
• Develop and maintain automation tools to streamline AWS operations and improve system efficiency
• Implement and manage CI/CD pipelines for continuous integration and deployment
• Monitor system performance, troubleshoot issues, and implement proactive solutions
• Collaborate with development teams to ensure application reliability and performance
• Contribute to disaster recovery plans and ensure robust backup systems are in place
• Implement and maintain security best practices in the AWS environment
## Preferred Qualifications:
• AWS certifications (e.g., AWS Certified Solutions Architect, AWS Certified DevOps Engineer)
• Experience with additional cloud platforms (e.g., Azure, GCP)
• Advanced knowledge of Linux system administration
• Familiarity with Agile methodologies and practices
The successful candidate will play a crucial role in ensuring the reliability, scalability, and security of our AWS cloud infrastructure while adhering to the highest standards of operational excellence.