Position Summary:
We are seeking a highly skilled and experienced AWS DevOps Engineer with at least 8 years of hands-on experience in designing, deploying, and managing modern cloud infrastructures. The engineer will work closely with the AWS Architect to ensure that all infrastructure and development pipelines are robust, scalable, cost-efficient, and aligned with industry best practices.
The ideal candidate will have strong expertise in observability, logging, CI/CD pipelines, infrastructure as code (IaC), and cost optimization strategies. They will also be proficient in emerging tools such as Amazon Q, Kiro, and AWS AI/ML services, and will bring a proactive, solution-oriented mindset.
Key Responsibilities:
Collaboration & Architecture Alignment:
• Partner closely with the AWS Architect to translate architecture designs into operational, secure, and cost-effective infrastructure.
• Provide infrastructure feedback during solution design to ensure feasibility, scalability, and maintainability.
• Participate in architecture reviews, infrastructure planning, and technical decision-making processes.
Infrastructure Management & Optimization:
• Build, deploy, and maintain AWS cloud infrastructure using Infrastructure as Code (Terraform, AWS CDK, or CloudFormation).
• Implement cost monitoring and optimization strategies to ensure resource efficiency.
• Manage and automate infrastructure scaling, backup, disaster recovery, and high availability solutions.
Observability & Logging:
• Establish robust monitoring, alerting, and logging frameworks (CloudWatch, AWS X-Ray, OpenTelemetry, Datadog, etc.).
• Implement full-stack observability for applications, APIs, and infrastructure components.
• Ensure incident detection and resolution processes are efficient and well-documented.
CI/CD & DevOps Best Practices:
• Design, implement, and manage CI/CD pipelines for multiple environments (Dev, QA, Staging, Production).
• Integrate automated testing, security scans, and compliance checks into the release process.
• Support blue/green and canary deployments, ensuring minimal downtime and reliable rollback capabilities.
AI & Emerging Technologies:
• Leverage AWS AI/ML services and tools like Amazon Q, SageMaker, Kiro, and other emerging AI platforms to improve DevOps workflows and infrastructure intelligence.
• Explore automation opportunities using AI-powered infrastructure insights and recommendations.
Security & Compliance:
• Implement and maintain IAM best practices, secrets management, and least privilege principles.
• Ensure compliance with industry standards such as SOC 2, PCI DSS, and ISO 27001.
• Perform security audits and apply proactive remediations.
Required Skills & Qualifications:
• 8+ years of experience in DevOps, Cloud Engineering, or AWS Infrastructure roles.
• Deep expertise in AWS services (EC2, S3, RDS, Lambda, ECS/EKS, CloudFront, VPC, Route 53, etc.).
• Strong proficiency in Infrastructure as Code (Terraform, AWS CDK, CloudFormation).
• Proven experience with CI/CD pipelines (CodePipeline, GitHub Actions, Jenkins, GitLab CI).
• Expertise in observability tools (CloudWatch, X-Ray, Datadog, Prometheus, ELK Stack).
• Strong knowledge of cost optimization strategies in AWS.
• Familiarity with AI/ML tools such as Amazon Q, SageMaker, Kiro, and related platforms.
• Solid understanding of containerization and orchestration (Docker, Kubernetes, ECS, EKS).
• Strong scripting and automation skills (Python, Bash, PowerShell).
• Excellent communication skills and the ability to work effectively with cross-functional teams.
• Proactive, solution-oriented, and adaptable mindset with strong problem-solving skills.
Preferred Qualifications:
• AWS Certified DevOps Engineer – Professional or AWS Certified Solutions Architect – Professional.
• Experience in multi-account AWS governance (AWS Organizations, Control Tower, SCPs).
• Experience with event-driven architectures and messaging systems (SNS, SQS, EventBridge, Kafka).
• Knowledge of FinOps principles and tooling.
• Exposure to serverless architectures and event-based workflows.
Key Attributes:
• Collaborative – Works hand-in-hand with Architects, Developers, and Security teams.
• Analytical – Foresees infrastructure issues and resolves them proactively.
• Innovative – Explores emerging AWS and AI technologies for better performance and efficiency.
• Reliable – Ensures infrastructure uptime, security, and performance at all times.
• Cost-conscious – Seeks ways to reduce operational costs without compromising quality.