About the job
We are seeking a highly skilled Senior Cloud Engineer specializing in AWS.
The ideal candidate will have strong expertise in observability and monitoring tools like Datadog, Grafana, and the ELK stack and will play a pivotal role in designing, implementing and maintaining our cloud infrastructure, ensuring high availability, performance and robust monitoring solutions.
Responsibilities
- Design, deploy and manage AWS infrastructure using best practices (e.g., EC2, RDS, S3, Lambda, VPC, IAM, CloudFormation/Terraform)
- Implement and optimize observability solutions leveraging Datadog, Grafana, and ELK stack for comprehensive monitoring, logging and alerting
- Develop automation scripts and tools to streamline infrastructure management and monitoring workflows
- Ensure system reliability through proactive monitoring, performance tuning and incident response
- Collaborate with development, security and operations teams to define infrastructure requirements and support CI/CD pipelines
- Establish best practices for cloud operations, monitoring and incident management
- Conduct regular system audits, capacity planning and cost optimization initiatives
Requirements
- 3+ years of experience in cloud engineering with a focus on AWS services
- Proficiency in AWS core services (EC2, ECS, Lambda, RDS, S3, CloudWatch, etc.)
- Strong hands-on experience with observability tools: Datadog (APM, Infrastructure Monitoring), Grafana, and ELK (Elasticsearch, Logstash, Kibana)
- Experience with Infrastructure as Code (IaC) tools like Terraform or CloudFormation
- Scripting and automation skills (Python, Bash, etc.)
- Solid understanding of networking concepts (VPC, DNS, load balancing, firewalls)
- Familiarity with CI/CD tools and practices (Bamboo, Vault, etc.)
- Strong problem-solving and analytical skills
- Excellent communication and collaboration abilities
- Ability to work independently, as well as in a fast-paced and agile environment
- B2 level of English or higher, with an emphasis on technical communication skills
Nice to have
- Experience with Kubernetes and container orchestration on AWS (EKS)
- Knowledge of security best practices in AWS environments
- Exposure to serverless architectures and event-driven designs
We offer
- International projects with top brands
- Work with global teams of highly skilled, diverse peers
- Healthcare benefits
- Employee financial programs
- Paid time off and sick leave
- Upskilling, reskilling and certification courses
- Unlimited access to the LinkedIn Learning library and 22,000+ courses
- Global career opportunities
- Volunteer and community involvement opportunities
- EPAM Employee Groups
- Award-winning culture recognized by Glassdoor, Newsweek and LinkedIn
No comments:
Post a Comment