We are looking for a highly skilled Cloud Site Reliability Engineer (SRE) to join our offshore team
The Cloud SRE will play a key role in ensuring the reliability, scalability, and performance of our cloud-based systems
If you have a strong background in cloud technologies, a passion for automation, and the ability to address reliability challenges, we invite you to apply for this position
Responsibilities:
Cloud Infrastructure Management:Manage and optimize cloud infrastructure, ensuring high availability and reliability
Implement best practices for scalability, security, and performance in a cloud environment
Monitoring and Incident Response:Develop and implement monitoring solutions to proactively identify and address potential issues
Respond to and resolve incidents, ensuring minimal impact on system performance
Automation:Implement automation for routine operational tasks to enhance efficiency
Script and deploy infrastructure as code (IaC) to support cloud services
Reliability Engineering:Conduct reliability assessments and identify areas for improvement
Collaborate with development teams to enhance system reliability and stability
MSoW (Managed Service Operations and Workload):Provide support for managed services and workloads in a cloud environment
Collaborate with onshore teams on project requirements and deliverables
Security Compliance:Ensure cloud infrastructure complies with security policies and standards
Collaborate with security teams to address vulnerabilities and implement secure practices
Documentation:Maintain comprehensive documentation for cloud architecture, configurations, and procedures
Create documentation for incident response and troubleshooting
Collaboration:Collaborate with cross-functional teams, including development, operations, and security
Participate in knowledge-sharing activities and contribute to continuous learning