Archimedes AI Technologies Inc
Location
Bangalore | India
Job description
Responsibilities :Design, implement, and maintain CI/CD pipelines for automated build, test, and deployment of AI models and applications.Automate infrastructure provisioning and configuration management using tools like Terraform, Ansible, or similar.Collaborate with software developers to optimize application performance and resource utilization in cloud environments.Monitor, troubleshoot, and optimize system performance, reliability, and scalability.Implement and maintain container orchestration platforms such as Kubernetes for deploying and managing AI workloads.Manage and configure cloud services and resources on platforms like AWS, Azure, or Google Cloud Platform.Implement security best practices and compliance requirements for AI infrastructure and applications.Work closely with the research and development teams to deploy and scale machine learning models and algorithms.Participate in on-call rotation and respond to incidents related to production systems.Continuously research and evaluate new technologies and tools to improve the efficiency and effectiveness of our DevOps processes.Document infrastructure configurations, processes, and procedures to ensure knowledge sharing and maintainability.Skills and Qualifications :Bachelor's degree in Computer Science, Engineering, or related field.4+ years of experience working in a DevOps or similar role.Strong understanding of software development lifecycle and CI/CD concepts.Proficiency in scripting languages such as Python, Bash, or PowerShell.Experience with configuration management tools like Terraform, Ansible, Chef, or Puppet.Hands-on experience with containerization technologies such as Docker and container orchestration platforms like Kubernetes.Familiarity with cloud computing platforms such as AWS, Azure, or Google Cloud Platform.Knowledge of networking, security, and compliance principles in cloud environments.Experience with monitoring and logging tools such as Prometheus, Grafana, ELK stack, or similar.Strong problem-solving skills and ability to troubleshoot complex issues in distributed systems.Excellent communication and collaboration skills, with the ability to work effectively in a team environment.Experience with machine learning frameworks and tools is a plus. (ref:hirist.tech)
Job tags
Salary