Development

Solution Architect - Cloud & DevOps

Pune, Maharashtra
Work Type: Full Time

Job Description: Solution Architect - Cloud & DevOps 


Role Summary:


Codvo is seeking an experienced Solution Architect specializing in Cloud Infrastructure and DevOps practices to join our team. In this role, you will be responsible for designing, implementing, and managing the cloud infrastructure and DevOps strategies that underpin our Generative AI platform and client deployments. You will ensure our solutions are deployed securely, reliably, and efficiently, whether on public cloud platforms or client-specific private cloud environments. You will work closely with the Generative AI Solution Architect, engineering teams, and operations to build scalable, automated, and robust infrastructure solutions.


Key Responsibilities:


  • Cloud Architecture Design: Design scalable, resilient, and cost-effective cloud architectures (AWS, Azure, GCP, and private clouds) to host Codvo's Generative AI platform components and client solutions.
  • DevOps Strategy & Implementation: Define and implement CI/CD pipelines, Infrastructure as Code (IaC) practices (using tools like Terraform, CloudFormation, or ARM templates), configuration management, and automated deployment strategies.
  • Deployment Architecture: Architect deployment solutions for containerized applications (Docker, Kubernetes) across different environments, including client private clouds, ensuring seamless integration and operation.
  • Infrastructure for AI/ML: Design and manage infrastructure components specific to AI/ML workloads, such as compute resources for model tuning/inference, vector databases, message queues (NATS, Kafka), and data storage solutions.
  • Monitoring, Logging & Alerting: Implement comprehensive monitoring, logging, and alerting solutions (e.g., Prometheus, Grafana, ELK stack, CloudWatch, Azure Monitor) to ensure system health, performance, and availability.
  • Security & Compliance: Integrate security best practices throughout the cloud infrastructure and DevOps lifecycle (DevSecOps). Implement network security, identity and access management (IAM), vulnerability management, and ensure compliance with relevant standards.
  • Automation: Drive automation initiatives across infrastructure provisioning, configuration, deployment, and operational tasks.
  • Collaboration: Work closely with the Generative AI Solution Architect to align infrastructure capabilities with application architecture needs. Collaborate with development teams to optimize applications for cloud deployment and operational efficiency.
  • Client Environment Integration: Plan and execute the deployment and configuration of Codvo solutions within client-specific private cloud environments, addressing unique technical and security requirements.
  • Performance & Cost Optimization: Continuously monitor and optimize cloud infrastructure for performance and cost-effectiveness.
  • Documentation: Create and maintain detailed documentation for infrastructure architecture, DevOps processes, deployment procedures, and operational runbooks.


Required Qualifications & Skills:


  • Bachelor's or Master's degree in Computer Science, Engineering, or a related technical field.
  • Proven experience (typically 10+ years) in IT infrastructure and operations, with significant experience (4+ years) in a Cloud Architect, DevOps Architect, or similar role.
  • Deep expertise in designing, deploying, and managing solutions on at least one major cloud platform (AWS, Azure, or GCP). Experience with hybrid or private cloud environments is a strong plus.
  • Strong hands-on experience with Infrastructure as Code (IaC) tools (e.g., Terraform, CloudFormation, ARM Templates).
  • Proficiency with containerization technologies (Docker) and container orchestration platforms (Kubernetes).
  • Experience implementing and managing CI/CD pipelines using tools like Jenkins, GitLab CI, Azure DevOps, or similar.
  • Solid understanding of networking concepts (VPCs, subnets, firewalls, load balancers) and cloud security best practices.
  • Experience with monitoring, logging, and alerting tools and frameworks.
  • Scripting skills (e.g., Python, Bash, PowerShell).
  • Excellent problem-solving and troubleshooting skills related to infrastructure and deployment issues.
  • Strong communication and collaboration skills.
  • Experience working in Agile/Scrum environments.


Preferred Qualifications:


  • Experience managing infrastructure for AI/ML workloads (e.g., setting up environments for model training/serving, managing vector databases).
  • Experience deploying and managing applications in client-owned private cloud environments
  • Relevant cloud certifications (e.g., AWS Certified Solutions Architect - Professional, AWS Certified DevOps Engineer - Professional, Azure Solutions Architect Expert, Google Cloud Certified - Professional Cloud Architect/DevOps Engineer).
  • Experience with GitOps practices.
  • Understanding of Generative AI concepts and their infrastructure implications.

Submit Your Application

You have successfully applied
  • You have errors in applying