Job Title: Data & Databricks Practice Lead
Location: Remote
About Us
At Codvo, we are committed to building scalable, future-ready data platforms that power business impact. We believe in a culture of innovation, collaboration, and growth, where engineers can experiment, learn, and thrive. Join us to be part of a team that solves complex data challenges with creativity and cutting-edge technology.
Role Summary
We are looking for a dynamic and visionary Practice Lead to build and scale our Data & AI capabilities with a strong focus on Databricks, Apache Spark, Iceberg, Azure Data Fabric, Amazon EMR, and Redshift. This role is pivotal in driving Lakehouse architecture adoption, cloud-native modernization, and ML engineering across industries such as Oil & Gas, Medtech, Retail CPG, and Fintech.
You will lead a cross-functional team of architects, engineers, and ML specialists to deliver high-impact solutions that combine technical excellence, cost efficiency, and governance maturity.
Key Responsibilities
Strategic Leadership & Practice Growth
• Define and execute the strategic roadmap for the Data & AI practice, aligning with evolving trends in Lakehouse, cloud-native platforms, and AI engineering.
• Build reusable solution accelerators (e.g., Iceberg ingestion templates, Spark ETL frameworks, MLflow pipelines).
• Establish and manage strategic partnerships with Databricks, AWS, Azure, and other ecosystem players.
• Represent the practice in industry forums, client workshops, and thought leadership initiatives.
Technical Architecture & Delivery Oversight
• Architect scalable Lakehouse solutions using Delta Lake or Iceberg, optimized for performance and cost.
• Lead the design of unified batch + streaming pipelines using Spark Structured Streaming and Scala.
• Oversee multi-cloud data platform modernization using Azure Data Fabric, EMR, and Redshift.
• Guide implementation of data governance frameworks using Unity Catalog, Azure Purview, and Iceberg metadata.
Team Leadership & Capability Building
• Mentor and grow a high-performing team of data engineers, ML engineers, and platform specialists.
• Define career paths, training plans, and certification goals aligned with Databricks, AWS, and Azure.
• Foster a culture of innovation, experimentation, and continuous improvement.
Client Engagement & Business Impact
• Engage with clients to understand business challenges and translate them into scalable data solutions.
• Lead solutioning, proposal development, and delivery for strategic engagements.
• Ensure delivery excellence through performance optimization (Photon, EMR Autoscaling) and cost engineering.
Required Skills & Experience
• 10+ years of experience in data engineering, architecture, and platform modernization.
• 3+ years of hands-on experience with Databricks, Apache Spark, Iceberg, and Scala.
• Deep expertise in cloud-native data platforms (Azure Data Fabric, Amazon EMR, Redshift).
• Proven track record in building ML pipelines, implementing MLOps (MLflow), and integrating Redshift ML.
• Strong understanding of data governance, observability, and schema evolution.
• Excellent leadership, communication, and stakeholder management skills.
Preferred Qualifications
• Certifications in Databricks, Azure Data Engineering, or AWS Big Data.
• Experience in vertical-specific data solutions (e.g., predictive maintenance in Oil & Gas, real-time analytics in Retail).
• Familiarity with data mesh architectures, time travel, and audit compliance frameworks.
Why Join Us?
• Be at the forefront of Lakehouse innovation and AI-driven transformation.
• Work with cutting-edge technologies and shape the future of data platforms.
• Lead a high-impact practice with global reach and industry relevance.