Job Title: Data Engineer
Experience: 6+ years
About Us
At Codvo, we are committed to building scalable, future-ready data platforms that power business impact. We believe in a culture of innovation, collaboration, and growth, where engineers can experiment, learn, and thrive. Join us to be part of a team that solves complex data challenges with creativity and cutting-edge technology.
Key Responsibilities:
- Design, implement, and maintain scalable data pipelines on Google Cloud Platform (BigQuery, Dataflow, Pub/Sub, Cloud Storage).
- Ingest and process real-time data from connected vehicles and IoT devices.
- Develop and maintain ETL/ELT workflows using Python, SQL, and Apache Beam/Spark.
- Ensure data quality, lineage, and governance in the data lake.
- Act as a bridge between data engineering and analytics teams, performing data wrangling, feature engineering, and ML model prototyping.
- Support pipeline automation and ML model deployment on GCP.
- Collaborate with AI/ML leads to operationalize machine learning pipelines and integrate data solutions with applications.
Skills & Expertise Required:
- Strong experience with Python, SQL, and GCP services (BigQuery, Dataflow, Vertex AI, Pub/Sub, Cloud Storage).
- Experience with Airflow, CI/CD, and data pipeline orchestration.
- Familiarity with ML frameworks (scikit-learn, TensorFlow) and feature engineering.
- Expertise in API integration, data ingestion, and automation of ML pipelines.
- Strong understanding of data quality, governance, and lineage best practices.