Egen
Egen5h ago
New

Lead Data Engineer – Python & GCP

IndiaIndia·HyderabadFull Timelead
Data EngineerData
0 views0 saves0 applied

Quick Summary

Requirements Summary

10 to 12 years of relevant IT ex

Technical Tools
Data EngineerData
Job Overview:           
We are looking for a skilled and motivated Lead Data Engineer with strong experience in Python programming and Google Cloud Platform (GCP) to join our data engineering team. The ideal candidate will be responsible for requirements gathering, designing, architecting the solution, developing, and maintaining robust and scalable ETL (Extract, Transform, Load) & ELT data pipelines. The role involves working with customers directly, gathering requirements, discovery phase,  designing, architecting the solution, using various GCP services, implementing data transformations, data ingestion, data quality, and consistency across systems, and post post-delivery support.
 
Experience Level: 10 to 12 years of relevant IT experience
  • Design, develop, test, and maintain scalable ETL data pipelines using Python.
  • Architect the enterprise solutions with various technologies like Kafka, multi-cloud services, auto-scaling using GKE, Load balancers, APIGEE proxy API management, DBT, using LLMs as needed in the solution, redaction of sensitive information, DLP (Data Loss Prevention) etc.
  • Work extensively on Google Cloud Platform (GCP) services such as:
  • Dataflow for real-time and batch data processing
  • Cloud Functions for lightweight serverless compute
  • BigQuery for data warehousing and analytics
  • Cloud Composer for orchestration of data workflows (on Apache Airflow)
  • Google Cloud Storage (GCS) for managing data at scale
  • IAM for access control and security
  • Cloud Run for containerized applications
  • Should have experience in the following areas : 
  • API framework: Python FastAPI
  • Processing engine: Apache Spark
  • Messaging and streaming data processing: Kafka
  • Storage: MongoDB, Redis/Bigtable
  • Orchestration: Airflow
  • Experience in deployments in GKE, Cloud Run.
  • Perform data ingestion from various sources and apply transformation and cleansing logic to ensure high-quality data delivery.
  • Implement and enforce data quality checks, validation rules, and monitoring.
  • Collaborate with data scientists, analysts, and other engineering teams to understand data needs and deliver efficient data solutions.
  • Manage version control using GitHub and participate in CI/CD pipeline deployments for data projects.
  • Write complex SQL queries for data extraction and validation from relational databases such as SQL Server, Oracle, or PostgreSQL.
  • Document pipeline designs, data flow diagrams, and operational support procedures.
  • Designing, building, and maintaining large-scale data ingestion and ETL pipelines.
  • Managing Pub/Sub-based catalog feeds, Vertex AI embedding generation, BigQuery analytics workflows, and PP5 Knowledge Graph data pipelines.
  • 10 to 12 years of hands-on experience in Python for backend or data engineering projects.
  • Strong understanding and working experience with GCP cloud services (especially Dataflow, BigQuery, Cloud Functions, Cloud Composer, etc.).
  • Solid understanding of data pipeline architecture, data integration, and transformation techniques.
  • Experience in working with version control systems like GitHub and knowledge of CI/CD practices.
  • Experience in Apache Spark, Kafka, Redis, Fast APIs, Airflow, GCP Composer DAGs.
  • Strong experience in SQL with at least one enterprise database (SQL Server, Oracle, PostgreSQL, etc.).
  • Experience in data migrations from on-premise data sources to Cloud platforms.
  • Experience working with the Snowflake cloud data platform.
  • Hands-on knowledge of Databricks for big data processing and analytics.
  • Familiarity with Azure Data Factory (ADF) and other Azure data engineering tools.
  • Excellent problem-solving and analytical skills.
  • Strong communication skills and ability to collaborate in a team environment.
  •  Bachelor's degree in Computer Science, a related field, or equivalent experience.
  • Location & Eligibility

    Where is the job
    Hyderabad, India
    Hybrid — some on-site time required
    Who can apply
    IN

    Listing Details

    Posted
    June 18, 2026
    First seen
    June 18, 2026
    Last seen
    June 18, 2026

    Posting Health

    Days active
    0
    Repost count
    0
    Trust Level
    70%
    Scored at
    June 18, 2026

    Signal breakdown

    freshnesssource trustcontent trustemployer trust
    Egen
    Egen
    lever

    Egen Solutions is a cutting-edge technology company that specializes in artificial intelligence and machine learning.

    Employees
    125
    Founded
    2018
    Domain
    egen.ai
    View company profile
    Newsletter

    Stay ahead of the market

    Get the latest job openings, salary trends, and hiring insights delivered to your inbox every week.

    A
    B
    C
    D
    Join 12,000+ marketers

    No spam. Unsubscribe at any time.

    EgenLead Data Engineer – Python & GCP