Lead Data Engineer
  • Fuge Technologies Inc
133 Days Ago
NA
NA
Phoenix-AZ, Atlanta-GA, Dallas-TX
10-15 Years
Required Skills: Data mesh, Data Lakehouse architecture, Ml Ops, Python, SQL, Spark, Cloud platforms, data modeling, Data warehousing, Kafka, Kinesis, Flink, Airflow, Prefect, Dagster, Terraform, Docker, Kubernetes, GDPR, HIPAA
Job Description
Job Title: Lead Data Engineer
Location: TX, AZ, CA, GA, NJ, NY / Remote
Job Type: W2
Experience: 10+ years

Job Summary:
We are seeking a highly skilled Lead Data Engineer to architect, build, and manage scalable data infrastructure and pipelines. You will lead a team of data engineers and collaborate with data scientists, analysts, and stakeholders to drive data-driven decision-making across the organization.

Key Responsibilities:
  • Lead design and development of end-to-end data pipelines (batch and streaming) using modern technologies.

  • Architect and maintain cloud-based data platforms (e.g., AWS, Azure, GCP).

  • Develop and optimize ETL/ELT processes to ingest data from various internal and external sources.

  • Implement data governance practices including data quality, cataloging, and lineage.

  • Collaborate with business and analytics teams to understand data requirements and deliver high-quality datasets.

  • Mentor junior engineers and provide technical leadership on data projects.

  • Drive best practices in data engineering, DevOps, and CI/CD pipelines.

  • Ensure scalability, performance, and cost-efficiency of data platforms.


Required Skills & Experience:
  • 8+ years of experience in data engineering or related fields.

  • 3+ years of experience in a lead or senior-level role.

  • Strong proficiency in Python, SQL, and Spark.

  • Experience with cloud platforms like AWS (S3, Redshift, Glue, EMR), Azure, or Google Cloud (BigQuery, Dataflow).

  • Deep knowledge of data modeling, warehousing (Snowflake, Redshift, BigQuery), and performance tuning.

  • Experience with streaming technologies (Kafka, Kinesis, Flink).

  • Familiarity with orchestration tools (Airflow, Prefect, Dagster).

  • Exposure to infrastructure as code (Terraform, CloudFormation) and containerization (Docker, Kubernetes).

  • Strong understanding of data privacy and compliance (e.g., GDPR, HIPAA).


Preferred Qualifications:
  • Bachelor’s or Master’s degree in Computer Science, Engineering, or related field.

  • Experience in data mesh or data lakehouse architectures.

  • Background in leading cross-functional data initiatives.

  • Experience with ML Ops or support for data science workflows.


Soft Skills:
  • Excellent communication and interpersonal skills.
  • Ability to work in an agile, fast-paced environment.
  • Strong problem-solving and decision-making abilities.
  • Proven leadership and mentoring skills.

Jobseeker

Looking For Job?
Search Jobs

Recruiter

Are You Recruiting?
Search Candidates