Required Skills: Spark / PySpark, SQL, CI/CD, DevOps fundamentals
Job Description
Job Title: Data Architect
Location: Plano, Texas
Duration: 1 Year
This is a hybrid role based in Plano, TX.
Role Summary:
We are seeking a hands-on Data Architect to design and build a modern Data Lakehouse platform
using Apache Iceberg, Snowflake, and Databricks. This is a developer-first role focused on writing
code, building pipelines and Python microservices, and enabling Data Engineering, Data Science,
and GenAI workloads.
Key Responsibilities:
1. Design and implement scalable Lakehouse architecture
2. Build batch and streaming pipelines using Spark (Databricks)
3. Create and optimize Iceberg tables
4. Develop Python-based microservices and APIs
5. Optimize Snowflake performance and cost
6. Implement data modeling (bronze/silver/gold layers)
7. Enable datasets for Data Science and GenAI use cases
8. Ensure data quality, governance, and performance
Required Skills:
1. Hands-on experience with Iceberg, Snowflake, and Databricks
2. Advanced Python programming
3. Data Engineering (ETL/ELT, CDC, streaming)
4. Microservices architecture
5. Spark / PySpark
6. SQL performance tuning
7. CI/CD and DevOps fundamentals
Nice to Have:
1. Experience supporting GenAI / RAG workflows
2. Feature engineering for ML pipelines
3. Vector data or semantic search experience