Required Skills: ADF, ADO, API, Agile ,Data Processing, Data Engineering ,Data Quality, Data Science, Microsoft Azure, Microsoft Power BI, Microsoft, Root Cause Analysis, TMS, SQL, DevOps
Job Description
Responsibilities
Validate human labeling results in the data pipeline to ensure data accuracy and consistency.
Understand data labeling requirements, propose solutions for how upper stream data labeling team should correct label quality by performing root cause analysis based on data quality evaluation
Clean and preprocess TMS/IMS/Jailbreak training and evaluation datasets to maintain high-quality data standards.
Collaborate with the API RAI team to understand data requirements and provide data solutions.
Develop and implement data pipelines and workflows to automate data processing tasks.
Monitor and maintain data quality throughout the data lifecycle.
Work closely with cross-functional teams to support data-driven decision-making.
Working with internal team to create business plans for strategic growth.
Other responsibilities assigned by the CSI management team for business development.
Requirement
Bachelor's degree in Computer Science, Data Engineering, Data Science or a related field.
5 years + experience as a data engineer, analytics engineer, software engineer
Strong coding skills in C#, Python, and SQL.
Experience with Azure DevOps (ADO) and pipeline maintenance.
Familiarity with Azure services and Azure Data Factory (ADF).
Proven experience in automation and data engineering.
Excellent problem-solving skills and attention to detail.
Strong communication and collaboration skills.
Preferred Qualifications:
Prior experience related to human labeled data quality evaluation process is strongly preferred.
Experience with Microsoft is preferred
Knowledge of machine learning and AI concepts.
Prior experience with evaluation of data quality and identifying bottlenecks.
Familiarity with data visualization tools (e.g., Power BI, Tableau).
Experience working in an Agile development environment.