Full-Time

Big Data Lead

Posted on 8/5/2025

HEXAWARE

No salary listed

United States

In Person

Category
Data & Analytics
Requirements
  • Strong proficiency in Python for data manipulation and automation.
  • Hands-on experience with Pentaho Data Integration (PDI).
  • Solid understanding of ETL concepts, data warehousing, and data modeling.
  • Experience with SQL (joins, aggregations, subqueries).
  • Familiarity with Azure Data Factory, cloud storage (Blob, Data Lake), and DevOps tools.
  • Experience with version control using Git or Azure DevOps.
  • Basic scripting in PowerShell or Shell is a plus.
  • Bachelor's degree in Computer Science, Engineering, or a related field.
  • 5 to 7 years of experience in data engineering or ETL development.
Responsibilities
  • Analyze and document existing Pentaho ETL jobs, transformations, and data flows.
  • Translate Pentaho logic into Python scripts and/or Azure Data Factory pipeline components.
  • Develop and maintain scalable Python-based data processing solutions.
  • Validate data accuracy post-migration using automated testing and SQL queries.
  • Collaborate with data engineers, architects, and QA teams to troubleshoot issues.
  • Create technical documentation and participate in knowledge transfer sessions.
Desired Qualifications
  • Prior experience in migration projects is highly desirable.
  • Detail-oriented with a meticulous approach to data validation.
  • Strong communication and documentation abilities.
  • Collaborative mindset with a proactive attitude.

INACTIVE