Lead Data Engineer w/ IDMC experience
Posted on 3/1/2024
INACTIVE
Thoughtworks

10,001+ employees

Global technology consultancy driving digital innovation
Company Overview
Thoughtworks stands out as a global technology consultancy with a rich history of over 28 years, offering a unique blend of strategy, design, and engineering to drive digital innovation. The company's culture fosters learning and growth, bringing together a diverse mix of technologists, from fresh computer science graduates to experienced professionals and self-taught developers, creating an environment that encourages mutual learning and challenge. With a proven track record in creating adaptable technology platforms, designing top-tier digital products, and harnessing data and AI, Thoughtworks has consistently demonstrated industry leadership and a competitive edge.
Consumer Software
Consulting
Fintech

Company Stage

Private

Total Funding

$748M

Founded

1993

Headquarters

Chicago, Illinois

Growth & Insights
Headcount

6 month growth

0%

1 year growth

-2%

2 year growth

3%
Locations
Remote in USA
Experience Level
Entry
Junior
Mid
Senior
Expert
Desired Skills
Power BI
Microsoft Azure
Python
Apache Spark
Tableau
AWS
Hadoop
Data Analysis
CategoriesNew
Data Engineering
Data Management
Data & Analytics
Requirements
  • Experience with Informatica Data Management Cloud (IDMC) or PowerCenter, or axon, or IDQ (Informatica Data Quality)
  • Experience with AWS Services (Glue, EMR, lambda, database), Python, React, Spark
  • Experience performing as an architect and understanding architectural principles
  • Experience with Thoughtspot or (PowerBI, Tableau, Quicksight)
  • Nice to have experience with Databricks, Docker, and EKS
  • Knowledge and Hands-on Experience in Data Quality and Data Governance
  • Hands-on experience in MapR, Cloudera, Hortonworks and/or cloud (AWS EMR, Azure HDInsights, Qubole etc.) based Hadoop distributions
  • Experience in creating Big data architecture, building and operating data pipelines, and maintaining data storage within distributed systems
Responsibilities
  • Develop modern data architecture approaches to meet key business objectives and provide end-to-end data solutions
  • Partner with teammates to create complex data processing pipelines to solve clients' most ambitious challenges
  • Collaborate with Data Scientists to design scalable implementations of their models
  • Write clean and iterative code based on TDD
  • Leverage continuous delivery practices to deploy, support, and operate data pipelines
  • Advise and educate clients on the use of different distributed storage and computing technologies
  • Act as the architect, leading the design of technical solutions, or overseeing a program inception to build a new product
  • Incorporate data quality into day-to-day work and the delivery process
  • Assure effective collaboration between Thoughtworks' and the client's teams