Lead Data/Machine Learning Engineer
Confirmed live in the last 24 hours
Thoughtworks

10,001+ employees

Global technology consultancy driving digital innovation
Company Overview
Thoughtworks stands out as a global technology consultancy with a rich history of over 28 years, offering a unique blend of strategy, design, and engineering to drive digital innovation. The company's culture fosters learning and growth, bringing together a diverse mix of technologists, from fresh computer science graduates to experienced professionals and self-taught developers, creating an environment that encourages mutual learning and challenge. With a proven track record in creating adaptable technology platforms, designing top-tier digital products, and harnessing data and AI, Thoughtworks has consistently demonstrated industry leadership and a competitive edge.
Consumer Software
Consulting
Fintech

Company Stage

Private

Total Funding

$748M

Founded

1993

Headquarters

Chicago, Illinois

Growth & Insights
Headcount

6 month growth

-2%

1 year growth

-5%

2 year growth

13%
Locations
Chicago, IL, USA
Experience Level
Entry
Junior
Mid
Senior
Expert
Desired Skills
Apache Hive
Apache Spark
AWS
Apache Kafka
Data Analysis
Hadoop
Airflow
Microsoft Azure
NoSQL
Cassandra
CategoriesNew
Software Engineering
Requirements
  • You are equally happy coding and leading a team to implement a solution
  • You have a track record of innovation and expertise in Data Engineering
  • You're passionate about craftsmanship and have applied your expertise across a range of industries and organizations
  • You have a deep understanding of data modelling and experience with data engineering tools and platforms such as Kafka, Spark, and Hadoop
  • You have built large-scale data pipelines and data-centric applications using any of the distributed storage platforms such as HDFS, S3, NoSQL databases (Hbase, Cassandra, etc.) and any of the distributed processing platforms like Hadoop, Spark, Hive, Oozie, and Airflow in a production setting
  • Hands on experience in MapR, Cloudera, Hortonworks and/or cloud (AWS EMR, Azure HDInsights, Qubole etc.) based Hadoop distributions
  • You are comfortable taking data-driven approaches and applying data security strategy to solve business problems
  • You're genuinely excited about data infrastructure and operations with a familiarity working in cloud environments
  • Working with data excites you: you have created Big data architecture, you can build and operate data pipelines, and maintain data storage, all within distributed systems
  • Advocate your data engineering expertise to the broader tech community outside of Thoughtworks, speaking at conferences and acting as a mentor for more junior-level data engineers
  • You're resilient and flexible in ambiguous situations and enjoy solving problems from technical and business perspectives
  • An interest in coaching others, sharing your experience and knowledge with teammates
  • You enjoy influencing others and always advocate for technical excellence while being open to change when needed
Responsibilities
  • You might spend a few weeks with a new client on a deep technical review or a complete organizational review, helping them to understand the potential that data brings to solve their most pressing problems
  • You will partner with teammates to create complex data processing pipelines in order to solve our clients' most ambitious challenges
  • You will collaborate with Data Scientists in order to design scalable implementations of their models
  • You will pair to write clean and iterative code based on TDD and leverage various continuous delivery practices to deploy, support and operate data pipelines*
  • Advise and educate clients on how to use different distributed storage and computing technologies from the plethora of options available
  • Develop and operate modern data architecture approaches to meet key business objectives and provide end-to-end data solutions
  • Create data models and speak to the tradeoffs of different modeling approaches
  • On other projects, you might be acting as the architect, leading the design of technical solutions, or perhaps overseeing a program inception to build a new product
  • Seamlessly incorporate data quality into your day-to-day work as well as into the delivery process
  • Assure effective collaboration between Thoughtworks' and the client's teams, encouraging open communication and advocating for shared outcomes