Data Engineer
Updated on 1/25/2023
Locations
Remote
Experience Level
Entry
Junior
Mid
Senior
Expert
Desired Skills
Data Analysis
Development Operations (DevOps)
Pandas
Python
Requirements
- Experience working with scientific or geospatial datasets
Responsibilities
- Collect, standardize, clean, and organize heterogeneous mineral exploration data. This data comes from a variety of sources (government, third parties, collected in our own operations), includes multiple different types (unstructured text, images, tabular data, …), and comes in a variety of formats and cleanliness. This role will handle the idiosyncrasies of each data set
- Build tooling to support our field operations such as dashboards, visualizations, data processing pipelines, integrations with third party software
- Build scalable, reliable, and performant systems to support the above operations, in collaboration with the engineering team
- Eliminate work through automation and efficiency improvements
- Optimize existing systems and processes to increase scale and velocity of our data processing
- Evangelize automation and engineering best practices throughout Kobold. Mentor other team members to adopt them
- Work with data scientists and geoscientists to generate data that meet their data quality and accessibility requirements
- At least 5 years of experience in a Site Reliability Engineering (SRE), DevOps or Software Engineering role, ideally working on a data product
- Proficiency in large-scale system design
- Proficiency in Python, including data processing libraries such as pandas and distributed computing frameworks such as Dask
- Proficiency in performance optimization for data storage and serving
- Discerning ability, curiosity and initiative to identify inefficiencies and opportunities for automation. Creativity and follow-through to address these problems in a systematic way
- Track record of scaling a fast-growing product, in a startup or as a member of a small team
- Collaborative attitude to work with stakeholders with different backgrounds (data scientists, geoscientists, software engineers, operations)