Data Engineer
Confirmed live in the last 24 hours
Veeva Systems

5,001-10,000 employees

Cloud computing services for pharmaceutical companies.
Company Overview
Veep's mission is to help R&D, quality, and regulatory teams eliminate inefficiencies and bring high-quality, safe, sustainable products to market without compromising quality. The company builds cloud-based tools for pharmaceutical research.

Company Stage

IPO

Total Funding

$224M

Founded

2007

Headquarters

Pleasanton, California

Growth & Insights
Headcount

6 month growth

5%

1 year growth

15%

2 year growth

51%
Locations
Kansas City, MO, USA
Experience Level
Entry
Junior
Mid
Senior
Expert
Desired Skills
Agile
Apache Spark
AWS
Data Analysis
Data Structures & Algorithms
Airflow
SQL
Python
CategoriesNew
Data & Analytics
Requirements
  • 3+ years of experience developing data pipelines using cloud-managed Spark clusters (e.g. AWS EMR, Databricks)
  • Fluent in Python programming language and PySpark (3+ years of experience)
  • Proficient with SQL/SparkSQL
  • Hands-on experience working with a Data Lakehouse
  • Good verbal and written communication and proven experience of working and delivering in an Agile environment
Responsibilities
  • Build and maintain data processing pipelines using state-of-the-art technologies
  • Work with Python and SQL on Spark-based data pipelines
  • Build analytical data structures to support reporting
  • Build and maintain Data Quality processes
  • Collaborate with Product and Data Operations teams to adapt our reference data to changing demands in the market
Desired Qualifications
  • Experience running data workflows through the DevOps pipeline
  • Develop data pipelines with orchestration tools (e.g. Airflow)
  • Experience with AWS services for data processing like EMR, MWAA, etc
  • Previous experience in the Life Sciences sector