Full-Time
Data Engineer
Posted on 4/3/2024
Cloud solutions for life sciences sector
Mid
Remote in USA
- 3+ years of experience developing data pipelines using cloud-managed Spark clusters (e.g. AWS EMR, Databricks)
- Fluent in Python and PySpark (3+ years of experience)
- Previous experience building tools and libraries to automate and streamline data processing workflows
- Proficient with SQL / SparkSQL
- Hands-on experience working with a Data Lakehouse
- Good verbal and written communication skills and proven experience working and delivering in an Agile environment
- Applicants must have the unrestricted right to work in the United States. Veeva will not provide sponsorship at this time
- Looking for strong mentors with a proven record of making your team better
- Build and maintain data processing pipelines and tools using state-of-the-art technologies
- Work with Python and SQL on Spark-based data pipelines
- Develop algorithms to build complex data relationships
- Build analytical data structures to support reporting
- Build and maintain Data Quality processes
- Collaborate with the Product team to adapt reference data to changing demands in the market
Veeva Systems offers industry cloud solutions for the life sciences sector, providing technologies such as Vault Clinical Data Management, Vault EDC, Vault Coder, Vault Clinical Operations, Vault RIM Suite, Vault Quality Suite, Vault Safety Suite, Veeva Medical Suite, Veeva Data Cloud, and Veeva Commercial Cloud to support critical functions from R&D through commercialization. These technologies aim to streamline quality processes, manage clinical data, and improve regulatory compliance for life sciences companies.
Company Stage
IPO
Total Funding
$224M
Headquarters
Pleasanton, California
Founded
2007
6 month growth
↑ 3%
1 year growth
↑ 10%
2 year growth
↑ 37%
Benefits
Parental leave
PTO
Free food
Health, dental, & vision insurance
Gym membership reimbursement