Data Engineer
Posted on 3/29/2023
INACTIVE
Locations
San Jose, CA, USA
Experience Level
Entry
Junior
Mid
Senior
Expert
Desired Skills
AWS
BigQuery
Data Structures & Algorithms
Google Cloud Platform
Git
Airflow
Pandas
REST APIs
SQL
Python
Categories
Data & Analytics
Requirements
- Bachelor's degree or higher in an engineering or technical field such as Computer Science, Physics, Mathematics, Statistics, Engineering, or Business Administration, or an equivalent combination of education and experience
- 4+ years' experience in a data engineering role supporting production systems
- 1+ years' experience extracting data from REST APIs
- 1+ years' experience managing a codebase in GitHub
- Previous experience developing ETL pipelines using technologies such as Airflow (preferred), Luigi, Oozie, Azkaban, etc.
- Previous experience developing data models to support a data warehouse
- Experience manipulating and de-normalizing data in JSON format for storage in relational databases
- Experience with Google Cloud Platform or AWS cloud services
- (Preferred) Knowledge and experience with Kubernetes and/or Docker
- (Preferred) Advanced knowledge of SQL and experience working with relational databases. BigQuery experience is an extra plus
- Work revolves around objectives, projects, and priorities, not hours; must be able to work weekends, holidays, and occasional overtime as needed
- Must be able to stand, walk, lift, sit, and bend for a majority of their work schedule
- Must be able to travel to other office locations
- Must be able to use a computer and calculator for 8 hours or more
- Must be 21 years of age or older
- Must comply with all legal or company regulations for working in the industry
- Selected candidate will be required to complete a post-offer, pre-employment background check with local law enforcement or the San Jose Police Department
Responsibilities
- Assist with the implementation of new systems and updates to existing systems by leading the data strategy for each, assuring data integrity, value, and access
- Establish best practices in our data engineering practice and strategy
- Develop appropriate data schemas and structures for use in downstream models/reports
- Develop a data management and oversight program spanning dozens of source systems across all departments, creating new ETL pipelines and maintaining existing ones to ensure data richness and quality
- Engineer for capacity and performance, provide forecasting and future planning, and review and evaluate technology trends
- Recommend and develop changes to source data structures/systems based on observations of data within the context of operational use
- Assemble large, complex data models to meet the needs of operational and strategic stakeholders
- Work closely with our in-house analysts to integrate SQL data models into a dependency tree
- Document and maintain our data lineage and data dictionary
- Other duties and responsibilities as assigned by management
Desired Qualifications
- 1+ years' experience manipulating data using Python (experience with Pandas is a plus)