Full-Time

Data Engineer

Boston

Posted on 6/6/2024

CDC Foundation

1,001-5,000 employees

Mobilizes resources for health protection missions

Social Impact

Senior

Remote in USA

Required Skills
Power BI
Microsoft Azure
Agile
Python
MySQL
Apache Flink
NoSQL
Data Science
R
Apache Spark
SQL
Apache Kafka
Java
Postgres
MongoDB
Hadoop
Requirements
  • Bachelor's degree in Computer Science, Information Technology, Data Science, or a related field
  • Minimum of five (5) years of experience in building Data Warehouse and/or Data Lake implementations in a product-centric environment
  • Proficiency in programming languages commonly used in data engineering, such as Python, Java, Scala, and SQL
  • Experience with big data technologies and frameworks like Hadoop, Spark, Kafka, and Flink
  • Strong understanding of database systems, including relational databases (e.g., MySQL, PostgreSQL) and NoSQL databases (e.g., MongoDB, Cassandra)
  • Experience with engineering best practices such as source control, automated testing, continuous integration and deployment, and peer review
  • Knowledge of data warehousing concepts and tools
  • Experience with cloud computing platforms; Microsoft Azure is a plus
  • Expertise in data modeling, in ETL (Extract, Transform, Load) and ELT (Extract, Load, Transform) processes, and in data integration techniques
  • Experience building Data Warehouses, Data Lakes, or other data platforms on the Microsoft Azure platform using Azure Data Factory, SQL Server, Azure Blob Storage (Data Lake Gen2), and the Power BI visualization tool
  • Knowledge of SAS and R is desirable
  • Familiarity with agile development methodologies, software design patterns, and best practices
  • Strong analytical thinking and problem-solving abilities
  • Excellent verbal and written communication skills, including the ability to convey technical concepts to non-technical partners effectively
  • Flexibility to adapt to evolving project requirements and priorities
  • Outstanding interpersonal and teamwork skills, and the ability to develop productive working relationships with colleagues and partners
  • Experience working in a virtual environment with remote partners and teams
  • Proficiency in Microsoft Office
Responsibilities
  • Create and manage data systems and pipelines
  • Collect, transform, and load data into storage systems
  • Optimize data pipelines and infrastructure
  • Design, create, test, deploy, and maintain data pipelines
  • Monitor data pipelines and systems
  • Implement security measures
  • Collaborate with data scientists, analysts, and other partners
  • Collaborate with cross-functional teams
  • Implement and maintain ETL and ELT processes
  • Design and manage data storage systems
  • Stay knowledgeable about industry trends and best practices
  • Provide technical guidance
  • Communicate effectively with partners

The CDC Foundation facilitates a unique public health mission by addressing global health threats through the strategic mobilization of resources and advanced technological solutions. This organization stands out as a leader in managing health crises such as the COVID-19 and Ebola outbreaks by fostering a collaborative environment with a focus on continuous improvement and a deep commitment to public safety. Such a purpose-driven atmosphere not only fulfills critical global needs but also cultivates an inspiring and meaningful workplace.

Company Stage

N/A

Total Funding

N/A

Headquarters

Atlanta, Georgia

Founded

1992

Growth & Insights
Headcount

6 month growth

-4%

1 year growth

7%

2 year growth

-1%