Full-Time

Jr. Data Engineer

Posted on 11/21/2024

Sayari

201-500 employees

Provides risk intelligence for supply chains

Data & Analytics
Financial Services

Compensation Overview

$100k - $125k Annually

Entry, Junior

Remote in USA

Category
Data Engineering
Data & Analytics
Required Skills
Kubernetes
Microsoft Azure
Python
NoSQL
Git
BigQuery
Apache Spark
SQL
Docker
AWS
Scala
Google Cloud Platform
Requirements
  • Professional experience with Python and a JVM language (e.g., Scala)
  • 2+ years of experience designing and maintaining data pipelines
  • Experience using Apache Spark and Apache Airflow
  • Experience with SQL and NoSQL databases (e.g., column stores, graph databases)
  • Experience working on a cloud platform like GCP, AWS, or Azure
  • Experience working collaboratively with Git
  • Understanding of Docker/Kubernetes
  • Interest in learning from and mentoring team members
  • Experience supporting and working with cross-functional teams in a dynamic environment
  • Passionate about open source development and innovative technology
  • Experience working with analytics and BI tools like BigQuery and Superset is a plus
  • Understanding of knowledge graphs is a plus
Responsibilities
  • Write and deploy crawling scripts to collect source data from the web
  • Write and run data transformers in Scala Spark to standardize bulk data sets
  • Write and run modules in Python to parse entity references and relationships from source data
  • Diagnose and fix bugs reported by internal and external users
  • Analyze and report on internal datasets to answer questions and inform feature work
  • Work collaboratively within and across engineering teams using agile principles
  • Give and receive feedback through code reviews

Sayari provides intelligence on counterparty and supply chain risks, helping businesses and analysts make informed decisions by offering visibility into corporate relationships and stakeholders. The platform integrates trade data from 65 countries with global corporate ownership data, allowing clients to identify various risks, including regulatory and reputational risks. Sayari stands out from competitors with its subscription-based model, which ensures continuous access to updated data and features like batch supplier screening. The company's goal is to enhance transparency and mitigate risks in global commerce.

Company Stage

N/A

Total Funding

$53.2M

Headquarters

Washington, District of Columbia

Founded

2015

Growth & Insights
Headcount

6 month growth

49%

1 year growth

52%

2 year growth

114%

Simplify's Take

What believers are saying

  • The $235 million strategic investment from TPG provides substantial capital for organic growth and M&A opportunities, indicating strong financial backing and growth potential.
  • Sayari's platform is trusted by high-profile clients, including U.S. and European regulators, law enforcement, and over 100 of the world's largest companies, showcasing its reliability and market penetration.
  • The addition of experienced leaders like Steve Nguyen to accelerate U.S. government business suggests a focused strategy to expand in lucrative sectors.

What critics are saying

  • The competitive landscape in supply chain risk intelligence is intensifying, requiring Sayari to continuously innovate to maintain its edge.
  • Dependence on subscription-based revenue means that any downturn in client renewals could impact financial stability.

What makes Sayari unique

  • Sayari's integration of trade data from 65 countries with global corporate ownership data offers unparalleled visibility into cross-border networks, setting it apart from competitors.
  • The company's focus on counterparty and supply chain risk intelligence specifically addresses regulatory, reputational, and business continuity risks, unlike broader risk management platforms.
  • Partnerships with firms like GAN Integrity and significant investments from TPG highlight Sayari's commitment to combating corruption and modern slavery, enhancing its credibility and market position.
