Full-Time

Principal Data Engineer

Enterprise Corporate Data Team

Posted on 7/18/2025

LocalEdge

LocalEdge

201-500 employees

Local digital marketing for small businesses

Compensation Overview

$325k - $350k/yr

New York, NY, USA

In Person

This role is based in New York City.

Category
Data & Analytics (1)
Required Skills
Microsoft Azure
Python
Airflow
Apache Spark
SQL
Machine Learning
Apache Kafka
TypeScript
AWS
Data Analysis
Google Cloud Platform
Requirements
  • 10+ years of experience in data engineering, with significant experience building large-scale, distributed data systems to support Data analysis, AI/ ML and key business use cases.
  • Proven expertise in content classification, tagging, and ontology/taxonomy development, especially using NLP and semantic techniques.
  • Strong coding and data architecture skills using Typescript, Python, SQL, and tools like Apache Spark, Kafka, Airflow, Node Js, and cloud-native platforms (e.g., AWS, GCP, or Azure).
  • Hands-on experience integrating ML models into production environments for tasks such as entity extraction, text classification, or semantic search.
  • Deep understanding of working with unstructured data (text, images, video), metadata enrichment, and knowledge graph integration.
  • Experience managing and mentoring distributed/offshore engineering teams, with a track record of driving execution across time zones.
  • Excellent communication and collaboration skills, with the ability to bridge technical execution and business strategy.
Responsibilities
  • Lead the design and implementation of high-performance data pipelines and infrastructure to support automated generation of semantic ontology and knowledge graph.
  • Architect scalable data platforms that integrate structured and unstructured data—including behavioral signals, content metadata, and user engagement data—for Gen AI use cases.
  • Build systems that enable semantic enrichment of content through entity recognition, disambiguation, normalization and deduplication techniques.
  • Drive the creation and maintenance of flexible ontologies and taxonomies to organize media content for personalization, recommendation, and audience segmentation.
  • Partner closely with ML engineers and data scientists to deploy and operationalize models for content and audience intelligence.
  • Oversee and co-ordinate with an offshore engineering team, providing technical guidance, code reviews, and project oversight to ensure timely, high-quality deliverables.
  • Ensure best practices in data governance, quality, observability, and documentation across all engineering workflows.
  • Collaborate with stakeholders across product, marketing, and data science to translate business needs into scalable AI data systems.
  • Well versed in architecting, designing and developing large scale OLTP and OLAP systems.
  • Experience building and operating streaming systems using messaging systems like Kafka, Pub/sub, SQS etc.
  • Experience building an RAG system with Google, OpenAI or another Gen AI platform.
  • Experience building a knowledge graph using Neo4j, Spanner, Neptune or another tool is a plus.
Desired Qualifications
  • Experience in digital media, publishing, ad tech, or content platforms.
  • Bachelor’s, Master’s or Ph.D. in Computer Science, Data Engineering, or a related field.
  • Knowledge of LLMs and generative AI in applied settings (e.g., content summarization, auto-tagging, retrieval augmentation).
  • Working experience with OLAP and OLTP systems is a plus.

LocalEdge is a digital marketing service from Hearst Media Services that helps local businesses reach their target customers online. It offers a suite of advertising products including social media management, local SEO, review generation, and marketing analytics, with a performance-based model aimed at delivering cost-effective leads. The product set works by optimizing a business’s local online presence and running targeted campaigns across social channels, search, and review platforms to attract the right customers while tracking results through analytics. What sets LocalEdge apart is its scale as part of Hearst Media Services and its focus on local markets across the United States, serving a broad range of clients from small shops to larger enterprises with a goal of improving marketing ROI. The overarching goal is to help local businesses grow by increasing visibility and generating measurable, cost-efficient leads through local-focused digital marketing.

Company Size

201-500

Company Stage

N/A

Total Funding

N/A

Headquarters

Buffalo, New York

Founded

1968

Simplify Jobs

Simplify's Take

What believers are saying

  • Expanding Local Service Ads into healthcare and legal boosts specialized LSA packages.
  • Reseller programs attract agencies preferring white-label local digital services.
  • Bundling video optimization with social management drives conversions for clients.

What critics are saying

  • Google's AI Overviews suppress 70-90% local SEO traffic in 3-6 months.
  • Yext's Walmart contracts erode SMB market share in 12-18 months.
  • Meta's algorithm shift reduces unverified clients' social efficacy in 6-12 months.

What makes LocalEdge unique

  • LocalEdge offers performance-based advertising delivering cost-effective leads to local businesses.
  • Specializes in AI-enhanced local SEO adapting to Google's AI Overviews launched 2024.
  • Provides comprehensive suite including social media, reviews, and marketing analytics.

Help us improve and share your feedback! Did you find this helpful?

Benefits

Health Insurance

Dental Insurance

Vision Insurance

401(k) Company Match

Mental Health Support

Paid Vacation

Paid Parental Leave

INACTIVE