Full-Time

Senior AI Data Engineer

Image Generation Data

iSoftStone

5,001-10,000 employees

IT services and digital transformation provider

Compensation Overview

$105k - $110k/yr

No H-1B Sponsorship

Menlo Park, CA, USA

In Person

Onsite 5 days per week in Menlo Park; U.S. work authorization required; visa sponsorship not available.

Category
Data & Analytics
Required Skills
LLM
SQL
Machine Learning
Data Engineering
Requirements
  • Bachelor's degree or higher in Computer Science, Data Engineering, Machine Learning, or a related STEM field.
  • 5+ years of industry experience in data engineering, ML engineering, or a hybrid role involving both data pipelines and model serving/inference.
  • Demonstrated track record of building and operating production data pipelines that invoke ML models at scale.
  • Previous experience at Meta is preferred but not required.
Responsibilities
  • AI-Augmented Data Pipelines: Design and maintain AI-augmented, large-scale data pipelines (billions of images) integrating traditional transformations with ML models (classifiers, embeddings, LLMs) for cleaning and annotation.
  • Remote Inference Orchestration: Own the systems for remote ML model inference orchestration within pipelines, managing batching, retries, async jobs, and ensuring graceful degradation.
  • Feature Pipelines: Build and maintain scalable pipelines for generating, storing, and serving vector embeddings, including nearest-neighbor index management and quality validation.
  • Data Curation at Scale: Source, filter, and curate training datasets using a combination of SQL and model-derived signals (e.g., aesthetic scores, NSFW classifiers), owning the end-to-end data flow and maintaining governance, quality, and compliance.
  • LLM-Assisted Annotation: Design and operate pipelines that use LLMs and vision models for automated annotation of training data, including auditing workflows to measure and improve annotation model performance.
  • Tooling & Frameworks: Contribute to shared tooling and frameworks that make it easier for the broader team to build AI-augmented data pipelines — e.g., reusable operators for model invocation, standard patterns for async job management.
Desired Qualifications
  • Working knowledge of embeddings and vector representations, including generating, storing, indexing, and querying embeddings (FAISS, Milvus, or equivalent).
  • Familiarity with content-understanding models such as image classifiers, object detection, OCR, NSFW detection, and aesthetic scoring.
  • Experience using LLMs for data tasks such as prompt engineering for annotation, data cleaning, or evaluation via LLM APIs.
  • Knowledge of generative AI, including diffusion models, image generation, and evaluation metrics (FID, CLIP score, etc.).

iSoftStone provides IT services and digital transformation for large enterprises. It delivers consulting, software development, cloud, data management, AI, digital experience, testing, and business process outsourcing through long-term contracts and partnerships. It differentiates itself with a global delivery network serving 2,600+ clients, including 90+ Fortune Global 500 firms, plus a software-hardware integration approach with Tongfang Computer. Its goal is to expand internationally and offer end-to-end IT solutions that combine software services with hardware capabilities for major customers.

Company Size

5,001-10,000

Company Stage

IPO

Headquarters

Kirkland, Washington

Founded

2001

Simplify's Take

What believers are saying

  • Q1 2025 revenue grew 28.65% year-on-year to RMB 7.011 billion, demonstrating strong momentum.
  • July 2025 'iSoftStone Digital' brand launch and Kingdee partnership accelerate international expansion.
  • Tier-2 and tier-3 Chinese urbanization drives recurring smart city and digital infrastructure demand.

What critics are saying

  • Tongfang Computer integration failure destroys margins; hardware requires different operational expertise than software.
  • US export controls on Chinese tech vendors block Western client contracts and supply chains.
  • Loss of 2-3 Fortune Global 500 clients reduces revenue 15-25% due to concentration risk.
  • Hardware revenue growth compresses blended margins, as Tongfang operates at lower margins than the software business.
  • China-US tech decoupling forces US clients to shift work to domestic or allied vendors.
  • Kingdee partnership dependency creates acquisition or termination risk if strategic priorities shift.
  • Government contract ties expose company to US regulatory scrutiny and client blacklisting.
  • AI commoditization erodes consulting premiums as OpenAI and cloud providers compress fees.
  • Wage inflation in China and India increases labor costs, eroding unit economics.
  • ChiNext delisting or regulatory suspension destroys shareholder value and capital access.

What makes iSoftStone unique

  • Software-hardware integration via Tongfang Computer acquisition differentiates from pure-play IT services competitors.
  • End-to-end smart city expertise across 80 Chinese cities creates recurring infrastructure revenue.
  • Diversified service portfolio spanning cloud, IoT, blockchain, 5G, and semiconductors enables cross-selling.

Benefits

Health Insurance

Dental Insurance

Vision Insurance

401(k) Retirement Plan

Paid Holidays

Paid Sick Leave

Life Insurance

Growth & Insights and Company News

Headcount

6 month growth

0%

1 year growth

0%

2 year growth

0%
ICT Review
Dec 25th, 2024
Who is ISoftStone? Mysterious tech firm set to become one of the biggest PC vendors in China, beating Huawei, HP and Apple

Internationally, ISoftStone has expanded its footprint, collaborating with Kingdee International Software Group in Singapore to provide digital solutions and services to global customers.

Data Centre Dynamics Ltd
Nov 15th, 2024
China's Kingdee International Software Group launches data center in Singapore

iSoftStone will also work with Kingdee to provide digital solutions to its global customers from the data center.