Full-Time

LLM or GenAI Application Engineer

Posted on 10/4/2025

FocusKPI

FocusKPI

11-50 employees

Data analytics, AI, and staffing solutions

Compensation Overview

$110 - $120/hr

Mountain View, CA, USA

Hybrid

Hybrid role requires 4 days onsite per week in Mountain View, CA.

Category
AI & Machine Learning (1)
Software Engineering (1)
Requirements
  • 5-7 years of industrial work experience along with research/academic experience.
  • Advanced degree in Computer Science, Artificial Intelligence, Data Science, or a related field.
  • Strong programming skills.
  • Expertise with LLM and GenAI application development.
  • Experience with deep learning frameworks such as TensorFlow, PyTorch, or JAX.
  • Hands-on experience with transformer-based models (e.g., GPT, BERT, RoBERTa, LLaMA).
  • Expertise in natural language processing (NLP) and sequence-to-sequence models.
  • Familiarity with Hugging Face libraries and OpenAI APIs.
  • Experience with MLOps tools like Docker, Kubernetes, and CI/CD pipelines.
  • Strong understanding of distributed computing and GPU acceleration using CUDA.
  • Knowledge of reinforcement learning and RLHF (Reinforcement Learning with Human Feedback).
  • The candidate must have a real (actual) product experience, application development, and GenAI-based application shipping.
  • Actual or Industrial LLM or GenAI application experience of at least 2-3 years
Responsibilities
  • Design, train, and fine-tune large language models (e.g., GPT, LLaMA, PaLM) for various applications.
  • Research cutting-edge techniques in natural language processing (NLP) and machine learning to improve model performance.
  • Explore advancements in transformer architectures, multi-modal models, and emergent AI behaviors.
  • Collect, clean, and preprocess large-scale text datasets from diverse sources.
  • Develop and implement data augmentation techniques to improve training data quality.
  • Ensure data is free from bias and aligned with ethical AI standards.
  • Optimize model architecture to improve accuracy, efficiency, and scalability.
  • Implement techniques to reduce latency, memory footprint, and inference time for real-time applications.
  • Collaborate with MLOps teams to deploy LLMs into production environments using Docker, Kubernetes, and cloud.
  • Develop robust evaluation pipelines to measure model performance using key metrics like accuracy, perplexity, BLEU, and F1 score.
  • Continuously test for bias, fairness, and robustness of language models across diverse datasets.
  • Conduct A/B testing to evaluate model improvements in real-world applications.
  • Stay updated with the latest advancements in generative AI, transformers, and NLP research.
  • Contribute to research papers, patents, and open-source projects—present findings and insights at conferences and internal knowledge-sharing sessions.

FocusKPI provides data analytics, artificial intelligence models, and specialized technical staffing to help businesses increase revenue and reduce operational costs. The firm builds customized machine learning tools and private generative AI platforms, such as Netpoint.AI, which allow companies to analyze customer behavior and automate tasks while keeping their data secure. Unlike many consulting firms, FocusKPI distinguishes itself by offering a 24-hour turnaround for technical recruiting and providing private AI deployments that ensure clients maintain total control over their proprietary information. The company's goal is to deliver high-quality technical talent and actionable business insights that scale efficiently across industries like retail, media, and software.

Company Size

11-50

Company Stage

N/A

Total Funding

N/A

Headquarters

Santa Clara, California

Founded

2010

Simplify Jobs

Simplify's Take

What believers are saying

  • Fraud warnings on domains like focusKPIjobs.com build client trust.
  • Custom AI agents integrate with CRM for decision-ready outputs.
  • 90% client retention from 15+ years of enterprise AI solutions.

What critics are saying

  • 33–36 employees hinder scaling against Palantir in 12–24 months.
  • Inconsistent addresses erode trust in 6–12 months.
  • No proprietary IP commoditizes Accelerators via Hugging Face in 18–36 months.

What makes FocusKPI unique

  • Accelerators toolbox, built over 10+ years, fast-tracks analytics projects.
  • Tailored GenAI solutions deliver 80% average efficiency improvement.
  • Leadership features Yunxiao He as CDO and Peter Zhu as CEO.

Help us improve and share your feedback! Did you find this helpful?

Benefits

Remote Work Options

Hybrid Work Options

INACTIVE