Full-Time

Web Scraping Engineer

YipitData

501-1,000 employees

Data analysis and research services provider

No salary listed

Mid, Senior

No H1B Sponsorship

India

Category
Data Management
Data & Analytics
Required Skills
Playwright
REST APIs
Selenium
Puppeteer
HTML/CSS
Requirements
  • Effective communication in English with both technical and non-technical stakeholders.
  • 4+ years of experience with web scraping frameworks (e.g., Selenium, Playwright, or Puppeteer).
  • Strong understanding of HTTP, RESTful APIs, HTML parsing, browser rendering, and TLS/SSL mechanics.
  • Expertise in advanced fingerprinting and evasion strategies (e.g., browser fingerprint spoofing, request signature manipulation).
  • Deep experience managing cookies, headers, session states, and proxy rotations, including the deployment of both residential and data center proxies.
  • Experience with logging, metrics, and alerting to ensure high availability.
  • Troubleshooting skills to optimize scraper performance for efficiency, reliability, and scalability.
Responsibilities
  • Refactor and Maintain Web Scrapers
      • Overhaul existing scraping scripts to improve reliability, maintainability, and efficiency.
      • Implement best coding practices (clean code, modular architecture, code reviews, etc.) to ensure quality and sustainability.
  • Implement Advanced Scraping Techniques
      • Utilize sophisticated fingerprinting methods (cookies, headers, user-agent rotation, proxies) to avoid detection and blocking.
      • Handle dynamic content, navigate complex DOM structures, and manage session/cookie lifecycles effectively.
  • Collaborate with Cross-Functional Teams
      • Work closely with analysts and other stakeholders to gather requirements, align on targets, and ensure data quality.
      • Support internal users of our web scraping tooling by providing troubleshooting, documentation, and best practices to ensure efficient data usage for critical reporting.
  • Monitor and Troubleshoot
      • Develop robust monitoring solutions and alerting frameworks to quickly identify and address failures.
      • Continuously evaluate scraper performance, proactively diagnosing bottlenecks and scaling issues.
  • Drive Continuous Improvement
      • Propose new tooling, methodologies, and technologies to enhance our scraping capabilities and processes.
      • Stay up to date with industry trends, evolving bot-detection tactics, and novel approaches to web data extraction.

YipitData provides data analysis and research services by converting large volumes of raw data from various sources into clear and actionable insights. These insights help clients, primarily investors and corporate entities, understand market trends and company performance. YipitData operates on a subscription model, offering clients regular datasets and reports that include updates on multiple sectors such as autos, marketplaces, and logistics. Each subscription provides tailored information with three layers of coverage, ensuring relevance to client needs. The company generates revenue through these subscription fees, while also offering data integration and management practices to enhance their analytics capabilities. YipitData's goal is to empower clients with the information necessary for informed decision-making.

Company Size: 501-1,000

Company Stage: Series E

Total Funding: $577M

Headquarters: New York City, New York

Founded: 2013

Simplify's Take

What believers are saying

  • Increased demand for alternative data is driving growth in the market research sector.
  • The rise of e-commerce creates opportunities for YipitData to analyze consumer behavior.
  • Integration of AI enhances YipitData's ability to process and interpret large datasets.

What critics are saying

  • Competition from AI-driven analytics platforms like ChatGPT could challenge YipitData's market position.
  • Volatility in inflation rates may impact YipitData's predictive insights and client trust.
  • Amazon's market moves may require YipitData to adjust its data analysis strategies.

What makes YipitData unique

  • YipitData specializes in alternative data, offering unique insights beyond traditional data sources.
  • The company provides custom reports tailored to specific business needs and sectors.
  • YipitData's subscription model ensures clients receive regular, detailed datasets and reports.

Benefits

401(k) Company Match

Flexible Work Hours

Unlimited Paid Time Off

Parental Leave

Wellness Program

Professional Development Budget

Growth & Insights and Company News

Headcount

6 month growth: -1%

1 year growth: -1%

2 year growth: -3%

AI Jobs
Mar 14th, 2025
YipitData: Data QA Associate, $475M Funding

YipitData, a market research firm, is hiring a Data QA Associate for a remote role in India. The position involves data cleaning and quality assurance for e-commerce insights, collaborating with teams in China and the U.S. Candidates should have a bachelor's degree, 0-2 years of relevant experience, and skills in data tagging and analysis. YipitData recently raised $475M from The Carlyle Group, valuing the company at over $1B. The role offers growth opportunities and a comprehensive benefits package.

Silicon Canals
Nov 4th, 2024
YipitData Expands Global Consumer Coverage to Europe and China

Business Wire
Jun 13th, 2024
CIBC Innovation Banking leads senior syndicated debt financing for Yipit, LLC

CIBC Innovation Banking is pleased to announce that it recently acted as lead bank and administrative agent in the

PYMNTS
Sep 20th, 2023
ChatGPT Traffic Rebounds With Start Of School Year

Some new data about the usage patterns of generative artificial intelligence (AI) chatbots may have teachers taking a closer look at students’ homework. ChatGPT, the AI chatbot developed by OpenAI, is experiencing a surge in traffic at the same time students are returning to school, according to recent estimates from third-party firms, Bloomberg reported Wednesday (Sept. 20). This is raising concerns about the widespread use of the chatbot among students. Data intelligence firm SimilarWeb reported that web traffic to ChatGPT rose by about 12% last week compared to the previous week, according to the report. This increase is believed to be directly influenced by the return of students to school in the U.S

PR Newswire
Aug 17th, 2023
As Home Improvement Growth Slows, Consumers Turn To E-Commerce

YipitData industry report shows Amazon growing share, ranking second behind The Home Depot

NEW YORK, Aug. 17, 2023 /PRNewswire/ -- The home improvement industry has seen significant sales shifts towards online shopping, with generalist retailers such as Amazon making significant market share gains against specialty retail leaders such as The Home Depot. YipitData's State of the Home Improvement Industry Report provides visibility into the dynamics of the $220 billion home improvement market, covering industry trends and growth by channel, retailer and subcategory.

  • Generalist retailers are making gains in the home improvement industry. Amazon lags Home Depot in market share by only 2.6 pp.
  • Home improvement GMV sales growth at a modest 2-3% YoY in 1H 2023 vs 23.5% YoY in 2021.
  • More than ⅓ of total home improvement sales in 2023 were made online. Generalist retailers, led by Amazon, are capitalizing on the shift to e-commerce.
  • Various subcategory performance is evolving due to consumer preference changes, including hand & power tools, outdoor power equipment, and lawn & garden.

"Our goal is to help retailers and brands answer their key questions on evolving consumer behavior with actionable data," said Dan Pellegrinelli, VP of Research at YipitData