Full-Time

Software Engineer

Data, US

Posted on 7/19/2023

Gauss Labs

Gauss Labs

51-200 employees

AI solutions for manufacturing industries

AI & Machine Learning
Data & Analytics
Industrial & Manufacturing

Senior

Palo Alto, CA, USA

Required Skills
Kubernetes
Rust
Python
Airflow
Data Science
Apache Beam
Apache Spark
SQL
Apache Kafka
Java
Docker
Pandas
Redis
Apache Hive
MongoDB
Hadoop
Data Analysis
Cassandra
Requirements
  • BS/MS degree in Computer Science and Engineering or strong industry experience in software development
  • 3+ years of experience as a hands-on Data Engineer/Architect including ETL jobs, data pipelines, and Big Data analytics
  • 5+ years of industry experience in building large-scale production systems
  • Startup spirit with the ability to be flexible and wear multiple hats
  • Proficient in large-scale data processing including batch and streaming, query engines, tooling, and storage formats
  • Experience working with Terabytes of data
  • Preferred experience in time series data storage and processing as well as image data processing
  • Experience in distributed systems design, common data platform architecture, and open source data processing frameworks
  • Experience in technologies such as Hadoop, Spark, Kafka, Redis, Cassandra, Pandas, Dask, Airflow, Apache Beam, MongoDb, Hive, Impala, Hazelcast, Athena, Presto
  • Experience in at least one modern programming language such as Python, Java, Go, Rust, and proficiency in SQL
  • Experience architecting and implementing large operational data stores
  • Excellent verbal and written communication skills, able to collaborate cross-functionally
  • Plus: Experience in Kubernetes, Kubeflow, Docker, and container technologies, as well as infrastructure as code and CI/CD technologies
Responsibilities
  • Design and develop the scalable data architecture for Gauss Lab’s AI products which include time series and image data for semiconductor industry
  • Design and build data infrastructure systems, services, and tools to handle Gauss Labs’s data-intensive products and business requirements that securely scale over terabytes of data
  • Define the schemas, layout, storage format, and database technologies that will be used to store, retrieve and process the data for our products
  • Build databases, object stores, data warehouses, and lakes for time series, images, and structured data
  • Develop robust, well-instrumented near real-time stream processing data pipelines that can scale to handle future growth and adhere to SLAs
  • Design and develop stream processing features to execute on various event-driven ETL and AI pipelines
  • Evolve Gauss Lab’s data infrastructure and tools and technical lead for the design, building, and launching of new data models and data pipelines within our products
  • Work closely with product/program managers to understand the product’s needs, business problems, and domain
  • Work cross-functionally with various engineering and data science teams to identify and execute data-infrastructure challenges

Gauss Labs specializes in industrial AI solutions, integrating cutting-edge technologies like machine learning and computer vision to optimize manufacturing processes. This company offers a stimulating work environment for professionals passionate about utilizing artificial intelligence to enhance efficiency and accuracy in industries like semiconductor, automotive, and pharmaceuticals. The focus on continuous improvement and expansion into diverse manufacturing sectors provides employees with ample opportunities for growth and innovation in their careers.

Company Stage

Seed

Total Funding

$110M

Headquarters

Palo Alto, California

Founded

2020

Growth & Insights
Headcount

6 month growth

0%

1 year growth

-1%

2 year growth

31%

Benefits

Competitive compensation + meaningful equity

Generous PTO policy

Comprehensive wellbeing benefits

Paid family leaves

L&D opportunities

Additional benefits unique to each site

INACTIVE