Software Engineer

Data, US

Posted on 7/19/2023

Gauss Labs

Gauss Labs

51-200 employees

AI solutions for manufacturing industries

AI & Machine Learning
Data & Analytics
Industrial & Manufacturing


Palo Alto, CA, USA

Required Skills
Data Science
Apache Beam
Apache Spark
Apache Kafka
Apache Hive
Data Analysis
  • BS/MS degree in Computer Science and Engineering or strong industry experience in software development
  • 3+ years of experience as a hands-on Data Engineer/Architect including ETL jobs, data pipelines, and Big Data analytics
  • 5+ years of industry experience in building large-scale production systems
  • Startup spirit with the ability to be flexible and wear multiple hats
  • Proficient in large-scale data processing including batch and streaming, query engines, tooling, and storage formats
  • Experience working with Terabytes of data
  • Preferred experience in time series data storage and processing as well as image data processing
  • Experience in distributed systems design, common data platform architecture, and open source data processing frameworks
  • Experience in technologies such as Hadoop, Spark, Kafka, Redis, Cassandra, Pandas, Dask, Airflow, Apache Beam, MongoDb, Hive, Impala, Hazelcast, Athena, Presto
  • Experience in at least one modern programming language such as Python, Java, Go, Rust, and proficiency in SQL
  • Experience architecting and implementing large operational data stores
  • Excellent verbal and written communication skills, able to collaborate cross-functionally
  • Plus: Experience in Kubernetes, Kubeflow, Docker, and container technologies, as well as infrastructure as code and CI/CD technologies
  • Design and develop the scalable data architecture for Gauss Lab’s AI products which include time series and image data for semiconductor industry
  • Design and build data infrastructure systems, services, and tools to handle Gauss Labs’s data-intensive products and business requirements that securely scale over terabytes of data
  • Define the schemas, layout, storage format, and database technologies that will be used to store, retrieve and process the data for our products
  • Build databases, object stores, data warehouses, and lakes for time series, images, and structured data
  • Develop robust, well-instrumented near real-time stream processing data pipelines that can scale to handle future growth and adhere to SLAs
  • Design and develop stream processing features to execute on various event-driven ETL and AI pipelines
  • Evolve Gauss Lab’s data infrastructure and tools and technical lead for the design, building, and launching of new data models and data pipelines within our products
  • Work closely with product/program managers to understand the product’s needs, business problems, and domain
  • Work cross-functionally with various engineering and data science teams to identify and execute data-infrastructure challenges

Gauss Labs specializes in industrial AI solutions, integrating cutting-edge technologies like machine learning and computer vision to optimize manufacturing processes. This company offers a stimulating work environment for professionals passionate about utilizing artificial intelligence to enhance efficiency and accuracy in industries like semiconductor, automotive, and pharmaceuticals. The focus on continuous improvement and expansion into diverse manufacturing sectors provides employees with ample opportunities for growth and innovation in their careers.

Company Stage


Total Funding



Palo Alto, California



Growth & Insights

6 month growth


1 year growth


2 year growth



Competitive compensation + meaningful equity

Generous PTO policy

Comprehensive wellbeing benefits

Paid family leaves

L&D opportunities

Additional benefits unique to each site