We are seeking Software Engineer interns to accelerate Apache Spark and related distributed frameworks on GPUs. Apache Spark is the most popular data processing engine in data centers for data science. It is used for a wide variety of data workloads, from data preparation to running machine learning (ML) experiments to deploying ML applications. Data scientists spend a considerable amount of time exploring data and iterating over ML experiments. Every hour of compute required to sort through datasets, extract features, and fit ML algorithms impedes an efficient business workflow.
At NVIDIA, we are passionate about working on hard problems that have an impact. You will work with an engineering team accelerating Apache Spark with GPUs using CUDA and open source libraries. As an intern, you will be paired with a senior engineer and assigned one or more projects to accelerate Apache Spark applications with GPUs and DPUs for select customers and partners. The projects will give you an opportunity to work with Apache Spark on different platforms using different APIs. We will benchmark solutions and measure performance against theoretically optimal results. You will have the opportunity to work on real-world customer problems. Data workflows can benefit tremendously from acceleration, which lets data scientists explore more and larger datasets and reach their business goals faster and more efficiently.
What you’ll be doing:
Work with select customers and partners to accelerate Apache Spark applications with GPUs and DPUs.
Develop applications to promote best practices for accelerated data analytics and machine/deep learning in various industry verticals.
Build tools to analyze data processing workloads to identify opportunities for acceleration and cost savings.
Work with major cloud service providers and Spark vendors globally.
Engage with open source communities, including Apache Spark and RAPIDS, for technical discussions and contributions.
Work with NVIDIA strategic partners to deploy sophisticated machine learning and data analytics solutions in public cloud or on-premises clusters.
What we need to see:
Pursuing a BS, MS, or PhD in Computer Science, Computer Engineering, or a closely related field
Outstanding problem-solving skills
Excellent programming skills in Python, Scala, or SQL
Knowledge of open source big-data projects including Apache Spark, Apache Hadoop, Apache Flink, and Apache Kafka
Able to work successfully with multi-functional teams across organizational boundaries and geographies
Highly motivated with strong communication skills
Ways to stand out from the crowd:
NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. Are you creative and autonomous? Do you love a challenge? If so, apply today!
The Colorado Equal Pay for Equal Work Act requires that NVIDIA provide the compensation range and benefits offered for this position if performed in Colorado. The base hourly range for this position in Colorado is $31.80 - 43.73 USD.
NVIDIA also offers a comprehensive benefits package. We provide health care coverage, dental and vision, 401(k), including company matching and after-tax contributions, Employee Stock Purchase Program (ESPP), Employee Assistance Program (EAP), company-paid holidays, paid sick leave, vacation leave, professional time off, and life and disability protection. Employees in eligible sales positions may also be eligible for commission.
Base pay is based on market location and may vary based on factors including experience, skills, education, and other job-related reasons.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.