Full-Time

Sr. Site Reliability Engineer

Updated on 12/20/2024

CDK Global

CDK Global

5,001-10,000 employees

Integrated software solutions for automotive retail

Automotive & Transportation
Enterprise Software

Compensation Overview

$110k - $140kAnnually

+ Bonus

Senior, Expert

H1B Sponsorship Available

Hoffman Estates, IL, USA

Category
DevOps & Infrastructure
Site Reliability Engineering
Required Skills
RabbitMQ
React.js
Apache Kafka
Java
Postgres
CloudFormation
AWS
Prometheus
Terraform
Nginx
MongoDB
Oracle
Linux/Unix
AngularJS
Requirements
  • Bachelor’s degree, or equivalent experience, in Computer Science, Engineering, or related field, with 8+ years of relevant experience with large-scale enterprise-grade solutions.
  • A strong background in architecture / design and currently working in a similar role, in a forward-thinking and fast paced business.
  • 4+ years professional SRE experience relevant to the responsibilities listed above, including event driven architectures, cloud native and distributed / SaaS solutions.
  • 4+ years of experience with CI/CD pipelines, infrastructure as code, proactive monitoring, smart alerting, ensuring performance / scalability and proactive capacity management of enterprise-grade solutions.
  • Expertise troubleshooting across the entire stack: network, server, operating system, and application.
  • Expertise with monitoring and alerting tools (e.g., New Relic, Prometheus, Grafana).
  • Strong analytical and problem-solving skills, with a keen attention to detail.
  • Experience with Microservices, Java, Node, Kafka / RabbitMQ, Oracle / PostgreSQL, MongoDB / DynamoDB, React / Angular, Istio, NGINX, F5, AWS API Gateway, ECS, Cloudformation, Terraform.
  • Experience deploying, maintaining and troubleshooting containerized applications.
  • A level of comfort with Linux.
  • Solid communication and collaboration skills.
  • Certification in AWS or related cloud technologies (preferred).
  • Automotive retail experience (preferred).
Responsibilities
  • Engage in and improve the whole lifecycle of solutions, from inception and design, through to build/test, deployment, operation and refinement.
  • Ensure our solutions are reliable, fault-tolerant, secure, efficiently scalable, available, reachable and cost-effective.
  • Measure, monitor and proactively alert on resource consumption, error rates, traffic anomalies, availability, performance, reachability and overall system health.
  • Quickly respond to and prevent disruptions to users. If a disruption does occur, quickly respond to and resolve incidents efficiently.
  • Expertly troubleshoot issues with distributed systems, interactions between cloud technology layers and components, common dependencies at scale.
  • Practice sustainable incident response, blameless postmortems and prompt implementation of recommended changes to prevent recurrence.
  • Contribute to the development and implementation of routine maintenance automation and alerting.
  • Recommend configurations optimal of cloud technology solutions and modify the code base that defines systems or cloud technologies to improve the reliability, availability, efficiency, observability, performance and operability of supported products.
  • Collaborate well with cross-functional teams across product, architecture, engineering, infrastructure, and security to ensure that reliability standards are integrated into the development and deployment of all solutions.
  • Maintain up-to-date documentation on system configurations, incident response protocols, and operational best practices.
  • Earnestly participate in code/design reviews, and regular meetings with the engineering teams that develop and/or manage the products in question.
  • Research and maintain an awareness in industry trends, advances in distributed systems and cloud technologies, tools, and/or processes for maintaining and improving product availability, reliability, efficiency, observability, and/or performance.
  • Contribute to the implementation of new solutions within the team by identifying ways they can be applied to solve persistent problems.
  • Ensure that uniform enterprise-wide architecture and design standards are adhered to high availability of products, services and database.

CDK Global provides integrated software solutions specifically designed for the automotive retail industry. Their products help auto dealerships manage various operations, including billing, customer relationship management (CRM), inventory management, and service scheduling. By using these software tools, dealerships can streamline their processes, improve customer experiences, and increase sales. CDK Global differentiates itself from competitors by focusing on the unique challenges of the automotive market, particularly the transition to electric vehicles (EVs). They conduct research to understand dealership needs and tailor their solutions accordingly. The company's goal is to enhance the efficiency and productivity of automotive retailers through advanced technology.

Company Stage

IPO

Total Funding

N/A

Headquarters

Hoffman Estates, Illinois

Founded

1972

Simplify Jobs

Simplify's Take

What believers are saying

  • Increased focus on cybersecurity can enhance CDK Global's reputation and client trust.
  • Digital transformation in automotive industry offers expansion opportunities for CDK Global.
  • Rise of EVs creates new service offerings and partnerships for CDK Global.

What critics are saying

  • Tekion's lawsuit could lead to significant legal and financial repercussions for CDK Global.
  • Antitrust lawsuit highlights vulnerabilities in CDK Global's business practices.
  • Cybersecurity breach impacts client operations and financial performance, posing ongoing risks.

What makes CDK Global unique

  • CDK Global specializes in integrated software solutions for the automotive retail industry.
  • The company offers a subscription-based model, ensuring a stable revenue stream.
  • CDK Global focuses on EV transition, tailoring solutions for evolving automotive needs.

Help us improve and share your feedback! Did you find this helpful?