Full-Time

Team Lead – HPC System Operations

Updated on 1/15/2025

Telesat

Telesat

501-1,000 employees

Global satellite operator providing connectivity solutions

Automotive & Transportation
Consulting
Aerospace

Senior

Ottawa, ON, Canada

Hybrid position based in Ottawa, Ontario.

Canada Top Secret Clearance Required

Category
Network Administration
System Administration
IT & Security
Required Skills
Bash
Linux/Unix
Requirements
  • A Diploma or Degree in a relevant area of study with a preference for Computer Science together with demonstrated operational network-related experience.
  • Minimum of 5 years in Information Technology (with a related University Degree) or minimum of 7 years in Information Technology (with a three-year College Diploma).
  • Industry certifications such as MCSE, CISSP are a strong asset.
  • In-depth and demonstrated experience in the installation and operation of Linux platforms in an Enterprise environment (Ubuntu/RedHat).
  • Experience in the use of KVM or other hypervisors.
  • Experience in HPC tools such as Slurm, OpenHPC, LSF or GridEngine.
  • Demonstrated knowledge of HPC clusters and use cases.
  • Working technical knowledge of network systems.
  • Working technical knowledge of current systems software, protocols and standards including Active Directory.
  • Identity management using Microsoft Identity Manager and Azure AD Connect.
  • Solid understanding of the Windows based endpoints.
  • Solid scripting experience (e.g. Bash).
  • Excellent written and oral communication skills.
  • Excellent problem-solving skills.
  • Strong analytical and troubleshooting skills.
  • Strong interpersonal and organizational skills.
  • Must be well organized and able to grasp system concepts and communicate their applications.
  • Must be capable of quickly learning new systems and associated software applications for proficient execution of tasks.
  • Ability to manage multiple demands with time related constraints in a fast-paced environment.
  • Prioritize and schedule work as necessary to maintain department standards and service level agreements.
  • Ability to speak effectively before groups of internal employees, communicate technical information, create and deliver presentations and information sessions to both technical and nontechnical personnel.
  • Demonstrated experience in applying technical expertise and in-depth evaluation to solve complex problems in own area of expertise.
  • Ability to create and maintain documentation and training materials, including KB articles, for technical staff and end-user audiences.
Responsibilities
  • Identify, diagnose, and resolve level two problems for users of the software and hardware, LAN and WAN, VPN, the Internet, mobile devices, and new computer technology; communicate solutions to end-users.
  • Respond to more complex issues (second line support) escalated by the first line support using problem-solving skills and analysis to identify root causes of issues, determine course of action and propose creative solutions.
  • Manage day-day operations and support of the HPC environment (Linux).
  • Take ownership of capacity, availability and performance of the HPC cluster(s).
  • Support end users in the submission and management of jobs based on Slurm and OpenHPC.
  • Migrate existing nodes as required to Linux.
  • Implement and manage a system based on Foreman or similar to manage patching and oversee cluster management.
  • Implement patches and upgrades to Linux, Slurm and OpenHPC as required.
  • Install new servers and storage, build new clusters, configure and manage Linux distributions, hypervisors (KVM) and tooling.
  • Automate where possible to increase efficiency of operations.
  • Execute upon firewall access requests to the environment.
  • Escalate priority support issues to senior staff and/or other corporate technology groups.
  • Collect and document all relevant information prior to escalation to allow senior staff to operate efficiently.
  • Document, track and monitor problems to ensure timely resolution.
  • Assist in tracking helpdesk calls pertaining to application, networking, and systems problems and issues.
  • Assign username, password and access right permissions for multiple proprietary applications, as well as client software.
  • Identity Management and multifactor authentication with integration between Active Directory and Linux platforms.
  • Perform hardware & software audits.
  • Product research and evaluation.
  • Provide emergency support on incidents as required.
  • Perform occasional after-hours maintenance.
  • Incident on-call rotation as required.
  • Day-to-day operational support.
Desired Qualifications
  • Microsoft Windows experience is an asset.
  • Bilingualism (English/French) is an asset.

Telesat operates in the satellite communications industry, providing connectivity solutions to various sectors including government, telecommunications, inflight connectivity, maritime, oil and gas, and corporate networks. The company utilizes both geostationary (GEO) and low Earth orbit (LEO) satellites to ensure reliable global communications. Telesat's products include satellite capacity and consulting services, which are offered through long-term contracts and service agreements. A key feature of Telesat is its Lightspeed network, which uses advanced LEO satellites to deliver high-speed and low-latency connectivity. This focus on diverse applications and advanced technology sets Telesat apart from its competitors, with the goal of addressing complex communication challenges worldwide.

Company Stage

IPO

Total Funding

$30.7M

Headquarters

Ottawa, Canada

Founded

1969

Simplify Jobs

Simplify's Take

What believers are saying

  • Growing demand for satellite internet boosts Telesat's Lightspeed project prospects.
  • Hybrid satellite-terrestrial networks offer Telesat integration opportunities for enhanced connectivity.
  • 5G integration with satellite networks creates new business opportunities for Telesat.

What critics are saying

  • Competition from SpaceX's Starlink could impact Telesat's market share.
  • Delays in Lightspeed's launch could lead to financial penalties or contract losses.
  • Reliance on government funding exposes Telesat to political and economic risks.

What makes Telesat unique

  • Telesat leverages both GEO and LEO satellites for comprehensive global communications.
  • The Lightspeed network offers high-speed, low-latency connectivity solutions.
  • Telesat serves diverse sectors, including maritime, aviation, and oil and gas.

Help us improve and share your feedback! Did you find this helpful?

Benefits

Remote Work Options

Hybrid Work Options