Full-Time

Data Consultant

Data & Analytics

Posted on 10/18/2024

Pythian

201-500 employees

Cloud migration and data management services

No salary listed

Junior, Mid

Remote in Canada

Category
Data Management
Data Engineering
Data & Analytics
Required Skills
Talend
Python
Airflow
Data Science
BigQuery
Apache Spark
SQL
Java
Go
Scala
Jenkins
Terraform
LangChain
Google Cloud Platform

Requirements
  • Proficiency in a programming language such as Python, Java, Go or Scala
  • Experience with big data cloud technologies like EMR, Athena, Glue, BigQuery, Dataproc, and Dataflow.
  • Ideally, you will have strong hands-on experience with Google Cloud Platform data technologies: Google BigQuery, Google Dataflow, and executing PySpark and Spark SQL code on Dataproc.
  • Understand the fundamentals of Spark (PySpark or Spark SQL), including using the DataFrame API as well as analyzing and performance-tuning Spark queries
  • Have experience developing and supporting robust, automated and reliable data pipelines
  • Develop frameworks and solutions that enable us to acquire, process, monitor and extract value from large datasets
  • Have strong SQL skills
  • Bring a good knowledge of popular database and data warehouse technologies and concepts from Google, Amazon or Microsoft (cloud and conventional RDBMS), such as BigQuery, Redshift, Microsoft Azure SQL Data Warehouse, Snowflake, etc.
  • Have strong knowledge of data orchestration solutions like Airflow, Oozie, Luigi or Talend
  • Have strong knowledge of dbt (data build tool) or Dataform.
  • Experience with Apache Iceberg, Hudi and query engines like Presto (Trino) is a plus.
  • Knowledge of data catalogs (AWS Glue, Google Dataplex, etc.), data governance and data quality solutions (e.g. Great Expectations) is an added advantage.
  • Have knowledge of how to design distributed systems and the trade-offs involved
  • Experience with software engineering best practices for development, including source control systems, automated deployment pipelines like Jenkins and DevOps tools like Terraform
  • Experience in data modeling, data design and persistence (e.g. warehousing, data marts, data lakes).
  • Experience in performing DevOps activities such as IaC using Terraform, provisioning infrastructure in GCP/AWS/Azure, defining data security layers, etc.
  • Knowledge of GenAI tools and frameworks such as Vertex AI and LangChain, and proficiency in prompt engineering, are good to have.
Responsibilities
  • Design and develop end-to-end data-based solutions with a heavy focus on application and data, and a good understanding of infrastructure.
  • Translate complex functional and technical requirements into detailed designs.
  • Write high-performance, reliable and maintainable code.
  • Develop test automation and associated tooling needed for the project.
  • Work on complex and varied data-based projects, including tasks such as collecting, parsing, managing, analyzing, and visualizing very large datasets.
  • Maintain and execute DataOps tasks such as performance optimization of ETL/ELT pipelines, diagnosis and troubleshooting of pipeline issues, interpreting data observability dashboards, and enhancements.
  • Perform data pipeline-specific DevOps activities such as infrastructure provisioning, writing IaC code, and implementing data security.
  • Analyze potential issues, complete root cause analysis, and assign issues for resolution.
  • Follow up with Data Engineering team members to see fixes through to completion.
  • Review bug descriptions, functional requirements and design documents, incorporating this information into test plans and test cases.
  • Performance tuning for batch and real-time data processing.
  • Secure components of clients’ Cloud Data platforms.
  • Health-checks and configuration reviews.
  • Data pipelines development – ingestion, transformation, cleansing.
  • Data flow integration with external systems.
  • Integration with data access tools and products.
  • Foundational CI/CD for all infrastructure components, data pipelines, and custom data apps.
  • Common operational visibility of the data platform, from platform infrastructure to data pipelines and machine learning apps.
  • Assist client application developers and advise on efficient data access and manipulations.
  • Define and implement efficient operational processes.

Pythian assists businesses in managing and optimizing their data and IT infrastructure through services like cloud migration, managed services, and advanced analytics. They help companies move their data to cloud platforms such as Google Cloud and AWS, while providing ongoing support to ensure smooth operations. Pythian differentiates itself by offering specialized expertise in machine learning and data science, helping clients turn data into valuable insights. Their goal is to empower organizations to leverage cloud computing and analytics for improved operations and growth.

Company Size

201-500

Company Stage

Early VC

Total Funding

$21M

Headquarters

Ottawa, Canada

Founded

1997

Simplify Jobs

Simplify's Take

What believers are saying

  • Pythian's expansion into the UK and Europe increases its market presence.
  • The launch of Agentspace QuickStart positions Pythian as a leader in AI adoption.
  • Christina O'Reilly's appointment strengthens Pythian's marketing and brand strategies.

What critics are saying

  • Integrating Rittman Mead may lead to operational inefficiencies.
  • Rapid AI service expansion could stretch Pythian's resources.
  • New leadership may disrupt existing client relationships and service offerings.

What makes Pythian unique

  • Pythian specializes in managing complex enterprise data infrastructure globally.
  • The company offers unique cloud migration and advanced analytics services.
  • Pythian's acquisition of Rittman Mead enhances its Oracle and AI capabilities.

Benefits

Remote Work Options

Flexible Work Hours

Paid Vacation

Paid Sick Leave

Wellness Program

Professional Development Budget

401(k) Company Match

Growth & Insights and Company News

Headcount

6 month growth

1%

1 year growth

0%

2 year growth

2%
ChannelE2E
Apr 30th, 2025
Pythian Expands Oracle Cloud Services with Rittman Mead Acquisition

Pythian has expanded its Oracle consulting and managed services portfolio with the acquisition of Rittman Mead, a U.K.-based Oracle data and analytics company.

GlobeNewswire
Apr 30th, 2025
Pythian Positioned As Oracle Database@Google Cloud Leader With Acquisition Of Rittman Mead

OTTAWA, Ontario, April 30, 2025 (GLOBE NEWSWIRE) -- Pythian Services Inc. (“Pythian”), a leading global services company specializing in data, analytics, and AI solutions, today announced the acquisition of globally recognized Oracle data and analytics consultancy Rittman Mead. The acquisition significantly enhances Pythian’s Oracle footprint, expands its geographic market presence in the United Kingdom and Europe, and strengthens the company’s Oracle Database@Google Cloud capabilities, a powerful multicloud partnership that accelerates modernization for customers. The acquisition combines Pythian's expertise in data, analytics and AI with Rittman Mead's Oracle specializations. Pythian, a 2025 Google Cloud Partner of the Year for Databases, will add more Oracle ACEs to its roster of consultants. The synergy resulting from the acquisition will enable Pythian to offer an even wider range of advanced Oracle services to a broader and more global customer base.

GlobeNewswire
Apr 15th, 2025
Pythian Doubles Down on AI-Readiness, Expanding AI Practice with Valuable Services and Expertise

Shishir Suresh will join Pythian as Senior Director, AI Services, along with Karen Pfeifer, Field CAIO.

ChannelE2E
Apr 10th, 2025
Pythian Launches Agentspace QuickStart to Accelerate Enterprise AI Adoption

Pythian has introduced its Agentspace QuickStart service, a new offering that helps enterprises rapidly implement Google's Agentspace platform.

The Mainstream
Apr 7th, 2025
Pythian Partners with GigaOm to Drive Ethical Generative AI Adoption

Attendees of Pythian's AI Workshop will acquire essential knowledge of AI technologies, discover how to pinpoint applicable use cases, and create a strategic roadmap for implementation utilizing the AI Maturity Framework.

INACTIVE