
Posted 13 hours ago
Site Reliability Engineer (SRE)
ScotiabankSite Reliability Engineer (SRE)
Requirements
3+ years real time streaming data operations, 7+ years production support and troubleshooting, 2+ years Apache Kafka, 3+ years Splunk or Dynatrace, 5+ years CI/CD pipelines, Java debugging skills, RESTful Services understanding, SQL proficiency, Cloud microservices knowledge (GCP/Azure), UNIX shell and Python scripting, Post-secondary degree in CS, Engineering, or Math
Skills
KafkaSplunkPythonGCPAzureJenkins
About the role
Responsibilities
- Implement, measure, and gather insights from Operational Level Indicators to improve service availability, performance, and resilience.
- Automate repetitive tasks to reduce toil and implement jobs according to established Runbooks.
- Lead and perform Disaster Recovery (DR) exercises and engage teams for technical validation.
- Participate in technical vulnerability assessments and provide recommendations for remediation and system enhancements.
- Manage communication regarding production releases and their impact on service availability for internal and external stakeholders.
- Act as a Subject Matter Expert (SME) on performance, scalability, reliability, monitoring, and security following SRE best practices.
Requirements
- 7+ years of experience in production support and in-depth troubleshooting of major incidents.
- 3+ years of experience in real-time streaming data operations.
- 2+ years of experience using Apache Kafka for event management.
- 3+ years of experience with monitoring and alerting tools such as Splunk or Dynatrace.
- 5+ years of experience with software build and CI/CD deployment pipelines (e.g., Jenkins, Gradle, Maven, or Bitbucket).
- Proficiency in UNIX shell scripting and Python.
- Ability to read Java code for troubleshooting and debugging purposes.
- Strong understanding of RESTful Services and SQL proficiency with relational databases.
- Knowledge of cloud microservices (GCP and/or Azure).
- Post-secondary degree in Computer Science, Engineering, or Mathematics.
Preferred Qualifications
- Completion of a Confluent Certified Administrator for Apache Kafka certification.
Benefits
- Competitive rewards program including bonus, flexible vacation, personal, and sick days.
- Comprehensive benefits starting on day one.
- Upskilling opportunities through online courses, cross-functional development, and tuition assistance.
- Inclusive culture with access to various Employee Resource Groups (ERGs).
- Dynamic workspace featuring collaboration spaces and free tea and coffee.
About the Company
Scotiabank is a leading bank in the Americas, dedicated to helping customers, families, and communities achieve success through a broad range of financial products and services, including personal and commercial banking, wealth management, and capital markets.
ScoutJobs Agent
Get matches like this delivered daily
Sign up free — we'll pull jobs that fit your CV from across the web and rank them for you.
Get started — it's freeSite Reliability Engineer (SRE)
Scotiabank · Toronto
