The Job Auction Country

Share On

This listing has Ended
Go to My Listings

Site Reliability Engineer (SRE) - Advisor 

resume-library  |  United States  |  

United StatesUnited States (US)
Work Type:
Work Time:
Full Time


Site Reliability Engineer (SRE) - Advisor

Site Reliability Engineer (SRE) - Advisor

Position Description

CGI has an immediate need for a Site Reliability Engineer (SRE) - Advisor to join our financial services team in one of our selected CGI locations. Due to the current COVID-19 status, candidates will not be required to work within the physical work location at this time. When COVID-19 restrictions are lifted, they will be required to be located within the proximity of the assigned CGI location.

This is an exciting opportunity to work in a fast-paced team environment supporting one of the largest leaders in the secondary mortgage industry. We take an innovative approach to supporting our client, working side-by-side in an agile environment using emerging technologies.

" We partner with 15 of the top 20 banks globally, and our top 10 banking clients have worked with us for an average of 26 years!

" We have over 73,000+ CGI Members in 40 countries and over 5k+ loyal Clients who are leveraging our end-to-end services across the globe


As a valued colleague on our team, you will act as a team lead in the designing, producing, testing, or implementing software, technology, or processes, as well as lead processes for creating and maintaining IT architecture, large scale data stores, and cloud-based systems.

You will apply your expertise in software and systems engineering to ensure that both our internally critical and externally visible systems meet the appropriate performance needs of our users. You will serve as a champion of service availability, efficiency, automation, monitoring, and capacity management. Specifically, you will leverage your skills and experience in Amazon Web Services, software development with Java and/or Python, customization in Splunk and/or Dynatrace, and automation in Selenium and/or Blue Prism (among others) to enable increased feature velocity and continuous improvement.

Your future duties and responsibilities

The Service Reliability Engineering (SRE) Advisor role will offer you the flexibility to make each day your own, while working alongside people who care, so that you can deliver on the following responsibilities

" Independently determine the needs of the customer and create solution frameworks.

" Design and develop moderately complex software solutions to meet needs.

" Use a process-driven approach in designing and developing solutions.

" Implement new software technology and coordinate end-to-end tasks across the team.

" May maintain or oversee the maintenance of existing software.

Required qualifications to be successful in this role

" 6+ years of relevant experience

" Experience creating disaster recovery plans and executing failover tests

" Experience with capacity planning and performance testing / engineering tools, such as JMeter and / or LoadRunner

" Experience with Failure Mode Effect Analysis (FMEA) and Chaos testing / engineering tools, such as Gremlin, Chaos Monkey, Chaos Toolkit, AWS Fault Injection Service (FIS)

" Experience working with code repositories such as Bitbucket and / or GitHub

" Experience with programming in Java and / or Python

" Understanding of Java performance monitors (JVM, GC, Heap Size, Message Broker)

" Experience with building automation solutions using tools such as BluePrism and / or Selenium

" Understanding of fault tolerant / resilience architectural design patterns, such as Bulkhead, Circuit-breaker, Retry, Timeout, etc

" 2+ years of experience leading teams in applications development, infrastructure, or operations

" 4+ years of experience working in a Scaled Agile Framework (SAFe), Scrum, or Kanban environment using Jira and Confluence

" 3+ years of experience supporting AWS cloud applications and technologies, including containerization, virtualization, microservices, and server-less architecture

" 2+ years of experience with J2EE frameworks

" 2+ years of experience application monitoring / observability, including building dashboards, establishing service level indicators / objectives / agreements (SLIs / SLOs / SLAs), and logging / tracing using

" 2+ years of experience with CI/CD / DevOps deployment tools

" 2+ years of experience with application production / operations support, including incident response, problem management, runbooks, and knowledge articles

" 2+ years of experience with post-mortems, root-cause analysis (RCA), and / or AWS Correction-of-Errors (CoE)

" Understanding of error budgeting and toil reduction

" Excellent problem-solving skills, proactivity in resolving issues / blockers

" Excellent verbal / written communication, presentation, and relationship management skills, and ability to collaborate with multiple stakeholders

" Ability to work independently with minimal guidance

" Understanding of current industry trends, and a track record of spearheading innovation and leading solutions

" Strong ability to persuade and influence without authority

" Experience using AWS Elastic Container Service (ECS) and Fargate

" Experience using frameworks JavaScript, Spring Boot / Spring Cloud, and REST

" Experience using tools such as AWS CloudWatch, Splunk, Dynatrace, CatchPoint, and / or Datadog

" Excellent understanding and demonstrated experience in the use of DevOps / CICD tools like Jenkins, Terraform, UrbanCode Deploy (UCD), and / or GitLab

" Skilled in using ServiceNow, Moogsoft, StatusHub, and / or Blameless

" Understanding of IT Service Management (ITSM)


" Bachelor s Degree in Computer Science, Management Information Systems (MIS), Systems Engineering, or related field

" Certification in AWS Solutions Architect Associate or Developer Associate, Splunk Certification Developer, or Sun Certified Java Developer