The following job is no longer available:
Site Reliability Engineer - Enterprise API Platform

Site Reliability Engineer - Enterprise API Platform

Posted 1 April by eFinancialCareers
Ended
At Broadridge, we've built a culture where the highest goal is to empower others to accomplish more. If you're passionate about developing your career, while helping others along the way, come join the Broadridge team.

Role Overview

APIs are critical and a foundational component to digital business implementation, application integration, multi-experience development, and ecosystem enablement, with a defined value to the business here at Broadridge. Broadridge is focused on having an established API program delivering measurable and monetized business outcomes while supporting a foundation of modern application architecture, application composition via APIs, multi-experience development, integration, and ecosystems. To support Broadridge's API enablement, this role will be a member of the Enterprise API Platform Team to implement and support the long-term strategy and business outcomes of APIs at Broadridge.

The role of the Enterprise API SRE is to implement solutions supporting the Future State Architecture across Broadridge's API platforms. This role will leverage and reinforce the reference API architecture, lifecycle, and mechanics adhering to defined standards of API management and delivery. The role will include the design, development, implementation, and support of standard API contracts applicable to various applications and programming languages to increase the sharing of functionality across the enterprise.

Broadridge currently has multiple applications that maintain services for internal and external consumption with a focus on increasing the use and consumption of these services while introducing additional revenue streams across the enterprise. Outcomes will be aligned and measured against the business requirements and revenue goals of Broadridge.

The Enterprise API Platform Team Site Reliability Engineers (SREs) are responsible for improving system reliability and resilience to make it faster and easier to develop and deploy new software capabilities. SREs focus especially on building automation to reduce manual effort and prevent operational incidents.

Key Responsibilities
  • Track performance against SLOs in partnership with monitoring teams or other stakeholders, and ensure systems continue to meet SLOs over time.
  • Work with product owners to define service level objectives (SLOs) for system operations.
  • Review modules for quality assurance and check compliance with application standards and SLOs.
  • Design, code, test, and deliver software to automate manual operational work.
  • Create dashboards and reports to communicate key metrics.
  • Create software to improve the performance, scalability, and stability of systems.
  • Drive continuous improvement in API software quality and infrastructure reliability and resilience.
  • Collaborate with development teams to promote the concept of reliability engineering during all phases of the software development lifecycle to detect and correct API performance issues and meet availability goals.
  • Perform analytics on previous incidents to understand root causes and better predict and prevent future issues.
  • Use automation to reduce the probability and/or impact of problem recurrence.
  • Identify, evaluate, and recommend monitoring tools and diagnostic techniques to improve API observability.
  • Create confidence and certainty in deployments with immutable infrastructure built and tested using CI/CD.
  • Understand integration Architecture concepts and patterns, including Microservices, Service Oriented Architecture, Batch Integration, RESTful JSON services, etc.
  • Integrate solutions with COTS applications, platforms, and/or external systems (SAP Concur preferred).
  • Participate in the technical project planning process with IT business analysts and other business partners.
  • Assist in the testing and deployment of new modules as well as applicable upgrades.
  • Partner with Product Management in identifying key technical risks and mitigation plans for the same.
  • Participate in operational support and 24x7 on-call rotation shifts for supported systems and products.
  • Participate in system design consulting, platform management, capacity planning, and launch reviews.
  • Collaborate and share lessons learned regarding API performance and reliability issues with all stakeholders including developers, other SREs, operations teams, and project management teams.
  • Participate in communities of practice to share knowledge and foster continuous improvement.
Profile Needed

Hard Skills:
  • 5+ years of SRE and/or DevOps experience
  • Proven AWS cloud infrastructure and Linux Server skills for large-scale distributed cloud applications
  • Network/Security troubleshooting
  • Strong knowledge of API Security (HTTP, TLS, PKI, etc...)
  • Experience with JavaScript, NodeJS, ExpressJS, Python, and Bash
  • Proven experience with monitoring, observability, and alerting tools (Splunk, DataDog, PagerDuty, etc...)
  • Strong experience with orchestration tools and SCM (Jenkins, Git, JFrog/Artifactory, etc...)
  • Strong knowledge of container orchestration and configuration management tools (Kubernetes, Terraform, etc...)
Soft-Skills:
  • Strong problem-solving and analytical skills.
  • Self-starter approach and ability to solution "outside of the box".
  • Strong interpersonal written and verbal communication skills.
  • Ability to work within an international team landscape but also independently.
  • Ability to learn quickly and multi-task in a fast-paced changing environment.
  • Interest in continuously learning new skills and technologies.
Qualifications
  • Certifications related to API development, DevOps, Cloud Infrastructure, and Security is a plus.
  • Bachelor's degree a plus.
  • Mulesoft Anypoint Platform experience is a plus.
Hybrid Flexible at Broadridge

We are made up of high-performing teams that meet in person to learn and collaborate as needed. This role is considered hybrid, which means you'll be assigned to a Broadridge office and given the flexibility to work remote.

Broadridge associates helped us envision our Connected Workplace - a work model that allows associates around the globe, dependent upon their role responsibilities, take advantage of the benefits of both on-site and off-site work to support our clients, one another, and the communities where we live and work. Our Connected Workplace is grounded in the concept of FACS: Flexible, Accountable, Connected, and Supported, which is our commitment to our associates. FACS supports our strong culture and a

Reference: 52375605

Please note Reed.co.uk does not communicate with candidates via Whatsapp, and we will never ask you to provide your bank, passport or driving licence details during the application process. To stay safe in your job search and flexible work, we recommend visiting JobsAware, a non-profit, joint industry and law enforcement organisation working to combat labour market abuse. Visit the JobsAware website for information and free expert advice for safer work.

Report this job