Senior Reliability Engineer
Mastercard Payment Services Norway AS
- Frist Snarest
- Ansettelsesform Fast
Senior Reliability Engineer (Jenkins)
Job Description Summary
The Mastercard Payment Services Team is looking for a Senior Reliability Engineer who will be based in Oslo – Norway.
- Are you a born problem solver who loves to figure out how something works?
- Are you experienced in ITIL standards and “Service Event Management” practices?
- Do you have a low tolerance for manual work and look to automate everything you can?
“Business Operations” is leading the DevOps and operational transformation at Mastercard through our tooling and by being an advocate for change & standards throughout the development, quality, release, and product & engineering organizations. In parallel with our constant dedication on operational excellence, we are also highly focused on identifying any potential technical/operational risks, reporting them for awareness and following them up for mitigation.
We need team members with an appetite for change and pushing the boundaries of what can be done with automation. Experience in working across multiple departments to prioritize needs and to build relationships is a must.
You can click the link below for more information about BizOps:
https://youtu.be/wn_SqCEA5Ck
Responsibilities
• Involve in knowledge transfer sessions for new systems/platforms/applications.
• Engage in and improve the whole lifecycle of services—from inception and design, through deployment, operation and refinement.
• Align product and customer focused priorities with operational needs to protect the platform and customer experience.
• Proactively manage production events and involve in change activities to maximize customer experience and increase the overall value of supported applications.
• Practice sustainable and timely incident response (7/24/365) according to ITSM and Mastercard standards, create/update necessary incident and related problem records, engage with global and local teams to facilitate the incident resolution.
• Ensure necessary internal and external incident notifications are performed according to SLAs.
• Perform necessary incident follow-up tasks with relevant teams such as root cause analysis, preparation of incident documentation and perform blameless postmortems.
• Take a holistic approach to problem solving, by connecting the dots during a production event thru the various technology stack that makes up the platform, to optimize mean time to recover.
• Support daily operations with a hyper focus on triage and then root cause by understanding the business implications of our products. Shift left to be more proactive and upfront in the development process.
• Ensure any new products or product enhancements have the appropriate operational support structure to deliver promised business outcomes.
• Ensure any documented service commitments are monitored and appropriate mitigation steps taken to restore or maintain service commitments.
• Maintain services once they are live by measuring and monitoring availability, latency and overall system health.
• Work with a global team spread across tech hubs in multiple geographies and time zones.
Required Skills:
- Scripting
- CI/CD Pipelines
- Monitoring tools
- Strong Jenkins experience
- Unix/Linux Operating commands
- Network logic understanding
- Networking concepts
- F5 understanding,
Qualifications
• BS degree in Computer Science or related technical field or equivalent practical experience.
• Extensive experience in ITIL standards, service event management activities, incident management and application development lifecycle.
• Proven track record in supporting production applications to facilitate change and incident activities.
• Understanding algorithms, data structures, scripting, pipeline management, and software design.
• Systematic problem-solving approach coupled with strong communication skills and a sense of ownership and drive.
• Experience in dealing with difficult situations and making decisions with a sense of urgency.
• Interest in understanding, analyzing and troubleshooting large-scale distributed systems.
• Experience in Site Reliability Engineering (SRE) practices and “Run” activities.
• Experience in customer support and delivery roles.
• Experience with financial oversight and process efficiencies.
Ferdigheter
- Algoritmer
- Behandling av IKT-feil
- CI/CD (Continuous Integration and Continuous Delivery)
- Data struktur
- IT Infrastructure Library (ITIL)
- Jenkins (Software)
- Nettverkskonsepter
- Programvaredesign
JobbMatch
BetaEr du kvalifisert for jobben?
Nysgjerrig på om du kvalifiserer til denne jobben? Med JobbMatch får du umiddelbar tilbakemelding på hvor godt din profil matcher stillingsutlysningen.
- Sektor: Privat
- Sted: 0978 Oslo
- Hjemmekontor: Delvis hjemmekontor, På kontoret
- Bransje: Bank, finans og forsikring, IT, IT - programvare
- Stillingsfunksjon: Drift/Operations
- Arbeidsspråk: Engelsk
Nøkkelord
Jenkins, DevOps, Site Reliability Engineer, CI/CD, Automation
Annonseinformasjon
- FINN-kode 450277428
- Sist endret