Senior Site Reliablity Engineer/ Senior DevOps Engineer

Pearson Online & Blended Learning K-12

Job Description

Employ deep troubleshooting skills to improve the availability, performance, and stability of Services.


Implement automated tests, automated deployments, and operational tools


Collaborate with Product and Support teams to plan and deploy product releases


Ensure services are designed with availability and operational readiness and rigor


Implementation of proactive monitoring, alerting, trend analysis and self-healing systems


Define non-functional requirements as part of the product lifecycle to influence the new designs, standards, and methods for scalable, highly available distributed systems


Contribute to product development / engineering as needed to ensure Quality of Service of Highly Available services


Identifies, evaluates and executes preventive measures to minimize/avoid impact to the customers experience. Proactive v/s Customer escalated


Resolution of product/service defects or design changes, infrastructure changes, or operational changes


Partner with other SREsand lead by example - contributor more than a delegator


Set Strategic and Operational goals for the team, and work with the team to deliver on goals.


Work with Engineering leadership to build services that meet the requirements and need of the platform and application teams


Coding and Automation of Applications on Cloud Platform


Qualifications


Basic Skills and Qualifications


5 years of Systems/Applications design, development and support in 24x7 Production Services environments


BS in Computer Science, Computer Engineering, Math, or equivalent professional experience


Fluency with one or more current generation scripting language used by DevOps professionals (Python, Perl, PHP, Ruby) Java Development and/or .NET


Excellent troubleshooter, utilizing a systematic problem-solving approach


Demonstrated experience in designing, analyzing, and diagnosing large-scale distributed systems Windows Server and/or Linux systems internals (system libraries, file systems, client-server protocols)


Strong interpersonal and communication skills to work in a fast paced and rapidly changing dynamic environment


Strong skills in data structures, relational and NOSQL databases, distributed system architecture, and web architectures


The Exceptional candidate will demonstrate the following skills


Experience with elastically scalable, fault tolerance and other cloud architecture patterns


Experience operating on AWS (both PaaS and IaaS offerings)


Experience in both Windows (2k8R2 ) and Linux (centos) Security triage & forensic analysis


Experience with Continuous Integration and Continuous Delivery concepts, including Infrastructure as code.


Experience in Containerization concepts like Docker, and PaaS services on AWS.


NoSQL/Docker/Micro-services/Forensic-Analysis experience is a requirement


Proven strength in SaaS services, experience in massive scale web operations


FindTheBestJob is a free service and does not charge a fee at any stage of application or recruitment process. Don’t provide your bank account or credit card details to anyone during job application. FindTheBestJob does not guarantee the availability of a job since organizations may end applications earlier than due date.

Apply Now