1272248 – DEVOPS-SITE-RELIABILITY-ENGINEERING

Pune & Chennai,
Minimum Ex. 5 year - Maximum Ex. 8 years

Job Description

Location – Pune/Chennai

Grade – P3

Sr. SRE

Mandatory?

Expected Proficiency [1-5 scale]; 5 highest

Infrastructure Monitoring – Enterprise solutions(Dynatrace/DATAdog/New Relic)

Yes

Kubernetes & Docker

Yes

Cloud(AWS EKS/AWS VPC/AWS EC2)

Yes

SIEM – Splunk

Yes

Configuration & Release Management

Yes

Infra as Code(Ansible/Terraform)

Yes

Chaos Engineering Tools

Good to have

Unix/Linux Scripting

Good to have

Gitlab

Good to have

Communication

Yes



Experience

6+ years

We are looking to speak with candidates who are:

Highly experienced SRE engineers who have excellent working knowledge of Cloud. You will be expected to interact with clients on a daily basis to discuss and share plans, outcomes etc. You should have a very strong technical flair and passion to learn new technologies, find innovative solutions to problems, and have the ability to keep abreast with emerging technologies. Work independently & collaborate with customer teams on daily basis to understand requirements, plan & implement tasks, and provide regular status updates

Technical / process skills:

Must have:

Has experience of designing, building, and/or operating large-scale production systems

Storing understanding of system design principles

Strong knowledge of cloud native ecosystems like containerization, packaging services & Orchestration such as Docker, Kuberenetes, helm etc.

Strong experience in unix/linux, networking along with scripting with shell, bash

Understands networking and messaging, especially between services

Experience with build automation tool like maven, gradle

Has experience automating infrastructure, testing, and deployments using Infrastructure as Code tools like Ansible, Terraform etc

Experience building self healing systems with the help of strong SRE practices

Developed dashboards & automated alerts to measure uptime, monitor services and remediate issues

Experienced in integrating observability, monitoring and logging tools with services across a mix of open-source tools such as Prometheus, Grafana, ELK, Jaeger etc., and/or commercial platforms such as New Dynatrace etc.

Strong knowledge in setting up Cloud(AWS prefered) infrastructure & services using best practices

Experience implementing scalable, resilient, and secure infrastructure considering industry best practices and following the processes defined

Familiarity with Security best practices & experience managing secure systems

Good to have:

CKA Certification Good to have

Familiarity with chaos engineering tools

Hands-on experience in setting up & managing production-level Kubernetes clusters

Knowledge of service mesh architecture eg: ISTIO

Experience working with Kafka Clusters

Soft skills:

Excellent Communication Skills

Ability to gather business requirements and interface with customer teams as required

Excellent analytical and problem-solving skills

Willingness to explore and learn new technology, POC

Team player

Job Overview

Industry : -
Country : -
Vacancy : 1
Job nature : -
City : Pune & Chennai
State : -

1272248 – DEVOPS-SITE-RELIABILITY-ENGINEERING

Job Description

Job Overview

About Us

Contact Us

Expertise

Jobs

Employers

Life@AddRec

Contact Us

Let’s Go Social

Office Hours

1272248 – DEVOPS-SITE-RELIABILITY-ENGINEERING

Job Description

Job Overview

Apply For a Job

About Us

Contact Us

Expertise

Jobs

Employers

Life@AddRec

Contact Us

Let’s Go Social

Office Hours