Company Overview
We are a US-based consulting, data science, and technology services firm. We help our clients achieve competitive advantage through end-to-end digital transformation, and we work across the life sciences and healthcare, financial services, telecom, and product engineering sectors.
Job Description
Job Title: DevOps & Production Support Lead
Location: Pune, India
Position Summary: Full-time role
Experience:
● 6-8 years of experience as a DevOps engineer working in a cloud-based production environment
Must-have Technical Skills:
● Hands-on experience with the design and implementation of Continuous Delivery and DevOps solutions
● Orchestration tools such as Docker Swarm, Kubernetes, Mesos, or Nomad
● Hands-on experience managing Linux systems is required
● Good understanding of SSL, ELK, and Tomcat
● DevOps automation tools such as Ansible, Puppet, Jenkins, Git, Terraform, and Linux or Python scripting
● Public cloud: experience with AWS or Azure would be a plus
● Database: NoSQL (MongoDB)
● AWS services experience: Terraform (or CloudFormation), EC2, S3, CloudFront CDN, ELB, SNS
● Strong hands-on knowledge of networking protocols and technologies, including VPC, VPN, firewalls, REST, SOAP, TCP/IP, DNS, SMTP, HTTP, etc.
Requirements
Required Experience:
● Experience working in a cloud-based production environment
● Working experience in L2 support/incident management
● Experience working with a tech stack with a focus on uptime, SLAs, security, and Site Reliability Engineering
● Involvement in client calls covering the weekly support summary, SLA targets against achieved and breached SLAs, corrective actions taken for non-compliance, risk issues, etc.
● Developing and driving incident management processes, playbooks and stakeholder communication mechanisms
● Should have good knowledge of networking
● Strong debugging and diagnosis skills (e.g. coding, scripting)
● Proactive and team-oriented approach
● Excellent communication skills
● Knowledge of Agile and Scrum
● Git, Jira, Confluence, Jenkins
● Prior experience providing L2 support for products managed 24*7 under an L1/L2/L3 support roster model
Additional Information
Responsibilities:
● Investigate and resolve customer incidents and cases and provide accurate guidance
● Deploy product updates, identify production issues, and implement integrations that meet customer needs
● Provide operational support for applications hosted in private cloud and Docker container platforms, adhering to DevOps standards (CI/CD process)
● Troubleshoot application and platform issues in cloud infrastructure, and coordinate issue resolution with operations, functional, and technical teams
● Report support issues and threats to the process to the appropriate team for further investigation
● Reproduce client environment data, configuration and setup in order to investigate possible issues
● Perform root cause analysis for production errors
● Create runbooks and wikis covering incidents, troubleshooting performed, etc. Be a proactive member of your team by sharing knowledge
● Be flexible to work in support shifts
● Serve as a mentor to less experienced software engineers
● Work in a 24*7 environment (roster on a rotational basis)