Job Detail

Sr.Site Reliability Engineer - NowWiN Technologies C1 - D

Date Posted: Mar 31, 2022
Login to View Salary

Job Detail

  • Location:
    Chennai, Tamil Nadu, India
  • Company:
  • Type:
    Contract - Remote
  • Shift:
    First Shift (Day)
  • Career Level:
    Experienced Professional
  • Positions:
    1
  • Experience:
    7 - 12 years
  • Gender:
    No Preference
  • Degree:
    Certification
  • Apply Before:
    Apr 30, 2022

Job Description

Roles &
Responsibilities:

  • Accountable and responsible for the Monitoring infrastructure scalability, resilience and security ensuring
  • Monitoring/Automations services delivery
  • Provides technical leadership, guidance and training for the Monitoring/Automations team. Including managing
  • priorities, monitors and manages execution timelines, and quality of work
  • Automating work including infrastructure needs, testing, fail-over mitigation
  • Application and architecture performance and code profiling
  • Developing CI/CD processes to improve monitoring cadence
  • Working closely with internal partners and teams to ensure that we ship software that meets security, SLA, and
  • performance requirements
  • Debugging complex problems across an entire stack and creating solid solutions
  • Post incident-reviews to find out what’s working and what’s not and improving them by filling the gaps in the
  • process
  • Writing, updating, and user documentation, including runbooks/playbooks
  • Using Chaos Engineering to test what you build under real-world conditions
  • Running monthly Chaos Engineering “Game Days”
Candidate Skills:
  • 7 years of experience with software engineering, software development, or system operations
  • Expert experience with monitoring, dashboarding, and observability with Datadog, Prometheus, and Mobile
    Application monitoring (iOS and Android)
  • Expert experience with building incident management and integrations between Datadog, Jira Service
    Management, Ops Genie, and Statuspage
  • Experience designing, building, and operating large-scale production Software-as-a-Service platforms
  • In depth practical knowledge of infrastructure management, access, and monitoring systematic flows
  • Production experience with DevOps or site reliability engineering running web and/mobile applications
  • Excellent communication skills, both verbal and written
  • Advanced experience on Terraform
  • Hands-on expert experience with AWS cloud platform
  • Experience debugging complex problems, including application running on Kubernetes platform and EC2 instances
  • Knows their way around a Unix/Linux shell, can write shell scripts, and understands Linux internals
  • A solid understanding NodeJS and Java
  • Moderate understanding on how database works, writing queries to interact with databases, and troubleshooting
    complex data layers. Open-source databases (MySQL, Postgres, Redis, Cassandra, etc.)
  • A solid understanding of networking and core Internet protocols (e.g. TCP/IP, DNS, SMTP, HTTP, and distributed.










Benefits

Job is expired Drop you CV

Company Overview

Chennai, Tamil Nadu, India

C1 is ISO 27001-certified and HIPAA compliant. We enjoy repeating business from every one of our clients due to our uncompromising focus on complete client satisfaction. We achieve this through rigorously high standards for team expertise, emphasis o... Read More

Related Jobs

Google Map