• NecoJobs Logo - Nepal's Leading Job Portal

Site Reliability Engineer (SRE)

UBA Solution

  • UBA Solution Logo
  • Share
  • AvailabiltyFull Time
    CategoryIT Jobs
    Salary NegotiableYes
    Job LevelSenior Level
    Job LocationImadol, lalitpur
    No. Of Vacancy1
    Education LevelBachelor
    Experience RequiredMore Then 6

    Skills :-

  • Incident management and production support expertise
  • Linux & systems knowledge
  • Cloud platforms
  • Containers & Kubernetes
  • Monitoring, logging & alerting
  • Reliability concepts
  • Automation & CI/CD
  • Requirements :-

    ● 6+  years of working experience in monitoring, scripting, system engineering and troubleshooting.

    ● Solid grasp of Windows & Linux systems including networking concepts.

    ● Experience with monitoring tools such as CloudWatch, Site24x7, VictorOps, Sentry, ELK Stack, Prometheus, Grafana, New Relic, Pingdom, TICK stack , etc.

    ● Experience with Amazon AWS (EC2, S3, CloudFront, Route53, RDS, autoscaling, etc) and other cloud platforms

    ● Knowledge in Agile development environment

    ● Strong Presentation and communication skills (English required)

    ● Strong collaboration skills[RK1] [RG2]

    ● Excellent troubleshooting and analytical skills

    ● Strong sense of urgency and ownership over critical problem areas.

    ● Experience with managing 24x7 rotational team

    ● Excellent time management and organizational skills with an aptitude towards creative problem solving

    Job Responsibility :-

    • Lead and manage the 24x7x365 rotational SRE team.
    • Coordinate daily operations, reporting, and performance reviews with the SRE Manager.
    • Act as the primary technical escalation point for critical production issues.
    • Ensure system uptime, reliability, and operational excellence.
    • Support execution of SRE strategy and organizational goals.
    • Mentor and guide SRE team members on technical and operational challenges.
    • Collaborate with production engineering and cross-functional teams.
    • Drive incident management, root cause analysis, and postmortems.
    • Improve monitoring, alerting, and automation practices.

    Who are looking for :-

    Looking for a Site Reliability Engineer (SRE) skilled in Linux, cloud, automation, containers, CI/CD, and monitoring, who can ensure reliable, scalable systems and handle incidents effectively.

    • Share

    Similar Jobs

    Web Application Developer

    IT Jobs

    Full Time

    Director of Engineering

    IT Jobs

    Full Time

    Principal Engineer

    IT Jobs

    Full Time

    Site Reliability Engineer

    IT Jobs

    Full Time

    • Upload Your CV

    Example: 98********* / 97**********