Observability Expert / Site Reliability Engineer, Dubai, UAE

Position: Observability Expert / Site Reliability Engineer

Date posted: 2023-05-15

Industry: other

Employment type: Full Time

Experience: 3 to 4 years

Qualification: Bachelor’s Degree holder

Salary: AED 5000 to 10000

Location: Dubai, United Arab Emirates

Company: Confidential

Description:

We are hiring an Observability Expert / Site Reliability Engineer for the Dubai location.

Location: Dubai, UAE

Contract: 1 year, extendable

Experience: 3 to 4 years

Notice period: 30 to 45 days only (no longer)

The client is looking for candidates with hands-on experience working with and troubleshooting the tools listed below; hands-on experience with the items marked "(must)" is mandatory. Candidates should already be experts in Azure monitoring and OCI observability for setting up, configuring, and monitoring infrastructure-, application-, and business-level products.

Technical Skillsets

  • Observability and AIOps applications
  • CloudWatch (must), Grafana (must), Prometheus (must), Elastic, Logstash, Kibana, Azure Monitor (must), OCI Observability (good to have), BMC Helix/TrueSight Operations Management – Patrol Agent (good to have)
  • Basic skill sets needed to manage Observability and AIOps applications:
  • OS – Windows, Linux, Solaris
  • Database – Oracle, SQL, time-series DB
  • Kubernetes and Docker
  • Cloud – Azure, AWS, Oracle
  • ITSM – Remedy
  • Jira and Confluence
  • Collaboration tools – Teams, Slack
  • Outlook, Word, PowerPoint, Project, Visio, etc.

Requirements and skills

  • Proven work experience as an Observability Expert or Site Reliability Engineer.
  • Collaborate and communicate asynchronously.
  • Document all the things so you don’t need to learn the same thing twice.
  • Have an enthusiastic, go-for-it attitude.
  • An understanding of the payments industry is an added advantage.

Responsibilities

  • Site Reliability Engineer responsibilities include monitoring computer systems and building alerts for the various operational issues that can arise at Network International.
  • Availability Management
  • Ensure that the availability of applications exceeds the defined service levels.
  • Maintain the Enterprise Management solutions environment during business hours and off-hours.
  • Performance Management and Quality of Service
  • Review performance regularly and ensure proactive action and communication to avoid disruption to services.
  • Lead new initiatives and process improvements.
  • Prepare, review, and send various KPI reports.
