Observability Expert / Site Reliability Engineer, Dubai, UAE
Position: Observability Expert / Site Reliability Engineer
Date posted: 2023-05-15
Industry: other
Employment type: Full Time
Experience: 3 to 4 years
Qualification: Bachelor’s Degree holder
Salary: AED 5000 to 10000
Location: Dubai, United Arab Emirates
Company: Confidential
Description:
We are hiring an Observability Expert / Site Reliability Engineer for the Dubai location.
Location: Dubai, UAE
Contract: 1 year, extendable
Experience: 3 to 4 years
Notice period: 30 to 45 days only (no longer)
The client is looking for candidates with hands-on experience working with and troubleshooting the tools marked "(must)" under Technical Skillsets, and who are already expert in Azure monitoring and OCI observability for setting up, configuring, and monitoring infrastructure, application, and business-level products.
Technical Skillsets
- Observability and AIOps applications
- CloudWatch (must), Grafana (must), Prometheus (must), Elastic, Logstash, Kibana, Azure Monitor (must), OCI Observability (good to have), BMC Helix/TrueSight Operations Management – Patrol Agent (good to have)
- Basic skill sets needed to manage Observability and AIOps applications:
- OS – Windows, Linux, Solaris
- Database – Oracle, SQL, time-series DB
- Kubernetes and Docker
- Cloud – Azure, AWS, Oracle
- ITSM – Remedy
- Jira and Confluence
- Collaboration tools – Teams, Slack
- Outlook, Word, PowerPoint, Project, Visio, etc.
Requirements and skills
- Proven work experience as an Observability Expert or Site Reliability Engineer.
- Collaborate and communicate asynchronously.
- Document all the things so you don’t need to learn the same thing twice.
- Have an enthusiastic, go-for-it attitude.
- Payment industry understanding will be an added advantage.
Responsibilities
- Site Reliability Engineer responsibilities, including monitoring computer systems.
- Building alerts for the various operational issues that can occur at Network International.
- Availability Management
- Ensure that the availability of Applications exceeds the defined service levels.
- Maintain the Enterprise Management solutions environment during business hours and off-hours.
- Performance Management and Quality of service
- Review performance regularly, and take proactive action and communicate early to avoid disruption to services.
- Lead new initiatives and process improvements.
- Prepare, review, and send various KPI reports.