Dynatrace SME
The Data Resilience team are delivering an initiative that will maximise the capability of the Dynatrace platform and ensure full End-to-End Monitoring coverage exists across the Group's business-critical applications. We are seeking a skilled Dynatrace Admin/Consultant to play a key role in the enablement of observability across complex, hybrid cloud environments. The ideal candidate will have deep expertise in Dynatrace implementation (SaaS and On-Premises), monitoring configuration, and AI-driven insights to support performance, reliability, and business alignment.
You will:
* Collaborate with Application Stewards and Site Reliability Engineers (SREs) to confirm the list of critical assets in scope for monitoring verification and enhancement.
* Collaborate with EMAS to analyse Dynatrace coverage of critical assets.
* Work together with all parties to identify opportunities for enhancement to monitoring configuration and capabilities across critical applications.
* Participate in the review of roles and responsibilities between teams for observability and make recommendations for improvement of the standards with an emphasis on Operational Resilience.
* Play a key part in providing an automatically maintained end to end business flow for each important business process within the Dynatrace toolset.
* Collaborate with Application Stewards and Site Reliability Engineers (SREs) to ensure altering configuration is optimal and fit for purpose.
* Participate in workshops with third party software suppliers to review observability standards.
What You'll Need:
* The ability to demonstrate your extensive experience in designing and configuring the following within Dynatrace:
o Application performance monitoring
o Anomaly detection profiles
o Alerting rules and alert profiles
o Synthetic monitoring
o Log monitoring
o Real User Monitoring (RUM) to capture and analyse end-user experience across web and mobile applications.
o Utilisation of Dynatrace Query Language (DQL) and Grail for advanced data exploration and analytics.
o Integration of Dynatrace with external systems via APIs in complex environments.
* Ideally, you will have leveraged Davis AI to:
o Automatically detect anomalies and performance degradations.
o Correlate events across the full stack for root cause analysis.
o Provide predictive insights and proactive recommendations