The Monitoring & Tooling team provide a service of systems monitoring to TalkTalk IT Operations and as such are responsible for the availability and performance of the systems monitoring software (including Nagios, AppDynamics, Eggplant and others for alert aggregation/event correlation, log monitoring etc) as well as deploying and configuring the software across the TalkTalk estate based on the requirements of all teams across IT Ops and also the business.
As part of the L3 monitoring team, the IT Ops Monitoring & Tooling Specialist will help to define and govern strategy for infrastructure and application tooling. They will seek to improve system stability and to introduce efficiencies whilst providing technical guidance, expertise, and support to the wider team.
Working with a team of onshore and offshore Subject Matter Experts and engineers that are responsible for managing and configuring monitoring tools used for business-critical applications, databases, infrastructure, and services across the IT estate. This role is key to evolving working practice through end to end monitoring.
Engaging with project, design, development, service, and operational teams to define and deliver monitoring solutions that meet the requirements of projects and major changes in addition to driving improvements in the process’ and practices for such activities.
Working in a fast paced, technical environment, the IT Ops Monitoring & Tooling Specialist will require an understanding of a broad range of different technologies but is expected to have considerable experience in either infrastructure monitoring or application monitoring. They will engage with stakeholders at multiple levels and build appropriate and effective business relationships, communicating and presenting concepts and ideas effectively to the IT Enterprise Monitoring & Tooling Manager and others within TalkTalk Technology.
Accountable for the performance and availability of monitoring tools used for business-critical applications, databases, and infrastructure across the IT estate.
Works with the onshore monitoring team and offshore L1 & L2 teams, consisting of specialised Subject Matter Experts and engineers, to ensure the monitoring tools used for business-critical applications, databases, and infrastructure across the IT estate are available and performing well.
Drives the investigation and implementation of technologies to provide TalkTalk with component, transactional, volumetric, and synthetic end to end monitoring tools.
Engages with IT Ops teams to define the standards and best practice for implementation and usage of tools used across the business.
3rd line support and maintenance of the IT Ops Monitoring tools.
Is responsible for controlling access to tools, ensuring teams have the appropriate permissions they need to manage the monitoring of their services or to create dashboards, alerts, actions etc.
Engages with internal IT Ops teams, projects, design, and delivery to contribute to the technology roadmap and to ensure operational needs are understood.
Understands and assesses the implication of new technologies, products, and system changes, working with all interested parties to ensure acceptance and transfer into use.
Develops knowledge to stay in line with newly adopted technologies, for example Pega and Azure.
Carries out trend, capacity, and performance analysis to identify areas for improvement.
Seeks opportunities to drive efficiencies within TalkTalk and in the stability of supported applications.
Places security at the forefront of all thinking, working with the relevant technical teams on application security controls, providing understanding on latest industry developments and how these shapes their area of responsibility.
Always puts the customer first.
Absorbs complex information and communicates effectively, developing and maintaining relationships with stakeholders at multiple levels.
Creates documentation and training materials for the wider team.
Supports the change process through assessment and approval of requests for change and attendance as necessary to Change Approval Boards.
Provides regular updates to the IT Enterprise Monitoring & Tooling Manager.
Provides support to and cover for the IT Enterprise Monitoring & Tooling Manager as necessary.
Reviews Requests for Change and High-level/Low-level Design Documents
If this sounds like you then apply away - alternatively, you can catch me on email@example.com