Observability Engineer

Posted 70 days ago

Job Description

The Observability Engineer designs and manages monitoring solutions at PCI Pharma Services and leads migration efforts to improve incident detection and operational visibility across environments.

Responsibilities:

  • Lead the enterprise-wide migration from SolarWinds to Dynatrace, including architecture design, agent deployment, and dashboard development
  • Design and implement comprehensive monitoring coverage for servers, network devices, applications, databases, and cloud resources across 16 global sites
  • Develop custom dashboards, alerts, and automated remediation workflows aligned with operational KPIs
  • Establish baseline metrics and anomaly detection rules for proactive incident identification
  • Integrate the observability platform with ServiceNow for automated incident creation and enrichment
  • Configure monitoring for IT/OT environments including manufacturing systems, SCADA, and industrial control systems
  • Implement synthetic monitoring for critical business applications and user experience tracking
  • Design log aggregation and correlation strategies for security event monitoring in coordination with the SECURE team
  • Create runbooks and standard operating procedures for alert response and escalation
  • Provide a 24x7 monitoring strategy and coordinate with the global follow-the-sun operations team
  • Integrate backup monitoring via Veeam reporting and alerting for RPO/RTO compliance visibility
  • Optimize monitoring costs through efficient data retention policies and license management
  • Train operations staff on platform usage, dashboard interpretation, and alert response procedures

Requirements:

  • Bachelor's degree in Computer Science, Information Technology, or a related field
  • 5+ years of experience in infrastructure monitoring and observability
  • Hands-on experience with Dynatrace including OneAgent deployment, Davis AI, and dashboard development
  • Strong experience with SolarWinds (NPM, SAM, VMAN) for migration planning
  • Proficiency in monitoring network infrastructure (Cisco switches, routers, firewalls)
  • Experience monitoring VMware vSphere environments
  • Knowledge of cloud monitoring for Azure and AWS workloads
  • Strong scripting skills (PowerShell, Python, Bash) for automation
  • Understanding of SNMP, WMI, and API-based monitoring approaches
  • Experience with log management and SIEM integration

Benefits:

  • Equal Opportunity/Affirmative Action Employer
  • Quality and operational excellence
  • Industry-leading customer experience
  • Fair and competitive rewards program
  • Professional development opportunities