Network Ops & Observability Architect
Posted 11ds ago
Employment Information
Report this job
Job expired or something wrong with this job?
Job Description
Network Operations and Observability Architect leading target-state monitoring strategy for large-scale consolidation program at a US-based Telco. Design observability architecture and minimize alert fatigue.
Responsibilities:
- Define and document the 'Greenfield' target-state observability architecture, including technical configuration templates, dashboard structures, and telemetry polling logic for the unified monitoring platform selected by the client
- Architect alert thresholds, suppression rules, and dependency mapping to minimize false positives and prevent alert fatigue within the NOC
- Design provisioning templates and API integrations to ensure that all sites exiting the migration factory are automatically onboarded into the target monitoring platform without manual intervention
- Collaborate with the ITSM/ITIL Architect during the design phase to establish automated alert-to-ticket routing logic, and partner with the Service Delivery Manager during rollout to ensure NOC runbooks align with the new telemetry model
- Contribute to architecture governance, including participation in architecture reviews, design approvals, and development of enterprise architecture standards, principles, and reference models. Collaborate with cross-domain architects (security, application, platform) on integrated design decisions.
Requirements:
- 10+ years of experience in Network Operations, Observability Architecture, or NMS Engineering within large-scale enterprise or service provider environments.
- Hands-on experience in delivering scalable monitoring templates and frameworks
- Strong expertise in modern unified observability platforms such as LogicMonitor, Datadog, SolarWinds, or Splunk, including network telemetry mapping.
- Advanced knowledge of alert correlation, SNMPv3, NetFlow, and API-based data extraction.
- Proven ability to design effective alert suppression logic that protects Tier 1 NOC teams from redundant or cascading alerts during complex network events.
Benefits:
- Health insurance
- Relocation program
- Work From Anywhere Culture
- Professional development opportunities
- Welcoming Multicultural Environment
- Social Sustainability Values












