Systems Monitoring Engineer
Posted 4ds ago
Employment Information
Report this job
Job expired or something wrong with this job?
Job Description
Systems & Monitoring Engineer architecting monitoring stack for Hedera’s DLT ecosystem. Blending NOC practices with Web3 challenges to ensure uptime and compliance.
Responsibilities:
- Design, deploy, and maintain monitoring solutions (Prometheus, Grafana) for DLT-specific metrics (consensus finality, node health, on-chain activity)
- Integrate and manage PagerDuty for rapid, automated incident response
- Deploy and manage Mirror Nodes and RPC relays using Terraform/Ansible across AWS/GCP
- Serve as the L3 escalation point for complex incidents (“ghost transactions,” API anomalies)
Requirements:
- 4+ years in DevOps, SRE, or NOC roles (with 1–2 years in Web3/Blockchain environments)
- Deep expertise in Prometheus/Grafana, Linux, Docker/Kubernetes, and scripting (Python, Go, Bash)
- Proven experience with cloud platforms (AWS/GCP) and IaC tools (Terraform)
- Strong understanding of Hedera Hashgraph or EVM-based chains, and ability to interpret ledger APIs
- Familiarity with ITIL/ITSM, DORA, SOC2, or ISO 27001 frameworks
Benefits:
- Opportunity to be a part of the world’s leading DLT ecosystem
- Significant career growth potential in a fast growing sector
- Working with colleagues and on projects across the globe
- Open and direct communication, flat structures
- Flexible working hours
- Competitive salary package



















