Staff/Principal Platform Engineer – Infrastructure

Posted 1hrs ago

Employment Information

Education
Salary
Experience
Job Type

Report this job

Job expired or something wrong with this job?

Job Description

Staff/Principal Platform Engineer managing infrastructure at P2P.org. Focused on building scalable internal developer platform and mentoring engineers.

Responsibilities:

  • Own the architecture and evolution of P2P.org's internal developer platform—Kubernetes, monitoring, secrets management, and delivery infrastructure. You are responsible for how these systems scale, interconnect, and evolve across 60+ blockchain networks.
  • Design and build scalable, fault-tolerant platform components—including capacity planning, multi-tenancy, networking topology, and storage architecture. Make and document architectural trade-offs clearly.
  • Write production code in Go and/or Python — platform tooling, services, operators, and APIs. This is a hands-on engineering role, not just configuration.
  • Build self-service infrastructure that teams consume with minimal friction — Terraform modules, Helm charts, reusable pipeline templates, and platform APIs. Champion our versioned module library and automated security propagation via Renovate.
  • Scale our Kubernetes platform across bare-metal and cloud, and evolve the observability stack (Victoria Metrics, Prometheus, Grafana, Loki) and delivery platform (ArgoCD, CI/CD pipelines).
  • Own the security posture of the platform — secrets management, access controls, identity federation, network policies, and encryption at rest and in transit.
  • Maintain SOC 2 Type II compliance across platform infrastructure — implement controls as code, maintain audit readiness, and work with compliance teams to close gaps proactively.
  • Enforce container and supply chain security — image scanning, signed artifacts, base image management, and vulnerability remediation. Apply zero-trust principles: least-privilege roles, short-lived credentials, auditable access. Harden Kubernetes workloads through pod security standards, network policies, RBAC, and runtime monitoring.
  • Use AI tooling as a core part of how you work and champion its adoption across the infrastructure team and wider engineering organization—in code generation, code review, incident response, documentation, and operational automation.
  • Build or integrate AI-assisted tooling into platform workflows where it delivers genuine improvement. Help define how we measure the impact of AI adoption across engineering.
  • Lead the technical direction for the infrastructure team—set architectural standards, drive best practices, and reduce technical debt.
  • Collaborate cross-functionally with SRE, blockchain, engineering, data, and security teams. Translate their needs into platform capabilities.
  • Mentor and grow engineers through code reviews, architectural guidance, design reviews, and knowledge sharing.
  • Contribute to incident response and post-incident reviews, driving platform reliability improvements.

Requirements:

  • 8+ years of experience in infrastructure, SRE, or platform engineering, with at least 3 years in a senior or staff-level role owning system architecture and technical direction.
  • Strong software engineering skills in Go and/or Python — you write production-quality code, not just scripts. You design, build, test, and ship platform services and tooling.
  • Proven ability to architect large-scale distributed systems—capacity planning, fault-tolerance patterns, system decomposition, and the ability to articulate trade-offs clearly.
  • Deep Kubernetes expertise — managed (GKE, EKS) and bare-metal/hybrid clusters, including networking, storage, multi-tenancy, and cluster lifecycle management.
  • Strong Terraform experience — reusable module libraries, state management at scale, and infrastructure lifecycle automation.
  • Experience with at least one major cloud provider (GCP, OCI, AWS, or Azure) plus bare-metal infrastructure.
  • Hands-on experience with observability (Prometheus, VictoriaMetrics, Grafana, Loki, OpenTelemetry, or equivalent) and delivery tooling (ArgoCD, Helm, CI/CD pipelines).
  • Strong Linux systems and networking fundamentals (DNS, load balancing, firewalls, VPN, BGP).
  • Infrastructure security experience — container security, supply chain security, network policies, zero-trust access patterns, and compliance (SOC 2 Type II or similar).
  • Active use of AI tooling in your engineering workflow, with experience driving adoption across a team.
  • A track record of building self-service platform capabilities that other teams actually adopt.
  • Strong written and verbal communication — technical proposals, architectural decisions, and cross-team collaboration across distributed teams. English B2 minimum.

Benefits:

  • Competitive salary level in $ (we can also pay in crypto)
  • Paid vacation and sick leave
  • Well-being program
  • Mental Health care program
  • Compensation for education, including foreign language & professional growth courses
  • Equipment & co-working reimbursement program
  • Overseas conferences, community immersion