Staff/Principal Platform Engineer – Infrastructure
Posted 1hrs ago
Employment Information
Report this job
Job expired or something wrong with this job?
Job Description
Staff/Principal Platform Engineer managing infrastructure at P2P.org. Focused on building scalable internal developer platform and mentoring engineers.
Responsibilities:
- Own the architecture and evolution of P2P.org's internal developer platform—Kubernetes, monitoring, secrets management, and delivery infrastructure. You are responsible for how these systems scale, interconnect, and evolve across 60+ blockchain networks.
- Design and build scalable, fault-tolerant platform components—including capacity planning, multi-tenancy, networking topology, and storage architecture. Make and document architectural trade-offs clearly.
- Write production code in Go and/or Python — platform tooling, services, operators, and APIs. This is a hands-on engineering role, not just configuration.
- Build self-service infrastructure that teams consume with minimal friction — Terraform modules, Helm charts, reusable pipeline templates, and platform APIs. Champion our versioned module library and automated security propagation via Renovate.
- Scale our Kubernetes platform across bare-metal and cloud, and evolve the observability stack (Victoria Metrics, Prometheus, Grafana, Loki) and delivery platform (ArgoCD, CI/CD pipelines).
- Own the security posture of the platform — secrets management, access controls, identity federation, network policies, and encryption at rest and in transit.
- Maintain SOC 2 Type II compliance across platform infrastructure — implement controls as code, maintain audit readiness, and work with compliance teams to close gaps proactively.
- Enforce container and supply chain security — image scanning, signed artifacts, base image management, and vulnerability remediation. Apply zero-trust principles: least-privilege roles, short-lived credentials, auditable access. Harden Kubernetes workloads through pod security standards, network policies, RBAC, and runtime monitoring.
- Use AI tooling as a core part of how you work and champion its adoption across the infrastructure team and wider engineering organization—in code generation, code review, incident response, documentation, and operational automation.
- Build or integrate AI-assisted tooling into platform workflows where it delivers genuine improvement. Help define how we measure the impact of AI adoption across engineering.
- Lead the technical direction for the infrastructure team—set architectural standards, drive best practices, and reduce technical debt.
- Collaborate cross-functionally with SRE, blockchain, engineering, data, and security teams. Translate their needs into platform capabilities.
- Mentor and grow engineers through code reviews, architectural guidance, design reviews, and knowledge sharing.
- Contribute to incident response and post-incident reviews, driving platform reliability improvements.
Requirements:
- 8+ years of experience in infrastructure, SRE, or platform engineering, with at least 3 years in a senior or staff-level role owning system architecture and technical direction.
- Strong software engineering skills in Go and/or Python — you write production-quality code, not just scripts. You design, build, test, and ship platform services and tooling.
- Proven ability to architect large-scale distributed systems—capacity planning, fault-tolerance patterns, system decomposition, and the ability to articulate trade-offs clearly.
- Deep Kubernetes expertise — managed (GKE, EKS) and bare-metal/hybrid clusters, including networking, storage, multi-tenancy, and cluster lifecycle management.
- Strong Terraform experience — reusable module libraries, state management at scale, and infrastructure lifecycle automation.
- Experience with at least one major cloud provider (GCP, OCI, AWS, or Azure) plus bare-metal infrastructure.
- Hands-on experience with observability (Prometheus, VictoriaMetrics, Grafana, Loki, OpenTelemetry, or equivalent) and delivery tooling (ArgoCD, Helm, CI/CD pipelines).
- Strong Linux systems and networking fundamentals (DNS, load balancing, firewalls, VPN, BGP).
- Infrastructure security experience — container security, supply chain security, network policies, zero-trust access patterns, and compliance (SOC 2 Type II or similar).
- Active use of AI tooling in your engineering workflow, with experience driving adoption across a team.
- A track record of building self-service platform capabilities that other teams actually adopt.
- Strong written and verbal communication — technical proposals, architectural decisions, and cross-team collaboration across distributed teams. English B2 minimum.
Benefits:
- Competitive salary level in $ (we can also pay in crypto)
- Paid vacation and sick leave
- Well-being program
- Mental Health care program
- Compensation for education, including foreign language & professional growth courses
- Equipment & co-working reimbursement program
- Overseas conferences, community immersion

















