Senior Software Engineer – Stream Storage, Apache Flink

Job Description

We are hiring a Senior Software Engineer to build stream-native storage systems for real-time analytics and streaming pipelines. The role contributes to Apache Fluss infrastructure and to internal production systems.

Responsibilities:

  • About the Role
  • We are building the next generation of stream-native storage systems centered on table semantics, not just logs. Our Stream Storage team develops high-performance distributed systems that power real-time analytics, streaming pipelines, and transactional workloads.
  • As a Senior Software Engineer, you will contribute to core stream storage infrastructure, Apache Fluss (open-source), and internal production-grade systems built on Fluss.
  • This role sits at the intersection of distributed systems, storage engines, streaming systems, and database internals.
  • What You’ll Do
  • Core Engineering
  • Design and implement distributed storage components for streaming tables
  • Work on table lifecycle: schema evolution, ingestion, compaction, retention, indexing
  • Enhance Fluss support for the lakehouse ecosystem
  • Contribute features and fixes to the Apache Fluss OSS project
  • Participate in design discussions and Fluss Improvement Proposals (FIPs)
  • Improve Fluss table engine performance and reliability
  • Enhance integration with streaming engines (e.g., Flink)
  • Engage with the open-source community via PR reviews and discussions
  • Internal Platform Development
  • Build internal tooling and services on top of Fluss
  • Improve observability (metrics, logging, failure diagnostics)
  • Optimize production deployments
  • Contribute to benchmarking and testing frameworks

Requirements:

  • What We’re Looking For
  • Required Experience
  • 5+ years building distributed systems or storage systems
  • Strong experience in Java, Go, Rust, or similar systems languages
  • Good understanding of:
      • Consensus and replication (Raft, Paxos, etc.)
      • Storage engines (LSM trees, B-Trees, WAL, compaction)
      • Streaming systems (Kafka, Pulsar, Flink, etc.)
      • Transactional systems and consistency models
  • Experience debugging production distributed systems
  • Strongly Preferred
  • Experience with database internals or stream processing engines
  • Familiarity with table formats (Iceberg, Hudi, Delta, etc.)
  • Contributions to open-source projects (ideally Apache Software Foundation projects)
  • Experience with Flink or streaming SQL engines

Benefits:

  • Why Join Us
  • Work on table-first stream storage, not just message logs
  • Direct impact on Apache Fluss open-source evolution
  • Solve complex consistency and performance trade-offs
  • Influence architectural decisions in a fast-moving space
  • Collaborate with engineers passionate about distributed systems