Senior Software Engineer – Stream Storage, Apache Flink
Posted 1hrs ago
Employment Information
Report this job
Job expired or something wrong with this job?
Job Description
Senior Software Engineer building stream-native storage systems for real-time analytics and streaming pipelines. Contributing to Apache Fluss infrastructure and internal production systems.
Responsibilities:
- About the RoleWe are building the next generation of stream-native storage systems centered around table semantics — not just logs. Our Stream Storage team develops high-performance distributed systems that power real-time analytics, streaming pipelines, and transactional workloads.
- As a Senior Software Engineer, you will contribute to core stream storage infrastructure, Apache Fluss (open-source), and internal production-grade systems built on Fluss.
- This role sits at the intersection of distributed systems, storage engines, streaming systems, and database internals.
- What You’ll DoCore Engineering
- Design and implement distributed storage components for streaming tables
- Work on table lifecycle: schema evolution, ingestion, compaction, retention, indexing
- Enhance the Fluss Lakehouse ecosystem support
- Contribute features and fixes to the Apache Fluss OSS project
- Participate in design discussions and Fluss Improvement Proposals (FIPs)
- Improve Fluss table engine performance and reliability
- Enhance integration with streaming engines (e.g., Flink)
- Engage with the open-source community via PR reviews and discussions
- Internal Platform Development
- Build internal tooling and services on top of Fluss
- Improve observability (metrics, logging, failure diagnostics)
- Optimize production deployments
- Contribute to benchmarking and testing frameworks
Requirements:
- What We’re Looking ForRequired Experience
- 5+ years building distributed systems or storage systems
- Strong experience in Java, Go, Rust, or similar systems languages
- Good understanding of:
- Consensus and replication (Raft, Paxos, etc.)
- Storage engines (LSM trees, B-Trees, WAL, compaction)
- Streaming systems (Kafka, Pulsar, Flink, etc.)
- Transactional systems and consistency models
- Experience debugging production distributed systems
- Strongly Preferred
- Experience with database internals or stream processing engines
- Familiarity with table formats (Iceberg, Hudi, Delta, etc.)
- Contributions to open-source projects (better if ASF)
- Experience with Flink or streaming SQL engines
Benefits:
- Why Join Us
- Work on table-first stream storage, not just message logs
- Direct impact on Apache Fluss open-source evolution
- Solve complex consistency and performance trade-offs
- Influence architectural decisions in a fast-moving space
- Collaborate with engineers passionate about distributed systems


















