Principal AI/ML Researcher – Reasoning, Planning, and Decision-making Systems
Posted 8hrs ago
Employment Information
Report this job
Job expired or something wrong with this job?
Job Description
Principal AI/ML Researcher developing cognitive AI systems for practical decision-making at Airbnb. Leading research and innovation in reasoning and planning frameworks.
Responsibilities:
- Drive foundational and applied research in reasoning engines, planning architectures, and decision-making frameworks at scale.
- Advance techniques in LLM/LRM post-training, reinforcement learning–based decisioning, and knowledge-integrated agents.
- Design methods for plan induction, value estimation, and contingency modeling within intelligent agents.
- Explore and validate protocols for distributed reasoning and joint planning among cooperative agents in multi-agent systems.
- Architect RPD systems that integrate post-trained LLMs/LRMs, graph-structured memory (e.g., KGs), and RL-driven controllers.
- Design recursive task planners, search-based or policy-based reasoners, and belief-state trackers that can interoperate with large model substrates.
- Build and evolve stateful, dynamic models that combine supervised learning with online/offline reinforcement, simulation-based rollouts, and symbol grounding.
- Set direction for planning/reasoning infrastructure within the AI/ML platform strategy.
Requirements:
- Masters or equivalent in Computer Science, AI, Cognitive Science, or related fields.
- Recent published work or patents in AI, Cognitive Science, or related fields.
- 15+ years in AI/ML, including post-training architectures and production-scale reasoning systems.
- Advanced coding proficiency in Java, Python, C++, or similar, with experience in ML/RL frameworks (e.g., PyTorch, Ray, JAX, RLlib) at scale.
- Proven experience integrating LLMs/LRMs with Knowledge Graphs or structured world models.
- Deep understanding of Reinforcement Learning and its application to decisioning and planning.
- Fluency in hybrid model architectures: connectionist-symbolic fusion, retrieval-based agents, or goal-directed transformers.
- Experience working on multi-agent coordination, distributed RL, or cooperative inference systems.
Benefits:
- Bonus
- Equity
- Employee Travel Credits











