Applied AI Engineer
Posted 3ds ago
Employment Information
Report this job
Job expired or something wrong with this job?
Job Description
Senior Applied AI Engineer developing resilient AI systems. Focusing on production-grade stability and optimization of LLM-powered workflows for LATAM.
Responsibilities:
- Design, build, and maintain evaluation pipelines for production AI agent systems
- Instrument multi-agent workflows with tracing and observability tooling
- Build evaluation datasets using real production traffic and interaction logs
- Develop quality scoring and robustness scoring systems for LLM outputs
- Improve reliability of AI systems handling non-deterministic model behavior
- Implement and optimize HITL (Human-in-the-Loop) escalation workflows
- Analyze production failures and drive architectural improvements
- Own the full feedback loop between evaluations, prompt optimization, architecture updates, and re-testing
- Contribute to prompt engineering and model optimization strategies
- Collaborate on multi-agent orchestration and workflow reliability decisions
- Work across backend systems, deployment pipelines, monitoring, and operational sustainment
- Participate in production support and on-call responsibilities
- Maintain high engineering standards around scalability, observability, and maintainability
- Operate independently across development, testing, deployment, and production ownership
Requirements:
- 5+ years of backend or AI engineering experience in production environments
- Strong hands-on experience with production LLM or agentic AI systems
- Proven experience debugging and maintaining non-deterministic AI workflows under live traffic
- Experience building or operating evaluation/evals pipelines for AI systems
- Strong understanding of scorer design, feedback loops, and AI system evaluation methodologies
- Excellent Python backend engineering skills
- Production experience with: FastAPI Django Celery LangGraph or similar orchestration frameworks
- Experience with observability and tracing tools such as: Langfuse Grafana Loki OpenTelemetry or equivalent
- Experience deploying and operating distributed backend systems
- Strong understanding of AI reliability, prompt behavior, and model failure handling
- Ability to independently own projects end-to-end
- Experience working in asynchronous remote teams
- Strong written communication skills in English.
Benefits:
- Fully Remote
- LATAM-friendly collaboration preferred

















