Prompt Engineer – LLM Systems, Evals & Safety
Posted 76 days ago
Job Description
Design high-quality prompts and improve LLM features for webook.com, Saudi Arabia's #1 event ticketing platform, and partner with engineers to integrate safe, effective prompts into production.
Responsibilities:
- Design high-quality prompts, system instructions, and tooling that make our LLM features accurate, safe, and cost-effective.
- Own evaluation, prompt versioning, and continuous improvement.
- Author, refactor, and chain prompts (system/tool/policy) for varied tasks.
- Create offline/online evaluation harnesses (rubrics, golden sets, metrics).
- Build prompt libraries with versioning, A/B testing, and telemetry.
- Reduce hallucinations via verification, constrained decoding, and tool use.
- Implement safety: jailbreak/prompt-injection tests, content policy checks, PII handling.
- Partner with engineers to integrate prompts into production features.
Requirements:
- Demonstrated prompt design across multiple task types and models.
- Experience building eval datasets and automated scoring (e.g., accuracy, faithfulness, utility, cost/latency).
- Familiarity with retrieval-augmented generation concepts and tool/function calling.
- Strong scripting (Python/TypeScript) for data prep, evals, and analysis.
- Clear writing; ability to translate business goals into measurable prompt specs.
Nice-to-Haves:
- Experience with LangChain/LLM orchestration, vector stores, and rerankers.
- Knowledge of safety tooling and red-teaming techniques.
- Experience with experimentation platforms (feature flags, A/B tests) and analytics.