Prompt Engineer (LLM Systems, Evals & Safety) at WEbook | Torre

Prompt Engineer (LLM Systems, Evals & Safety)

You'll shape AI accuracy and safety, transforming user experiences on Saudi's leading event platform.
Emma highlights
This highlight was written by Emma’s AI. Ask Emma to edit it.
Full-time

Legal agreement: Employment

Provide your expected compensation while applying
location_on
Remote (for Jordan residents)
Match
skeleton-gauges
You have opted out of job matches in .
To undo this, go to the 'Skills and Interests' section of your preferences.
Review preferences
Shared by
Emma of Torre.ai
about 1 month ago

Requirements and responsibilities


Do you want to love what you do at work? Do you want to make a difference, an impact, and transform peoples lives? Do you want to work with a team that believes in disrupting the normal, boring, and average?If yes, then this is the job you are looking for , webook.com is Saudi’s #1 event ticketing and experience booking platform in terms of technology, features, agility, revenue serving some of the largest mega events in the Kingdom surpassing over 2 billion in sales.  Role Overview Design high-quality prompts, system instructions, and tooling that make our LLM features accurate, safe, and cost-effective. You’ll own evaluation, prompt versioning, and continuous improvement.Key Responsibilities:Author, refactor, and chain prompts (system/tool/policy) for varied tasks.Create offline/online evaluation harnesses (rubrics, golden sets, metrics).Build prompt libraries with versioning, A/B testing, and telemetry.Reduce hallucinations via verification, constrained decoding, and tool use.Implement safety: jailbreak/prompt-injection tests, content policy checks, PII handling.Partner with engineers to integrate prompts into production features.RequirementsDemonstrated prompt design across multiple task types and models.Experience building eval datasets and automated scoring (e.g., accuracy, faithfulness, utility, cost/latency).Familiarity with retrieval-augmented generation concepts and tool/function calling.Strong scripting (Python/TypeScript) for data prep, evals, and analysis.Clear writing; ability to translate business goals into measurable prompt specs.Nice-to-HavesExperience with LangChain/LLM orchestration, vector stores, and rerankers.Knowledge of safety tooling and red-teaming techniques.Experiment platforms (feature flags, A/B tests), analytics.
Optionally, you can add more information later (benefits, pre-screening questions, etc.)
check_circle

Payment confirmed

A member of the Torre team will contact you shortly

In the meantime, continue adding information to your job opening.