Lead AI Platform Engineer - Remote Portugal at HumanIT Digital Consulting | Torre

Lead AI Platform Engineer - Remote Portugal

You'll shape enterprise AI infrastructure and autonomous agent workflows, driving technical excellence.
Emma highlights
This highlight was written by Emma’s AI. Ask Emma to edit it.
Full-time

Legal agreement: Employment

Provide your expected compensation while applying
location_on
Remote (for Portugal residents)
Match
skeleton-gauges
You have opted out of job matches in .
To undo this, go to the 'Skills and Interests' section of your preferences.
Review preferences
Shared by
Emma of Torre.ai
22 days ago

Requirements and responsibilities


ABOUT THE OPPORTUNITYWe are partnering with an innovative international technology company focused on building scalable AI-native platforms that support high-performance digital products used worldwide. This is an opportunity for a senior engineering leader who wants to shape the future of enterprise AI infrastructure, autonomous agent workflows, and cloud-native distributed systems.As a Lead AI Platform Engineer, you will work closely with executive leadership and engineering teams to define and implement the architecture behind advanced AI solutions. The role combines hands-on technical leadership with platform strategy, reliability engineering, and internal developer experience improvements. You will play a critical role in enabling secure, scalable, and production-ready AI ecosystems while mentoring engineering teams and driving technical excellence across the organization.This is a fully remote position based in Portugal, with occasional national and international travel requirements estimated at 0%–15%.PROJECT & CONTEXTThe project focuses on designing and evolving a modern AI platform ecosystem powered by AWS cloud technologies and agent-based architectures. The engineering environment is highly collaborative, fast-paced, and centered around cloud automation, observability, and AI workload orchestration.You will lead initiatives involving AWS Bedrock AgentCore, AWS Step Functions, multi-tenant AI infrastructure, vector databases, and automated CI/CD pipelines for AI workloads. The platform supports intelligent agent execution, evaluation pipelines, retrieval-augmented generation (RAG), and internal developer platforms (IDP).The stack includes AWS Networking and IAM, Terraform v1.x, GitHub Actions, Kubernetes, Docker, Python 3.x, Bash scripting, JavaScript/TypeScript, OpenSearch, Pinecone, Milvus, Datadog, CloudWatch, LangSmith, Vault, Artifactory, Backstage, and policy enforcement frameworks such as OPA and Cedar.English is required for daily communication with international stakeholders and distributed engineering teams.WHAT WE'RE LOOKING FOR (Required)Strong experience designing and operating cloud-native platforms on AWSProven expertise with AWS Bedrock AgentCore and AWS Step Functions in production environmentsExperience building custom Agent/Tool Gateways and AI orchestration workflowsAdvanced knowledge of Infrastructure as Code using Terraform v1.xHands-on experience with CI/CD automation using GitHub ActionsStrong containerization and orchestration experience with Docker and KubernetesExperience building scalable microservices architecturesSolid understanding of AI observability, monitoring, and tracing tools such as Datadog, CloudWatch, or LangSmithExperience with vector databases including OpenSearch, Pinecone, or MilvusStrong understanding of RAG architectures and AI knowledge retrieval strategiesExperience implementing secure multi-tenant environments and IAM policiesFamiliarity with policy and governance frameworks such as OPA or CedarStrong scripting and automation skills using Python 3.x, Bash, or JavaScript/TypeScriptExperience collaborating with senior stakeholders and translating technical concepts into business valueExcellent communication skills in English (written and spoken)NICE TO HAVE (Preferred)Experience with Internal Developer Platforms (IDP) and developer enablement initiativesFamiliarity with Backstage for platform engineering and developer experience improvementsExperience implementing AI model evaluation frameworks (Evals)Knowledge of non-deterministic AI agent behavior analysis and reliability engineering practicesPrevious experience mentoring engineering teams or acting as a technical leadExposure to enterprise security tooling such as Vault and ArtifactoryExperience working in distributed international teamsPortuguese language skills are considered a plusExperience supporting large-scale AI-native or autonomous agent ecosystems
Optionally, you can add more information later (benefits, pre-screening questions, etc.)
check_circle

Payment confirmed

A member of the Torre team will contact you shortly

In the meantime, continue adding information to your job opening.