This is an AI-first role owning the data acquisition, serving, and reporting infrastructure that powers our product. We aggregate public data at scale, and these pipelines are core to what we deliver. We don’t want someone to hand-operate scrapers and hand-debug every breakage; we want an engineer who builds systems that largely run, diagnose, and repair themselves, using LLMs and agentic workflows to keep everything reliable while continuously raising the level of automation.

You own critical systems end to end with minimal hand-holding and treat AI as a core part of the toolkit. Remote, reporting to our Principal Engineer and team lead.

What you’ll do

- Own and self-heal our fleet of web scrapers: build LLM-assisted resilience so structural, markup, and anti-bot changes are detected, diagnosed, and self-repaired with minimal manual effort. When something does break, agents do the first pass on root cause and propose fixes; you review and approve.
- Keep daily scraping runs stable: monitoring, alerting, retries, and graceful handling of upstream failures so data lands reliably each morning
- Use LLMs for resilient parsing and entity extraction from messy or changing HTML, reducing reliance on brittle selectors
- Own and optimize the serving layer and the ETL/ELT pipelines feeding our BigQuery warehouse, ensuring data is fresh, performant, and reliable for live use
- Build our reporting infrastructure (data models, transformations, and dashboards), plus AI-native layers like natural-language query and LLM-generated narrative insight
- Drive data quality through both rule-based checks and ML/LLM-based anomaly detection, and manage anti-bot challenges (proxies, rate limiting, request patterns) within legal and ethical guidelines
- Build and maintain production-grade MCP servers and agentic workflows that expose our data and tooling to internal and AI consumers
- Partner with the Principal Engineer, analysts, product, and leadership; document systems and best practices for maintainability and human-in-the-loop AI operations

What we’re looking for

- 6+ years in data engineering, including ownership of mission-critical production systems
- Strong Python with deep experience building, maintaining, and debugging scrapers (e.g., Scrapy, Playwright, Selenium, BeautifulSoup)
- AI-first: hands-on experience building LLM-powered and agentic workflows in production (not just calling an API, but designing systems where agents do meaningful work under human supervision), including production-grade MCP servers
- Prompt engineering and LLM evaluation/observability (reasoning about output quality, cost, latency, and failure modes the way you’d reason about uptime), plus fluency with AI-assisted dev tools (e.g., Claude Code, Cursor)
- Proven experience designing reporting/analytics layers: data modeling, transformations (e.g., dbt), and BI tools
- Hands-on with the GCP data stack: BigQuery, Cloud Composer (managed Airflow), Cloud Storage, and Cloud Run or GKE, plus advanced SQL and Docker
- A reliability mindset: proven track record owning systems, triaging failures, and being accountable for uptime; sound judgment on when to use deterministic code versus an LLM
- Understanding of the legal and ethical considerations around web scraping

Nice to have

- Experience training, deploying, and maintaining ML models
- Experience with MotherDuck / DuckDB, ideally serving data to production applications
- Experience scaling or refactoring distributed scraping systems
- Knowledge of Pub/Sub, Dataflow, or other large-scale data processing tools
- Infrastructure-as-code (Terraform)
- Experience setting data strategy or mentoring other engineers

Logistics

- Location: Remote (US-based)
- On-call: This role supports daily scraping and nightly processing runs and a production serving layer; some availability for off-hours incident response may be expected
- Compensation (based on experience): $190-210K base salary + bonus

Grace Hill offers a robust suite of benefits, including health, dental, and vision insurance, 401(k), PTO, life insurance, disability insurance, and more.

Unfortunately, we are not able to offer visa sponsorship or assistance. Applicants must be based in the US and authorized to work in the US at the time of hire.

About us

Grace Hill provides industry-leading SaaS technology solutions designed to make a positive impact in real estate and improve the lives of people where they work and live. Harnessing years of real estate experience and the understanding that people are better together, Grace Hill helps owners and operators increase property performance, reduce operating risk, and grow top talent. More than 500,000 professionals from over 1,700 companies rely on Grace Hill’s talent performance solutions covering policy, training, assessment, survey, and data-driven insights. Visit us at gracehill.com or on LinkedIn.

Our HelloData product solves complex data problems for the multifamily industry, utilizing automated pipelines and AI to provide real-time market insights for the nation's top managers, developers, and investors. Our platform is trusted by the industry’s largest operators to help optimize rents, underwrite operating expenses, and grow NOI with its highly accurate data and user-friendly interface. Since being acquired by Grace Hill in April 2025, HelloData has continued to accelerate at an unbelievable rate, growing ARR by over 300% in 2025 alone and on track for a record-breaking 2026. We combine the agility and innovation of a high-growth startup with the stability and resources of an established enterprise, making us the gold standard in multifamily data analytics.