Why use RAG instead of fine-tuning for a real-estate AI agent?

Listings change daily — fine-tuning on inventory means re-training every time prices shift. We index listings into a vector store and retrieve fresh data on every query, so the model stays general and the data stays current. Tradeoff: slightly higher per-query cost in exchange for zero re-training overhead and instant freshness.

How do you prevent hallucinations on price and availability?

Price and availability are the two facts buyers will absolutely sue you over. Before the agent quotes a number, it cross-checks against the listings index. If retrieval misses, it says 'let me get back to you' instead of guessing. Tradeoff: a small lift in 'I don't know' responses — which is exactly what you want when accuracy matters more than vibes.

LAB BUILD · Open-source demo

An AI real estate broker that qualifies leads at 3 a.m., and we open-sourced the whole thing. Fork-friendly.

A working reference implementation we built to show what we ship. It searches real-estate listings, qualifies buyers, and books property tours. Not a client engagement — a lab build to show, not tell.

Read the code ← yep, we'll customize it

MIT-licensed · Fork or self-host · No vendor lock-in

hexextract/propertysearch

AI agent that qualifies real-estate leads, searches listings via RAG, and books tours. Built with OpenAI + LangChain.

TypeScript · OpenAI · LangChain · RAG · MIT

Open source · MIT License · View on GitHub

go on, peek at the code

See it work

Type what you want. The agent does the searching.

One sentence becomes a filtered search, refined by chat or by hand. Click any listing to see the detail panel and book a tour without leaving the app.

Ask in plain English

Describe the home you want — beds, neighbourhood, budget, schools — the way you'd describe it to a friend.

Filters set themselves

The agent parses the conversation into structured filters. You can still flip any of them on or off by hand.
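A minimal sketch of what "structured filters" could look like, assuming the model hands back a partial filter object on each turn. The `SearchFilters` shape and the `mergeFilters` helper are hypothetical names for illustration, not the repo's actual schema.

```typescript
// Hypothetical filter shape; field names are assumptions, not the repo's schema.
interface SearchFilters {
  minBeds?: number;
  maxPrice?: number;
  city?: string;
  nearSchools?: boolean;
}

// Later chat turns override earlier ones. Manual edits are applied last,
// so a filter the user set by hand is never overwritten by the agent.
function mergeFilters(
  fromChat: SearchFilters[],
  manual: SearchFilters = {},
): SearchFilters {
  const merged: SearchFilters = {};
  for (const turn of fromChat) Object.assign(merged, turn);
  return { ...merged, ...manual };
}

// Two chat turns plus one filter toggled by hand.
const filters = mergeFilters(
  [{ minBeds: 3, city: "Austin" }, { maxPrice: 500_000 }],
  { nearSchools: true },
);
```

Applying the manual edits last is one way to keep the user in charge; a real build might also track which filters came from chat so the UI can label them.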

Listings stay in sync

The grid refreshes whenever you refine — by typing more, swapping a filter, or changing the sort.

Open any listing

Click a card to see the full detail panel with photos, agent notes, and a yes/no on every requirement you stated.

Book a tour

Pick a time slot, confirm. The booking goes straight into the broker's calendar with the conversation attached.

What it does

Four capabilities, working together.

Natural-language property search

Buyers describe what they want — '3-bed under $500K in Austin, walkable to schools' — and the agent returns ranked, explained matches.

Lead qualification

Smart questioning surfaces budget, timeline, and intent. Hot leads get prioritized; lukewarm ones get nurtured.
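As a rough illustration of the triage step, here is a rule-based scorer. Every field, weight, and threshold in it is an assumption made up for this sketch; the demo's actual qualification logic may look nothing like it.

```typescript
// All fields, weights, and thresholds below are illustrative assumptions.
interface LeadSignals {
  hasBudget: boolean;            // buyer stated a price range
  preApproved: boolean;          // financing is already lined up
  timelineMonths: number | null; // null = buyer gave no timeline
}

type LeadTier = "hot" | "nurture";

function scoreLead(s: LeadSignals): { score: number; tier: LeadTier } {
  let score = 0;
  if (s.hasBudget) score += 30;
  if (s.preApproved) score += 30;
  if (s.timelineMonths !== null) {
    // Shorter timelines signal stronger intent.
    score += s.timelineMonths <= 3 ? 40 : s.timelineMonths <= 12 ? 20 : 5;
  }
  const tier: LeadTier = score >= 70 ? "hot" : "nurture";
  return { score, tier };
}

const readyBuyer = scoreLead({ hasBudget: true, preApproved: true, timelineMonths: 2 });
const browsing = scoreLead({ hasBudget: true, preApproved: false, timelineMonths: null });
```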

Tour booking & calendar handoff

When a buyer is ready, the agent books a viewing through the broker's calendar — no human intervention required.

Guardrails on price & availability

Hallucinated prices kill trust. The agent cross-checks claims against the listings index before answering.

The problem we wanted to solve

Real-estate leads cool faster than your morning coffee.

Conversion rates on real-estate leads drop sharply within an hour if no one responds. Brokerages can't afford 24/7 staffing. Off-the-shelf chatbots feel scripted and obvious. And LLMs left unchecked will cheerfully invent listings that don't exist — which torches trust the moment a buyer walks into the wrong house.

We wanted to see how close we could get to a real broker's judgement, with AI doing the heavy lifting and humans only stepping in when it matters.

How it works

Three decisions that did most of the work.

The interesting part of any AI build is the tradeoffs you make. Here are ours.

RAG over fine-tuning

Listings change daily. Fine-tuning on inventory means re-training every time prices shift. We index listings into a vector store and retrieve fresh data on every query — model stays general, data stays current.

Tradeoff: Slightly higher per-query cost in exchange for zero re-training overhead and instant freshness.
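To make the "data stays current" point concrete, here is a toy in-memory stand-in for a vector store. It assumes embeddings are computed elsewhere (the three-number vectors are invented), and `ListingIndex` is a hypothetical name rather than anything from the repo or from LangChain.

```typescript
// In-memory stand-in for a vector store. Embeddings are assumed to be
// computed elsewhere; the tiny vectors below are made up for illustration.
interface Listing {
  id: string;
  price: number;
  summary: string;
}

interface IndexEntry {
  listing: Listing;
  vector: number[];
}

function cosine(a: number[], b: number[]): number {
  let dot = 0;
  let na = 0;
  let nb = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    na += a[i] * a[i];
    nb += b[i] * b[i];
  }
  return na === 0 || nb === 0 ? 0 : dot / Math.sqrt(na * nb);
}

class ListingIndex {
  private entries = new Map<string, IndexEntry>();

  // Upsert: a price change overwrites the entry. No model re-training.
  upsert(listing: Listing, vector: number[]): void {
    this.entries.set(listing.id, { listing, vector });
  }

  // Return the k listings most similar to the query vector.
  search(query: number[], k: number): Listing[] {
    return Array.from(this.entries.values())
      .map((e) => ({ listing: e.listing, score: cosine(query, e.vector) }))
      .sort((x, y) => y.score - x.score)
      .slice(0, k)
      .map((r) => r.listing);
  }
}

const index = new ListingIndex();
index.upsert({ id: "A1", price: 480_000, summary: "3-bed near schools" }, [1, 0, 0]);
index.upsert({ id: "B2", price: 910_000, summary: "Downtown loft" }, [0, 1, 0]);
// The price drops: one upsert, and the next query sees the new number.
index.upsert({ id: "A1", price: 465_000, summary: "3-bed near schools" }, [1, 0, 0]);
const topMatches = index.search([0.9, 0.1, 0], 1);
```

The per-query cost in the tradeoff shows up in `search`: every question pays for a retrieval pass, where a fine-tuned model would answer from its weights.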

Function calling for booking & lead capture

Free-text answers are great for discovery, terrible for actions. We expose calendar, CRM, and listing-detail APIs as functions the agent calls deterministically — so 'book me a tour Friday at 4' becomes a real calendar event, not a hallucinated promise.

Tradeoff: More upfront wiring, but downstream actions are auditable and reliable.
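A sketch of that wiring, under stated assumptions: the tool schema follows the JSON-Schema style used for OpenAI function calling, but the `book_tour` name, its fields, the `dispatch` helper, and the in-memory calendar are all hypothetical stand-ins for whatever the repo actually does.

```typescript
// Tool schema in the JSON-Schema style used for OpenAI function calling.
// The tool name, fields, and calendar stub are hypothetical.
const bookTourTool = {
  type: "function",
  function: {
    name: "book_tour",
    description: "Book a property viewing on the broker's calendar.",
    parameters: {
      type: "object",
      properties: {
        listingId: { type: "string" },
        startIso: { type: "string", description: "ISO 8601 start time" },
      },
      required: ["listingId", "startIso"],
    },
  },
} as const;

interface ToolCall {
  name: string;
  arguments: string; // the model sends arguments as a JSON string
}

type ToolResult = { ok: true; eventId: string } | { ok: false; error: string };

// Stand-in for a real calendar API client.
const calendar: { listingId: string; start: Date }[] = [];

// Validate the model's arguments before anything touches the calendar.
function dispatch(call: ToolCall): ToolResult {
  if (call.name !== bookTourTool.function.name) {
    return { ok: false, error: `unknown tool: ${call.name}` };
  }
  let args: unknown;
  try {
    args = JSON.parse(call.arguments);
  } catch {
    return { ok: false, error: "arguments are not valid JSON" };
  }
  const { listingId, startIso } = (args ?? {}) as Record<string, unknown>;
  if (typeof listingId !== "string" || typeof startIso !== "string") {
    return { ok: false, error: "missing listingId or startIso" };
  }
  const start = new Date(startIso);
  if (Number.isNaN(start.getTime())) {
    return { ok: false, error: "startIso is not a valid date" };
  }
  calendar.push({ listingId, start });
  return { ok: true, eventId: `evt_${calendar.length}` };
}

const booked = dispatch({
  name: "book_tour",
  arguments: '{"listingId":"A1","startIso":"2025-01-10T16:00:00Z"}',
});
const rejected = dispatch({ name: "book_tour", arguments: '{"listingId":"A1"}' });
```

Every call either produces a calendar entry or a typed error, which is what makes the action auditable: there is no path where the agent claims a booking that was never written.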

Hallucination guardrails on price & availability

Price and availability are the two facts buyers will absolutely sue you over. Before the agent quotes a number, it cross-checks against the listings index. If retrieval misses, it says 'let me get back to you' instead of guessing.

Tradeoff: A small lift in 'I don't know' responses — which is exactly what you want when accuracy matters more than vibes.
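One way such a guardrail could be shaped, as a sketch only: the model's draft figure is treated as a claim, and the reply is built from the index, never from the draft. `guardPriceClaim`, the types, and the fallback wording are assumptions for illustration.

```typescript
// Hypothetical guardrail; names and the fallback wording are assumptions.
interface ListingFact {
  id: string;
  price: number;
  available: boolean;
}

interface PriceClaim {
  listingId: string;
  price: number; // the figure the model drafted
}

const FALLBACK = "Let me get back to you on that.";

function guardPriceClaim(
  claim: PriceClaim,
  lookup: (id: string) => ListingFact | undefined,
): { reply: string; corrected: boolean } {
  const fact = lookup(claim.listingId);
  // Retrieval miss: refuse rather than guess.
  if (!fact) return { reply: FALLBACK, corrected: false };
  if (!fact.available) {
    return { reply: `Listing ${fact.id} is no longer available.`, corrected: false };
  }
  // Quote the indexed price, and flag when the draft disagreed with it.
  return {
    reply: `Listing ${fact.id} is listed at $${fact.price}.`,
    corrected: claim.price !== fact.price,
  };
}

const facts = new Map<string, ListingFact>([
  ["A1", { id: "A1", price: 465000, available: true }],
]);
const lookup = (id: string) => facts.get(id);

const miss = guardPriceClaim({ listingId: "Z9", price: 400000 }, lookup);
const fixed = guardPriceClaim({ listingId: "A1", price: 480000 }, lookup);
```

Logging the `corrected` flag would give you a running count of how often the model's draft drifted from the index, which is a useful number to watch.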

architecture.txt

  Buyer message
        │
        ▼
  ┌──────────────┐    ┌──────────────────────┐
  │   LLM Agent  │ ←─ │  System prompt &     │
  │   (OpenAI)   │    │  conversation state  │
  └──────┬───────┘    └──────────────────────┘
         │
    ┌────┴────┬────────────┬───────────────┐
    ▼         ▼            ▼               ▼
 Search    Book tour    Lead score     Fallback
 listings  (Calendar)   (CRM write)    (human)
    │
    ▼
 ┌─────────────────────────┐
 │  Vector index (RAG)     │
 │  — listings + metadata  │
 │  — refreshed on update  │
 └─────────────────────────┘

The stack

Boring, proven pieces. On purpose.

We picked components your in-house team can take over without a PhD.

OpenAI GPT

Reasoning + function calling

LangChain

Agent orchestration

Vector store

Listings retrieval (RAG)

TypeScript + React

End-to-end

Calendar API

Tour booking

Listings API

Real-estate ingestion

For your business

What we'd build for you, specifically.

The demo is a starting point. Here's how we'd shape it for an actual brokerage.

1–2 weeks

Plug in your MLS / IDX feed

Replace the demo's synthetic listings with your live inventory. We handle the ingestion pipeline, embedding refresh, and de-duplication.
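For a feel of what de-duplication and embedding refresh involve, here is a small sketch. The `FeedRecord` fields are invented (real MLS/IDX feeds differ), and keying on a normalized address is one assumption among several workable ones.

```typescript
// Hypothetical feed record; real MLS/IDX fields will differ.
interface FeedRecord {
  mlsId: string;
  address: string;
  price: number;
  description: string;
}

// Normalize addresses so "12 Oak St." and "12 oak st" collapse to one key.
const addressKey = (a: string): string =>
  a.toLowerCase().replace(/[^a-z0-9]+/g, " ").trim();

// Keep the last record seen per address (assumes the feed is ordered oldest first).
function dedupe(records: FeedRecord[]): FeedRecord[] {
  const byAddress = new Map<string, FeedRecord>();
  for (const r of records) byAddress.set(addressKey(r.address), r);
  return Array.from(byAddress.values());
}

// The fields that feed the embedding; if unchanged, the stored vector is reused.
const fingerprint = (r: FeedRecord): string => `${r.price}|${r.description}`;

function needsReembedding(
  records: FeedRecord[],
  seen: Map<string, string>, // mlsId -> fingerprint from the previous run
): FeedRecord[] {
  return records.filter((r) => seen.get(r.mlsId) !== fingerprint(r));
}

const feed: FeedRecord[] = [
  { mlsId: "M1", address: "12 Oak St.", price: 480000, description: "Cozy 3-bed" },
  { mlsId: "M1", address: "12 oak st", price: 465000, description: "Cozy 3-bed" },
  { mlsId: "M2", address: "9 Elm Ave", price: 300000, description: "Condo" },
];
const seen = new Map([
  ["M1", "480000|Cozy 3-bed"],
  ["M2", "300000|Condo"],
]);

const unique = dedupe(feed);
const stale = needsReembedding(unique, seen);
```

Only the listing whose price changed lands in `stale`, so a daily refresh re-embeds the delta instead of the whole inventory.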

~1 week

Train on your brand voice

Your scripts, your tone, your qualifying questions. The agent stops sounding like ChatGPT and starts sounding like your top broker.

~1 week

Wire it into your CRM

HubSpot, Salesforce, Follow Up Boss — qualified leads flow into your pipeline with the conversation context attached.

Open source

The whole thing — on GitHub.

Read it. Fork it. Self-host it. Submit a PR. We open-source our lab work because the code is the proof — and engineers don't trust pretty case studies, they trust commits.

Open repo · Star on GitHub

hexextract / propertysearch

MIT License

Things this demo doesn't do

What we're not claiming.

Demo work has limits. Calling them out builds more trust than glossing over them.

Demo runs on synthetic listings — connect your live MLS/IDX feed to make it production-ready.
No production auth, rate-limiting, or audit logging — those get added per engagement, not bolted onto the public reference.
We haven't published latency or cost benchmarks yet. Happy to run them on a candidate dataset during scoping.
Voice/phone integration isn't included — the demo is text-only. We can add Twilio + speech-to-text if you need it.

Want one of these running on your listings?

30-min call with an actual engineer. No slides, no pitch deck — just whether we can ship what you need, and what it'd cost.

Or just fork it

(we don't bite. usually.)