← All postsGEO Strategy

How to Measure GEO: The Metrics That Tell You If Your Healthcare Practice Is Visible in AI Search

Traditional SEO has rankings and traffic. GEO needs a different measurement framework entirely. Here are the four layers that actually tell you where you stand.

By Luis Castro, Founder, The Purpose Pilot

4

layers in the measurement framework: visibility, traffic, engagement, and business impact

20–30

target queries to run manually across AI engines each month

4–8 wks

typical time to see measurable citation improvement after GEO changes

88%

of Google AI Mode citations do NOT appear in the organic top 10 results

The measurement problem nobody talks about

You have been reading about GEO for months. Maybe you have already made changes to your content, added clinical bylines, built out FAQ sections, or cleaned up your directory listings. Now your ownership group or CEO asks the obvious question: is it working?

With traditional SEO, that question has clear answers. You pull your keyword rankings, your organic traffic trend, your click-through rate from Search Console. The numbers tell a story even when the story is incomplete.

GEO does not have that yet. There is no universal ranking position for an AI-generated answer. There is no GEO equivalent of Search Console that shows impressions and clicks from every AI engine in one dashboard. The field is 18 months old and the measurement infrastructure is still being built.

That gap is exactly why most facilities either skip measuring GEO entirely or default to proxy metrics that do not tell them what they actually need to know. Neither approach is acceptable when you are trying to justify the investment to a leadership team.

This post lays out the measurement framework that actually works right now, organized into four layers, with specific instructions for healthcare practices — including treatment centers, mental health clinics, dermatology practices, plastic surgery groups, dental offices, and physical rehabilitation centers — on what to track, how to track it, and how to interpret what you find.

Why GEO measurement is structurally different from SEO measurement

Before getting into the framework, it helps to understand why the old metrics do not transfer.

In traditional SEO, you optimize a page to rank for a keyword, and the ranking is a visible, stable, verifiable output. Position 3 for “dermatologist accepting new patients in Austin” or “IOP treatment Phoenix” is an observable fact that anyone can verify by running the search.

In AI search, there is no position. There is presence or absence in a synthesized answer, and that answer varies by query phrasing, by AI engine, by the user's location and search history, and by how recently the AI's knowledge has been updated. The same practice can be cited in one phrasing of a query and invisible in another phrasing of what is essentially the same question.

This creates a measurement challenge with three specific dimensions.

First, citation is not the same as ranking. A page can rank in the top three on Google and never appear in a Google AI Overview. Research from Moz found that 88% of Google AI Mode citations do not appear in the organic top 10 SERP results. Optimizing for traditional rankings does not automatically transfer to AI citation rates. The two channels require different strategies and different measurement.

Second, AI referral traffic is largely invisible in standard analytics. Most AI engines do not pass referral data the way a traditional search click does. When a user copies information from a ChatGPT answer and visits your site, that session typically registers as direct traffic, not AI-referred. Standard GA4 without additional configuration misses the majority of AI-influenced visits.

Third, the lag between optimization and measurable output is longer than most practices expect. Open-world engines like Perplexity and Google AI Overviews pull live data, so structural changes can show results in weeks. Closed-world models like some versions of ChatGPT rely on training data snapshots that update on longer cycles. You may see Perplexity citation improvement in four weeks and ChatGPT improvement three months later from the same underlying content change.

Layer 1: Citation visibility

This is the core GEO metric. How often does your facility appear when someone queries an AI engine for your priority services and location?

What to measure

  • Citation frequency: out of every ten times you run a target query, how many times does your facility appear in the answer?
  • Citation position: are you the primary source cited, a secondary reference, or mentioned in passing?
  • Citation accuracy: when AI engines reference your facility, is the information correct? Right services, right location, right contact information?
  • Competitive share of voice: when your facility does not appear, who does?

How to track it manually

Build a query bank of 20 to 30 prompts that mirror the questions your prospective patients actually ask. The right queries depend on your specialty, but the pattern is the same across all healthcare verticals:

  • Treatment center: “PHP programs in Phoenix Arizona” / “Treatment centers that accept Blue Cross in Phoenix”
  • Mental health clinic: “therapist accepting new patients in [city]” / “anxiety treatment clinic near [neighborhood]”
  • Dermatology: “best dermatologist for acne in [city]” / “dermatology clinic that accepts [insurance] in [city]”
  • Plastic surgery: “board-certified plastic surgeon rhinoplasty [city]” / “mommy makeover consultation [city]”
  • Dental: “dentist accepting new patients in [neighborhood]” / “cosmetic dentist [city] invisalign”

Run your query bank across Google AI Mode, ChatGPT, and Perplexity at the beginning of each month. Record whether your practice appears, where in the answer it appears, and what competitors appear instead. A simple spreadsheet with engine, query, present or absent, and notes is enough to start.

The honest limitation

Manual tracking at this scale is time-consuming and has sampling error. Automated tools exist and are improving: Semrush's AI Toolkit, Otterly.ai, and SE Ranking all offer citation monitoring across multiple engines. Altitude's GEO citation tracker handles this automatically for client facilities, running your priority query bank against major AI engines monthly and surfacing results in the dashboard. For facilities without a platform, manual tracking with a documented query bank is the baseline.

What good looks like

A healthcare practice that has invested 90 days in GEO fundamentals — named clinical authorship, FAQPage schema, consistent directory listings (SAMHSA for treatment centers, Psychology Today for mental health clinics, Healthgrades for medical specialties, and Google Business Profile for all), and authoritative source citation — should expect to appear in 30 to 50% of relevant local service queries across Perplexity and Google AI Mode. ChatGPT citation rates tend to be lower for local service queries. Treat 10 to 20% on ChatGPT as a reasonable benchmark at 90 days regardless of specialty.

Layer 2: AI referral traffic

Citation visibility tells you whether AI engines mention you. AI referral traffic tells you whether those mentions generate actual site visits and, ultimately, admissions inquiries.

What to measure

  • Sessions from identifiable AI referral sources
  • Pages AI visitors land on
  • Conversion rate of AI-referred sessions compared to other organic traffic

How to track it in GA4

Configure a custom channel group in GA4 to capture AI referral traffic. The major AI engines use identifiable referral strings when they do pass referral data: traffic from ChatGPT typically shows as a referral from chat.openai.com, Perplexity from perplexity.ai, and Google AI Overviews from google.com with specific query parameters. Create a segment or channel group that captures these sources together as “AI Search.”

For server-level tracking, check your logs for known AI user agent strings. ChatGPT uses “ChatGPT-User,” Perplexity uses “PerplexityBot,” and Claude uses “ClaudeBot.” Cloudflare's AI Crawl Metrics dashboard shows this data automatically if you use Cloudflare for your site. This is crawler activity rather than user visits, but high crawler frequency from AI bots is a leading indicator of citation activity.

The honest limitation

A significant share of AI-influenced visits will never appear as AI referral traffic in any analytics tool. When a user reads a ChatGPT answer, remembers your facility name, and searches for you directly, that becomes branded search traffic or direct traffic. The AI was involved in the discovery, but the analytics do not capture it. This is why branded search volume lift is an important secondary signal covered in Layer 4.

Layer 3: Content-level citation signals

Beyond whether you are being cited, it matters which pages are being cited and whether those pages are structured to convert the traffic they receive from AI referrals.

What to measure

  • Which specific pages appear most frequently as AI citations for your priority queries
  • Whether those pages have the structural elements that maximize citation probability: H1, FAQ section with FAQPage schema, named clinical author, external authoritative citations, internal links, last-reviewed date
  • Content freshness: pages not updated in more than six months show a measurable drop in AI citation frequency
  • Schema coverage: what percentage of your service and condition pages have FAQPage schema deployed

How to track it

When you run your monthly manual citation check, note the specific URL that appears in each citation, not just whether your facility appears. Over time, this reveals which pages are doing the citation work and which priority service pages are invisible despite the facility being known.

Cross-reference that list with your content audit. The pages that appear most in AI citations are almost always the ones with the strongest combination of depth, named authorship, FAQ structure, and external citations. The pages that never appear typically have one or more of those elements missing.

The diagnostic question

For each of your top five service pages, run this check: does the page have a named clinical author with credentials? Does it have a FAQ section with at least five substantive Q and A pairs? Does it cite at least one authoritative source in the body content — SAMHSA or NIDA for addiction treatment, NIMH or APA for mental health, AAD or ASDS for dermatology, ASPS for plastic surgery, ADA for dental? Has it been reviewed and updated in the last six months? If the answer to any of those is no, that page has a known, fixable reason it is not being cited.

Layer 4: Business impact signals

This is the layer that matters to ownership groups and CEOs. The first three layers measure the mechanism. Layer 4 measures whether the mechanism is producing outcomes.

Branded search volume

When AI engines cite your facility, users who did not previously know your name learn it. They subsequently search for you by name. A rising branded search trend in Google Search Console, measured as impressions and clicks for your facility name as a query, is one of the clearest downstream signals that AI citation activity is compounding. It is also the signal most likely to impress a leadership team because it is directly visible in tools they may already use.

New patient inquiry source attribution

Ask every inbound inquiry — phone, form, or front-desk intake — where they first heard of your practice. Add “AI search / ChatGPT / Perplexity” as an explicit option. Whether you run a treatment center, a dermatology clinic, a dental practice, or a mental health group, most practices are already hearing this answer from new patients but not capturing it systematically. Starting to capture it now establishes a baseline for measuring growth.

Direct traffic trend

As AI citation compounds, direct traffic to your site tends to rise because more users are arriving via name recognition developed through AI-generated answers. A rising direct traffic trend alongside rising branded search volume is a strong combined signal of growing AI visibility.

Referral source diversity

Track how many distinct external domains are referring traffic to your site month over month. Growing referral diversity is a proxy for the earned media activity that drives GEO, and it compounds both traditional SEO authority and AI entity recognition simultaneously.

Putting it together: a monthly reporting cadence

A practical monthly GEO measurement routine for a treatment center looks like this:

Week 1 of each month

Run the full query bank (20 to 30 queries) across Google AI Mode, ChatGPT, and Perplexity. Record citation frequency, citation position, and competitor appearances. Note which pages were cited.

Week 2

Pull GA4 AI referral traffic for the prior month. Compare to the previous month and to 90 days prior. Note which pages AI-referred visitors landed on and whether they converted to inquiry.

Week 3

Pull Google Search Console data for branded search queries. Compare branded impression and click volume to the prior month and to 90 days prior. Note the trend direction.

Week 4

Review the content audit findings from Altitude or your own audit process. Identify which service or condition pages failed the clinical author, FAQ, external citation, or freshness check. Assign fixes for the coming month.

Report to leadership monthly with three numbers: citation frequency rate (from Layer 1), branded search trend (from Layer 4), and AI-attributed new patient inquiry count (from Layer 4). Three numbers that tell a clear directional story are more useful than a dashboard full of metrics nobody knows how to interpret.

How Altitude automates this for behavioral health clients

Altitude was built by The Purpose Pilot specifically for healthcare practices that need GEO measurement built around their clinical context — not a generic dashboard designed for SaaS or ecommerce. The platform serves treatment centers, mental health clinics, dermatology practices, plastic surgery groups, dental offices, and physical rehabilitation centers.

The platform runs your practice's priority query bank against Google AI Mode, Perplexity, and ChatGPT monthly and surfaces citation frequency, competitive share of voice, and which pages are being cited. It tracks NAP consistency across the authoritative directories that matter for your specialty — SAMHSA for treatment centers, Psychology Today and NIMH for mental health, Healthgrades and Google Business Profile for all — and flags discrepancies that suppress citation probability. It audits every content page against the citation-readiness signals in Layer 3, surfacing fix-now recommendations in severity order.

The result is the three-number monthly report described above, generated automatically, without manual query running or spreadsheet tracking.

Frequently asked questions

How long before GEO measurement shows meaningful data?

For open-world engines like Perplexity and Google AI Overviews, you should see directional citation movement within four to eight weeks of implementing GEO fundamentals. For ChatGPT, which updates on longer training cycles, expect three to six months before consistent citation improvements appear. Branded search lift typically becomes visible at 60 to 90 days if citation activity is compounding. Build a 90-day baseline before drawing conclusions about any individual change.

Do I need a paid tool to measure GEO?

Not to start. The manual query bank approach described in Layer 1 requires only a spreadsheet and 30 minutes per month. GA4 AI referral tracking requires configuration but no additional cost. Google Search Console branded search data is free. For facilities that want automated citation monitoring across multiple engines, paid tools like Otterly.ai (starting around $29/month) or Altitude's built-in citation tracker are available. The manual baseline is worth establishing before investing in tooling so you have something to benchmark against.

Why doesn't standard GA4 show AI referral traffic accurately?

Most AI-generated answers do not result in a direct click from the AI interface to your site. Users read the AI answer, form an impression, and later navigate to your site directly or via a branded search. That journey does not produce a referral data point. What GA4 captures is only the fraction of users who click a cited link directly from the AI interface. Server log tracking of AI bot user agents is more reliable for understanding crawler activity, and branded search trend is the most reliable proxy for overall AI influence on user behavior.

What is a realistic citation frequency benchmark for a healthcare practice?

Benchmarks vary by market competitiveness, query type, and how long GEO investment has been active. As a starting point: a practice with GEO fundamentals in place for 90 days should expect to appear in 30 to 50% of relevant local service queries on Perplexity and Google AI Mode, and 10 to 20% on ChatGPT. This applies whether you're a treatment center, mental health clinic, dermatology practice, dental office, or plastic surgery group — the benchmarks are consistent across healthcare specialties. Practices in less competitive markets or with strong earned media footprints will see higher rates.

How do I explain GEO measurement to ownership or a board that only knows traditional SEO metrics?

Use branded search volume as your primary business-language metric. It is visible in tools they recognize, it has a clear directional trend, and it has an intuitive interpretation: more people searching for us by name means more people know we exist, and AI search is a primary driver of that awareness. Pair it with AI-attributed new patient inquiry count from your intake process and you have a story that connects AI visibility to the patient pipeline without requiring the audience to understand the technical measurement framework.

What is the one metric to start tracking if we are not tracking anything yet?

Run your 20 most important queries in Google AI Mode today. Note whether your practice appears in each answer. Do it again in 30 days. The delta between those two snapshots is your baseline GEO trend and the most useful single piece of information you can have right now.

What's next

See your facility's AI citation score

Altitude runs your priority query bank across major AI engines monthly, tracks NAP consistency across the behavioral health sources that matter, and surfaces the highest-leverage fixes in priority order. Free audit on every plan.

Trusted sources referenced in this post

Altitude is the SEO and GEO platform built specifically for healthcare practices — including treatment centers, mental health clinics, dermatology practices, dental offices, and plastic surgery groups — by The Purpose Pilot. Get your facility's AI citation score and automated monthly GEO report at altitude.thepurposepilot.com.