← Back to Blog

Three Signals That Tell AI Engines Your Site Is Abandoned, Unqualified, or Locked Behind a Vendor

Three Signals That Tell AI Engines Your Site Is Abandoned, Unqualified, or Locked Behind a Vendor

A page can pass every traditional SEO check and still get skipped by AI retrieval engines. Three patterns cause this, and they are invisible to classic audit tools: the page runs on technology from 2013, it sits in a high-trust vertical without the trust signals AI engines require, or the fixes it needs are locked behind a vendor template that the site owner cannot touch.

The Mega Analyzer now detects all three.

Abandoned-site fingerprints

When a page loads html5shiv, selectivizr, respond.js, IE conditional comments, jQuery 1.x, Bootstrap 2 or 3, or AngularJS 1.x, every crawler and retrieval engine reads a clear signal: this site has not been maintained. The technology exists to support Internet Explorer 6-8, a browser that Microsoft itself stopped supporting in 2016.

This matters for AI citation because freshness is a retrieval signal. A page that loads polyfills for a browser no one uses tells the ranker that the page content is similarly outdated, even if the body text was updated last week. Ahrefs documented this in their 2024 freshness study, and Profound's 2024 LLM-citation cohort (N=8.6M citations) showed abandoned-tech pages receiving measurably fewer citations than equivalent pages without legacy dependencies.

The fix is straightforward: remove the polyfills. No modern browser needs them. If jQuery is required, upgrade to 3.x. If Bootstrap is required, upgrade to 5.x. If AngularJS 1.x is required, that is a larger conversation, but loading it on a marketing page is never necessary.

The Mega Analyzer flags these as Critical because the signal is binary and the fix is fast.

YMYL vertical classification

Google's Search Quality Rater Guidelines (v2024, section 3.4) define YMYL ("Your Money or Your Life") as content that could significantly impact a person's health, financial stability, or safety. AI engines apply the same principle more aggressively: Google AI Overviews, Perplexity, and ChatGPT all apply tighter source-quality filters to health, legal, and financial queries.

The Mega Analyzer now classifies every scanned page into a vertical:

  • YMYL-Health-Dental requires Dentist or DentalClinic schema, license disclosure, HIPAA notice, Person schema with credential, and a medical disclaimer
  • YMYL-Health (general medical) requires MedicalBusiness or Physician schema, license disclosure, HIPAA notice, Person schema with credential, and a medical disclaimer
  • YMYL-Legal requires LegalService schema, bar number or jurisdiction disclosure, Person schema with credential, and an attorney-advertising disclaimer
  • YMYL-Finance requires FinancialService schema, FINRA/CFP/CPA credential disclosure, and a financial disclaimer
  • Local-Service, Local-Restaurant, Local-RealEstate, Local-Storage, Local-Trades, Local-Hospitality each have their own schema and trust-signal baselines

When a page lands in a YMYL vertical but is missing required trust signals, the Mega Analyzer flags it as Critical. A dental practice page without Dentist schema and a license number is not just missing schema niceties; it is missing the signals that AI engines use to decide whether to cite it in response to health queries.

Vendor platform detection

Not every fix is something the site owner can make. A dental practice running on ProSites cannot change their HTTP headers. A law firm on FindLaw cannot edit their template's schema output. A restaurant on BentoBox cannot modify the CDN caching rules.

The Mega Analyzer now fingerprints 23 known platforms across dental (ProSites, Officite, Sesame Communications, PBHS, Smile Marketing, Dental Branding), legal (FindLaw, Justia, Scorpion, Mockingbird), restaurant (BentoBox, Toast, Popmenu, ChowNow), real estate (Placester, Real Geeks, Sierra Interactive), and general CMS (WordPress, Squarespace, Wix, Shopify, Webflow, GoDaddy Builder).

When a platform is detected, every fix recommendation in the AI prompt gets tagged:

  • [OWNER] means the site owner can apply the fix directly (content edits, adding text to a page)
  • [PLATFORM] means the fix requires CMS or hosting configuration (CDN headers, redirect rules, caching settings)
  • [VENDOR-CONTACT-REQUIRED] means the fix is locked at the template level and requires a vendor support ticket
  • [INFRASTRUCTURE] means the fix is at the DNS or TLS or domain-registrar level

This converts the output from "things to do" into "things you can do right now" versus "things you need to escalate." For a small business owner reviewing a 40-item audit, that distinction is the difference between an actionable report and noise.

Evidence standards

Every recommendation the Mega Analyzer makes now carries an evidence-class tag:

  • [SPEC] means it is documented in the engine's published documentation
  • [OBSERVED] means it comes from a published study with a cited source, date, and sample size
  • [INFERRED] means it is reasoned from architecture but not confirmed
  • [HYPOTHESIS] means it is an educated guess and is not presented as fact

This matters because AI engine behavior is a moving target. "GPTBot does not execute JavaScript" is a [SPEC] claim documented in OpenAI's bots documentation. "97% of LLM citations are non-brand sources" is an [OBSERVED] claim from Profound's 2024 study (N=8.6M citations). "Perplexity weights llms.txt presence" is [INFERRED]. Collapsing these into a single undifferentiated recommendation list would be dishonest.

Using these checks

Run any URL through the Mega Analyzer and the cross-validation layer fires automatically. The results card shows a YMYL classification badge, a platform detection badge (when a known vendor is found), and any abandoned-tech fingerprints detected.

The AI fix prompt emitted by the tool includes all three sections with per-item evidence citations and fix-action tags, so the prompt you paste into ChatGPT or Claude already contains the context needed to produce correctly scoped fix recommendations.

Related reading

Fact-check notes and sources

  • Google Search Quality Rater Guidelines v2024, section 3.4 defines YMYL standards and the tighter evaluation criteria applied to health, finance, and legal content. Source: Google Search Central documentation, published 2024.
  • Ahrefs 2024 freshness study documented the correlation between legacy technology markers and lower crawl frequency. Source: Ahrefs blog, 2024.
  • Profound 2024 LLM-citation cohort (N=8.6M citations) measured citation rates across page-quality segments. Source: Profound research publication, 2024.
  • GPTBot documentation confirming JavaScript non-execution: OpenAI bots documentation, current as of 2025.
  • RFC 6125 section 6 covers server identity verification for TLS certificates, the foundation of the re-fetch matrix's cert validation. Source: IETF, published 2011.
  • schema.org v27.0 (January 2025) is the current schema version for LocalBusiness, Dentist, LegalService, FinancialService, and related types.

This post is informational, not SEO-consulting advice. Mentions of specific platforms (ProSites, FindLaw, BentoBox, etc.) are nominative fair use for identification purposes. No affiliation is implied.

← Back to Blog

Accessibility Options

Text Size
High Contrast
Reduce Motion
Reading Guide
Link Highlighting
Accessibility Statement

J.A. Watte is committed to ensuring digital accessibility for people with disabilities. This site conforms to WCAG 2.1 and 2.2 Level AA guidelines.

Measures Taken

  • Semantic HTML with proper heading hierarchy
  • ARIA labels and roles for interactive components
  • Color contrast ratios meeting WCAG AA (4.5:1)
  • Full keyboard navigation support
  • Skip navigation link
  • Visible focus indicators (3:1 contrast)
  • 44px minimum touch/click targets
  • Dark/light theme with system preference detection
  • Responsive design for all devices
  • Reduced motion support (CSS + toggle)
  • Text size customization (14px–20px)
  • Print stylesheet

Feedback

Contact: jwatte.com/contact

Full Accessibility StatementPrivacy Policy

Last updated: April 2026