Part of the AEO / GEO / AI-search audit tool stack. See the pillar post for the full catalog of sibling audits and where this one fits in the lineup.
Google shows video thumbnails in the main SERP when:
- Your page has
VideoObjectJSON-LD with the required properties - The thumbnail is large enough (minimum 120×90, recommended 600×338)
- The video is embedded via an
<iframe>or<video>with discoverable source - The video has a transcript the crawler can extract
- The page itself is about the same topic as the video
Missing any of those drops you out of the video SERP. A page with a YouTube embed and no schema is invisible to this layer — Google ranks the YouTube listing instead of your page.
The Video Schema + Transcript Audit tests all five conditions in a single pass.
What it checks
Video schema presence
VideoObjectJSON-LD in the page- Required:
name,description,thumbnailUrl,uploadDate - Recommended:
duration,contentUrl/embedUrl,publisher,interactionStatistic - For live streams:
BroadcastEventnested inside
Schema quality
- Thumbnail URL is
https://and resolves durationin ISO-8601 format (PT4M13S)uploadDatein ISO-8601descriptionat least 50 chars
Video element discovery
- Detects embedded
<iframe src>pointing at youtube.com, vimeo.com, wistia.com, etc. - Detects
<video>with<source>children - Flags missing
aria-labelon embeds (accessibility) - Flags missing
titleattribute
Transcript presence
- Searches the page for a
<track kind="captions">on any<video> - Searches for a visible transcript block (heuristics: "Transcript:" heading, long text near video)
- Searches for schema.org
transcriptproperty
Caption / accessibility
- Flags videos with no captions
- Flags autoplay (user hostility + accessibility)
- Flags missing poster image
Why the transcript matters
YouTube auto-generates captions. Google can read them — but only inside YouTube. A transcript on your page is the version Google reads for your page's topic signal. That text is what ties your page to the video's subject and qualifies you for the video SERP as the page URL, not just the YouTube URL.
A 10-minute video with a visible transcript is also how you:
- Rank for long-tail queries inside the transcript
- Qualify for featured snippets extracted from the transcript text
- Serve accessibility (WCAG 1.2.1 — audio-only / video-only alternatives)
How to use it
- Go to /tools/video-schema-transcript-audit/
- Paste a URL with an embedded video
- Tool fetches, parses, and reports
- Read the score and each check
- Copy the fix prompt — it includes a valid VideoObject JSON-LD template you can paste into your CMS
Related reading
- Schema Test-Case Generator — valid JSON-LD starter templates
- Rich Results Eligibility Audit
- Featured-Snippet Extractability
Fact-check notes and sources
- Video structured data requirements: Google Search Central — Video.
- schema.org VideoObject reference: schema.org/VideoObject.
- WCAG 1.2.1 Audio-only / Video-only: W3C WCAG 2.1 — 1.2.1.
- YouTube auto-caption accuracy: Google Accessibility Blog — captions.
This post is informational, not SEO-consulting or accessibility-consulting advice. Mentions of Google, YouTube, Vimeo, Wistia, and similar products are nominative fair use. No affiliation is implied.