SEO Essentials Checklist.
Complete these critical checks to ensure your pages are accessible, fast, and optimized for search engine bots and AI clouds.
A Technical & Crawlability
Provide a complete list of URLs to Google so crawler bots can discover every page. Make sure the sitemap path is mentioned in robots.txt.
How to configure sitemap in robots.txt
Sitemap: https://runseoaudit.com/sitemap.xml Ensure you are not blocking search engines from index-critical assets (like main scripts) while blocking private paths (like dashboard endpoints).
Robots.txt standard template
User-agent: *
Allow: /
Disallow: /admin/ Encrypt data in transit. Google marks plain HTTP pages as insecure and downgrades their indexing weight.
Avoid soft 404 errors (where a missing page returns a 200 OK code). A custom 404 helps search engines de-index missing pages while keeping users on-site.
Resolve duplicate homepage indexing by redirecting all HTTP and non-WWW visits to their secure canonical equivalent.
Redirect rules template (Nginx)
server {
listen 80;
server_name www.runseoaudit.com runseoaudit.com;
return 301 https://runseoaudit.com$request_uri;
} URLs should be short, descriptive, and entirely lowercase. Use hyphens (-) instead of underscores (_) as search engines interpret hyphens as space separators.
Avoid pages with zero inbound internal links. Link every public page from your main navigation, sitemap, or blog index so bots can pass link equity.
B On-Page & Semantic HTML
Specify unique, descriptive titles under 60 characters and description snippets under 160 characters to capture user clicks.
Astro / HTML header metadata structure
<title>Descriptive Title Under 60 Characters</title>
<meta name="description" content="Description copy under 160 characters summarizing the page context." /> Avoid duplicate content issues by telling search engines which URL version is the primary authoritative source.
Authoritative canonical link
<link rel="canonical" href="https://runseoaudit.com/seo-checklist" /> Ensure heading structures descend logically (H1 → H2 → H3). A single H1 tag declares the main page context to crawlers.
Alt text describes image content to screen readers and search crawlers, improving accessibility and visibility in image search engines.
Image alt syntax
<img src="chart.png" alt="Website PageSpeed performance audit scoring card" /> Avoid generic links like "click here". Use keyword-rich anchor text describing the destination, and ensure you have no empty '#' link targets.
Ensure production pages do not accidentally contain a "noindex" instruction that blocks Google from displaying your pages in search indexes.
C UX & Performance (Core Web Vitals)
Prevent browser layout zooming. Viewport scale instructs the browser to adapt layouts to mobile screen widths automatically.
Standard mobile viewport tag
<meta name="viewport" content="width=device-width, initial-scale=1.0" /> Ensure images reserve layout space before they download. Setting explicit dimensions prevents Cumulative Layout Shift.
Layout-safe image code
<img src="hero.jpg" width="800" height="450" alt="Hero representation" class="w-full h-auto" /> Configure touch interactive targets (links, icons, buttons) with padding to meet mobile accessibility standards and prevent errant clicks.
Use modern WebP or AVIF image formats which provide superior compression compared to JPEG/PNG, reducing page payload weight and speed.
Instruct browsers to display a system fallback font while the custom web font is downloading to prevent invisible text blocks on slow connections.
CSS font-display syntax
@font-face {
font-family: 'Geist';
src: url('/fonts/geist.woff2') format('woff2');
font-display: swap;
} Load third-party analytics or widgets asynchronously or defer them so that the browser can render HTML structure first.
D AI & Structured Schema Visibility
Provide machine-readable semantic definitions of page content. Crucial for rich search indexing cards and LLM database mappings.
Structured schema template
<script type="application/ld+json">
{
"@context": "https://schema.org",
"@type": "WebSite",
"name": "Run SEO Audit",
"url": "https://runseoaudit.com"
}
</script> Instruct AI crawler bots (e.g. OpenAI's GPTBot, Anthropic's Claude-Web) in robots.txt so they are allowed to scan your content for AI answer generation.
Robots.txt AI cloud permissions
User-agent: GPTBot
Allow: /
User-agent: ClaudeBot
Allow: / Provide custom share titles, site descriptions, and preview image thumbnails that display beautifully when URLs are shared in chats or social feeds.
Standard OpenGraph tags
<meta property="og:title" content="Page Title" />
<meta property="og:description" content="Description" />
<meta property="og:image" content="https://site.com/image.jpg" /> Organize layout structure using semantic elements like <header>, <nav>, <main>, <article>, and <footer> to declare content hierarchy contextually.
AI visual parsing models scan site images to answer detailed user searches. Providing detailed alternative text on schematics/charts helps models map data models.
Help search engines serve your website in the correct regional search pages and assist automated translation crawlers by configuring a primary language code.
Language tag format
<html lang="en"> Prevent search engines or LLM crawlers from extracting dynamic or private content (like shopping cart elements) into search previews using the snippet attribute blocker.
No-snippet tag template
<div data-nosnippet>
This content will not appear in search results listings.
</div> E GEO & AEO (Generative & Answer Engine Optimization)
Structure key information using explicit Question-and-Answer formats. AI models search for exact conversational query patterns followed by direct, concise answers (under 80 words) to output as instant search summaries.
Include conceptual synonyms and natural related topics rather than stuffing exact keywords. Generative models construct vector space representations where semantic proximity and entity clarity determine rankings.
Add unique research figures, survey results, and expert opinions. Answer engines like Perplexity prioritize citing original data-rich resources to validate their summary statements.
Provide machine-readable microdata schemas describing questions and answers. Explicit schema nesting helps LLMs map data nodes accurately without requiring complex regex scraping.
Structured FAQPage JSON-LD snippet
<script type="application/ld+json">
{
"@context": "https://schema.org",
"@type": "FAQPage",
"mainEntity": [{
"@type": "Question",
"name": "How to optimize for GEO?",
"acceptedAnswer": {
"@type": "Answer",
"text": "Optimize structure using Q&A formats and clear semantic HTML."
}
}]
}
</script> Include direct key-takeaway summaries at the top of content blocks. Retrieval-Augmented Generation (RAG) pipelines segment text into index chunks, making clear summaries highly retrievable.
Link to authoritative external entities (e.g., Wikipedia pages, Wikidata nodes, or official documentation) when discussing core topics. This maps your site elements directly into the AI's known Knowledge Graphs.