Page Content Audit
Read-only: lists all pages and blog posts, flags empty or short content and missing SEO fields.
shopify-admin-page-content-audit
Purpose
Scans all store pages and blog articles for missing or thin content (short body, missing SEO title/description, empty body). Thin content pages are penalized by search engines and create a poor customer experience. Read-only — no mutations. Complements seo-metadata-audit (which covers products and collections).
Prerequisites
shopify store auth --store --scopes read_content read_contentParameters
| Parameter | Type | Required | Default | Description |
|---|---|---|---|---|
| store | string | yes | — | Store domain (e.g., mystore.myshopify.com) |
| min_body_length | integer | no | 100 | Flag pages with body content shorter than this (characters) |
| include_unpublished | bool | no | false | Also audit unpublished pages and articles |
| format | string | no | human | Output format: human or json |
Safety
> ℹ️ Read-only skill — no mutations are executed. Safe to run at any time.
Workflow Steps
pages — query Inputs: first: 250, select title, body, bodySummary, seo, publishedAt, pagination cursor
Expected output: All pages with content and SEO data
articles — query Inputs: first: 250, select title, body, seo, publishedAt, blog { title }, pagination cursor
Expected output: All blog articles with content data
min_body_length, missing SEO title, missing SEO descriptionGraphQL Operations
# pages:query — validated against api_version 2025-04
query PageContentAudit($after: String) {
pages(first: 250, after: $after) {
edges {
node {
id
title
handle
publishedAt
bodySummary
body
seoTitle: metafield(namespace: "global", key: "title_tag") { value }
seoDescription: metafield(namespace: "global", key: "description_tag") { value }
}
}
pageInfo {
hasNextPage
endCursor
}
}
}
# articles:query — validated against api_version 2025-04
query ArticleContentAudit($after: String) {
articles(first: 250, after: $after) {
edges {
node {
id
title
handle
publishedAt
body
summary
seoTitle: metafield(namespace: "global", key: "title_tag") { value }
seoDescription: metafield(namespace: "global", key: "description_tag") { value }
blog {
id
title
}
}
}
pageInfo {
hasNextPage
endCursor
}
}
}
Session Tracking
Claude MUST emit the following output at each stage. This is mandatory.
On start, emit:
╔══════════════════════════════════════════════╗
║ SKILL: Page Content Audit ║
║ Store: <store domain> ║
║ Started: <YYYY-MM-DD HH:MM UTC> ║
╚══════════════════════════════════════════════╝
After each step, emit:
[N/TOTAL] <QUERY|MUTATION> <OperationName>
→ Params: <brief summary of key inputs>
→ Result: <count or outcome>
On completion, emit:
For format: human (default):
══════════════════════════════════════════════
PAGE CONTENT AUDIT
Pages scanned: <n> | Issues: <n>
Articles scanned: <n> | Issues: <n>
Issues:
Page: "<title>" — empty body
Article: "<title>" — missing SEO description
Output: content_audit_<date>.csv
══════════════════════════════════════════════
For format: json, emit:
{
"skill": "page-content-audit",
"store": "<domain>",
"pages_scanned": 0,
"articles_scanned": 0,
"issues_found": 0,
"output_file": "content_audit_<date>.csv"
}
Output Format
CSV file content_audit_ with columns:
type, id, title, handle, published_at, body_length, has_seo_title, has_seo_description, issue
Error Handling
| Error | Cause | Recovery |
|---|---|---|
THROTTLED | API rate limit exceeded | Wait 2 seconds, retry up to 3 times |
| No pages or articles | Minimal store | Exit with 0 results |
Best Practices
min_body_length: 1500 (characters ≈ ~250 words) as a starting threshold.seo-metadata-audit for a complete SEO audit covering products, collections, pages, and blog articles in one workflow.