Rules & Criteria
Below are the high-level checks our engine runs (binary/near-binary rules).
Crawl access
- HTTP status & connectivity
- robots.txt allowances for GPTBot, Google-Extended, generic agents
- JS-only rendering detection
llms.txt
- Presence and parseability
- Conflicts with robots.txt
- Ambiguous or invalid rules flagged
Content interpretability
- H1 presence and heading hierarchy
- Text-to-markup dominance
- Primary topic detectability
Entity signals
- Organization / Product / Person detection
- Schema.org presence
- Page-level clarity
Intent and Trust
- Title vs H1 alignment
- About/contact presence
- Author info for content-heavy pages