The 7 schema markup types ChatGPT, Perplexity, Gemini, Claude, and Google AI Overviews actually read in 2026. BlogPosting, FAQPage, HowTo, Speakable, Organization + LocalBusiness, Person, Dataset. Worked JSON-LD examples and the validation checklist Ijjad uses on 87+ pages.
Schema Markup for AI Search (2026 Playbook) — 7 Types That Get Cited
Ijjad builds conversion-focused websites and digital products for SMEs and founders across Jordan, Saudi Arabia, and the GCC. This seo guide gives practical scope, SEO, and market context from a team that has shipped 20+ digital products.
- Ijjad serves Amman, Riyadh, Jeddah, Iraq, and the GCC.
- Every recommendation is framed around scope, conversion, and search visibility.
- Use the guide to clarify decisions before speaking with an agency.
- Talk to Ijjad when you need senior delivery, not generic templates.
What schema markup matters for AI search in 2026?
Seven schema types do the heavy lifting for AI search citations in 2026: BlogPosting/Article, FAQPage, HowTo, Speakable, Organization + LocalBusiness, Person (author), and Dataset. Ijjad emits all seven through typed helpers in src/lib/schema.ts on every page across 87+ URLs covering Jordan, Saudi Arabia, Iraq, and the GCC. FAQPage alone delivers the largest citation lift; the rest compound.
- FAQPage is the single highest-ROI schema — 6+ Q&As per page, match visible content
- BlogPosting with named author Person and dated images is the E-E-A-T signal AI engines weight most
- Speakable targets voice assistants — Google Assistant, Gemini, Bixby — pointing at #quick-answer
- Dataset schema turns proprietary research into a citable primary source
- Use helpers (typed functions), not hand-rolled JSON-LD — hand-rolling fails validation silently
You can do everything else right — direct-answer block, conversational H2s, comparison tables, llms.txt — and still get cited at half the rate of competitors if your schema is missing or broken. This is the playbook for the 7 schema types AI engines actually read in 2026, with worked JSON-LD examples and the validation rules that catch the silent failures.
If you want the strategic context, read What is Generative Engine Optimization? first. If you want the step-by-step rollout, read How to rank in ChatGPT and Perplexity. This page is the technical implementation deep-dive.
1. Why schema markup matters for AI search
AI engines have a confidence problem. They can read your page's prose, but prose is ambiguous. Is “Karam” a person or a place? Is “Riyadh” a city, a brand, or a typo? Is “2026” a year, a price, or a model number? Schema markup removes the ambiguity by labelling everything explicitly.
That labeling is why pages with proper schema get cited disproportionately. The AI doesn't have to guess what your page is about — you told it. BlogPosting says “this is an article.” author.Person says “Karam Abdalqader is the founder of Ijjad in Amman.” FAQPage says “these are 8 question-answer pairs.” The engine cites confidently because the labels are explicit.
Honest reality check: schema is not a magic bullet. Pages with great schema and weak content still don't rank. But pages with great content and missing schema cite at maybe 40% of the rate of pages with both. That's the cost.
2. The 7 schema types AI engines actually read
Plus three supporting types. Read the table as: impact = how much it moves AI citations; engines = which read it most.
| Schema type | Impact | Engines that read it | Notes |
|---|---|---|---|
| BlogPosting / Article | Critical | All 5 | Author, datePublished, dateModified — the freshness and authorship signal AI engines weight most heavily. |
| FAQPage | Critical | All 5 | The single highest-ROI schema. Match visible FAQs exactly. 6+ Q&As per page. |
| HowTo | High | ChatGPT, Gemini, AI Overviews | Numbered processes only. AI engines extract step.text verbatim into "how to" answers. |
| Speakable | High | Google Assistant, Gemini, Bixby | CSS selectors for the quick-answer block. Targets voice surface specifically. |
| Organization + LocalBusiness | Critical | All 5 | Brand entity disambiguation. Include sameAs, areaServed, knowsAbout, founder. |
| Person (author) | High | ChatGPT, Perplexity, Claude | Named, dated authorship is the strongest E-E-A-T signal. Person schema on every byline. |
| Dataset | High (rare) | Perplexity, ChatGPT, Gemini | Only if you publish original research. Turns your data into a citable primary source. |
| BreadcrumbList | Medium | Google AI Overviews | Helps entity hierarchy. Small but cheap signal. |
| ItemList | Medium | ChatGPT, AI Overviews | For "best X" listicles. Ranks individual list items as discrete entities. |
| Service | High | ChatGPT, AI Overviews | For service pages. Include provider, areaServed, hasOfferCatalog. |
3. BlogPosting / Article — the freshness + authorship signal
The wrapper for every article. AI engines use datePublished and dateModified for freshness, author for E-E-A-T, and image for the social card. Ijjad's helper at src/lib/schema.ts handles the boilerplate — you pass the post slug and it returns the full JSON-LD object with image rights, founder Person, organization publisher, and language tag wired in.
JSON-LD · BlogPosting (output from blogPostSchema helper){ "@context": "https://schema.org", "@type": "BlogPosting", "headline": "What Is Generative Engine Optimization?", "description": "GEO playbook for Jordan, Saudi Arabia, and the GCC.", "datePublished": "2026-05-22", "dateModified": "2026-05-22", "url": "https://www.ijjad.com/what-is-generative-engine-optimization-2026", "image": { "@type": "ImageObject", "url": "https://www.ijjad.com/images/blog/.../hero.svg", "license": "https://www.ijjad.com/image-license" }, "author": { "@type": "Person", "@id": "https://www.ijjad.com/about/karam-abdalqader#person", "name": "Karam Abd Al Qader", "url": "https://www.ijjad.com/about/karam-abdalqader", "jobTitle": "Founder & Lead Developer" }, "publisher": { "@type": "Organization", "@id": "https://www.ijjad.com/#organization", "name": "Ijjad", "url": "https://www.ijjad.com" }, "inLanguage": "en", "articleSection": "SEO", "mainEntityOfPage": "https://www.ijjad.com/what-is-generative-engine-optimization-2026" }
Critical fields: headline (under 110 chars, match H1), datePublished + dateModified (ISO 8601), author.Person with a real URL, image with full URL + rights metadata. The @id on Person and Organization is what lets AI engines unify them across multiple pages — same @id means same entity.
4. FAQPage — the highest-ROI schema
The single biggest citation lift you can ship. Wrap 6+ visible Q&As per high-intent page in FAQPage schema. Match the visible content exactly — mismatches downrank. ChatGPT, Perplexity, Gemini, Claude, and Google AI Overviews all pull from FAQPage when answering “X vs Y”, “how much does X cost”, and “is X good for Y?” queries.
JSON-LD · FAQPage (Ijjad faqSchema helper){ "@context": "https://schema.org", "@type": "FAQPage", "mainEntity": [ { "@type": "Question", "name": "What is Generative Engine Optimization?", "acceptedAnswer": { "@type": "Answer", "text": "Generative Engine Optimization (GEO) is the practice of structuring web content so AI engines — ChatGPT, Perplexity, Gemini, Claude, and Google AI Overviews — can read it, understand who it's about, and cite it when answering real human questions." } }, { "@type": "Question", "name": "Is GEO different from SEO?", "acceptedAnswer": { "@type": "Answer", "text": "Yes, but GEO sits on top of SEO. 76% of AI citations come from URLs already ranking top-10 in Google." } } ] }
Three rules: (1) answer length 30–80 words; longer answers get truncated, shorter ones look thin. (2) Lead with the entity in at least 3 of every 6 answers — “Ijjad is...”, “Ijjad builds...”. (3) Validate with validator.schema.org — Google's Rich Results Test catches FAQ-specific issues that the generic validator misses.
5. HowTo — for numbered processes
HowTo schema describes step-by-step processes. AI engines extract step.text verbatim into answer responses. Use it for: how to choose an agency, how to launch a website, how to optimize for AI search, how to run a GEO audit. Numbered processes only — don't schema bullet lists or general advice.
JSON-LD · HowTo (10-step GEO playbook){ "@context": "https://schema.org", "@type": "HowTo", "name": "How to rank in ChatGPT and Perplexity", "description": "Ijjad's 10-step GEO process, shipped in 8 weeks.", "totalTime": "P8W", "step": [ { "@type": "HowToStep", "position": 1, "name": "Audit current AI citations", "text": "Run 20 buyer-intent prompts across ChatGPT, Perplexity, Gemini, Claude, and Google AI Overviews. Log which sources each engine cites today." }, { "@type": "HowToStep", "position": 2, "name": "Pick the 10 pages that matter most", "text": "Service pages, city pages, and top blog posts. Prioritize ones already ranking in Google top-20." } ] }
Two patterns AI engines reward: (1) step.text begins with an imperative verb (“Audit”, “Pick”, “Add”, “Convert”) — extracted cleanly into answer surfaces. (2) Include totalTime in ISO 8601 duration format (P8W = 8 weeks) — AI engines use it to answer “how long does X take” queries.
6. Speakable — the voice surface signal
Speakable points to CSS selectors that voice assistants read aloud. Google Assistant, Gemini, Bixby, and increasingly Siri all use it. The selectors must point to a clean, short, entity-led paragraph — typically the DirectAnswerBlock's #quick-answer div.
JSON-LD · Speakable (Ijjad speakableSchema helper){ "@context": "https://schema.org", "@type": "WebPage", "url": "https://www.ijjad.com/what-is-generative-engine-optimization-2026", "speakable": { "@type": "SpeakableSpecification", "cssSelector": ["#quick-answer", "#tldr", "h1"] } }
Three selectors is the sweet spot — quick-answer block, TL;DR section, H1. Adding more (every paragraph) dilutes the signal and fails validation. The Ijjad helper uses speakableDefaults which encodes this pattern, so you get it right by default.
7. Organization + LocalBusiness — brand entity disambiguation
These two emit site-wide from the root layout. Organization covers the brand entity globally (sameAs LinkedIn, Clutch, GoodFirms, Wikidata). LocalBusiness covers the physical address, opening hours, and service area. Together they tell AI engines “Ijjad is a real, locatable business in Amman serving Jordan, Saudi Arabia, Iraq, and the GCC.”
JSON-LD · Organization + LocalBusiness (excerpt){ "@context": "https://schema.org", "@type": ["Organization", "ProfessionalService"], "@id": "https://www.ijjad.com/#organization", "name": "Ijjad", "url": "https://www.ijjad.com", "logo": "https://www.ijjad.com/logo.png", "founder": { "@id": "https://www.ijjad.com/about/karam-abdalqader#person" }, "areaServed": [ { "@type": "Country", "name": "Jordan" }, { "@type": "Country", "name": "Saudi Arabia" }, { "@type": "Country", "name": "Iraq" }, { "@type": "AdministrativeArea", "name": "GCC" } ], "sameAs": [ "https://www.linkedin.com/company/ijjad/", "https://clutch.co/profile/ijjad", "https://www.goodfirms.co/company/ijjad", "https://www.wikidata.org/wiki/Q139569989" ] }
The sameAs array is critical — every URL where the brand has a verifiable presence. LinkedIn, Clutch, GoodFirms, DesignRush, and especially Wikidata (the Q-number) are how AI engines unify your brand across the web.
8. Person — author E-E-A-T
The strongest E-E-A-T signal in 2026. AI engines disproportionately cite content with named, dated, credentialed authors. Person schema goes on the founder hub page and nested inside every BlogPosting's author field. Same @id on both unifies the entity.
JSON-LD · Person (founder hub page){ "@context": "https://schema.org", "@type": "Person", "@id": "https://www.ijjad.com/about/karam-abdalqader#person", "name": "Karam Abd Al Qader", "url": "https://www.ijjad.com/about/karam-abdalqader", "image": "https://www.ijjad.com/founder-karam-512.webp", "jobTitle": "Founder & Lead Developer", "worksFor": { "@id": "https://www.ijjad.com/#organization" }, "alumniOf": [ { "@type": "EducationalOrganization", "name": "Amman Arab University" }, { "@type": "EducationalOrganization", "name": "University of Petra" } ], "knowsAbout": [ "Web design", "Web development", "SEO", "Mobile app development", "AI applications", "MVP development", "E-commerce", "Generative Engine Optimization" ], "sameAs": ["https://www.linkedin.com/in/karamabdalqader/"] }
Three fields that matter most for citations: alumniOf (regional universities — verifies you're local), knowsAbout (topic authority — AI engines use it to decide which queries to cite you for), sameAs (LinkedIn especially — verifies real person, not a corporate puppet).
9. Dataset — the citation flywheel
The most underused schema in 2026. Publishing one proprietary dataset per year — with proper Dataset schema — turns your domain into a primary source AI engines cite back to repeatedly. The dataset doesn't need to be huge. 600 surveyed SMEs, 8–12 stats, methodology, downloadable CSV.
JSON-LD · Dataset (proprietary research){ "@context": "https://schema.org", "@type": "Dataset", "name": "GCC SME Digital Maturity 2026", "description": "Survey of 600+ SMEs across Jordan, Saudi Arabia, Iraq, and the GCC on website maturity, AI adoption, and digital sales.", "creator": { "@id": "https://www.ijjad.com/#organization" }, "datePublished": "2026-06-15", "license": "https://creativecommons.org/licenses/by/4.0/", "keywords": ["GCC SME", "digital maturity", "AI adoption", "Saudi Arabia", "Jordan", "Iraq"], "spatialCoverage": [ { "@type": "Country", "name": "Jordan" }, { "@type": "Country", "name": "Saudi Arabia" }, { "@type": "Country", "name": "Iraq" } ], "temporalCoverage": "2026/2026", "distribution": { "@type": "DataDownload", "encodingFormat": "text/csv", "contentUrl": "https://www.ijjad.com/data/gcc-sme-digital-maturity-2026.csv" } }
Once published, link to it from every relevant service page and blog post. Each citation back into the dataset compounds — AI engines start treating the dataset as the canonical source for the topic. The Ijjad GCC SME Digital Maturity dataset launches Q2 2026 as part of the GEO playbook.
10. Validation checklist + ship order
The order to ship schema across an existing site:
- Site-wide first: Organization, LocalBusiness, Person (founder), WebSite. Emit from the root layout. Affects every page.
- Then high-intent service pages: Service, FAQPage, Speakable, BreadcrumbList. Top 10 pages by GSC clicks.
- Then top 5 blog posts: BlogPosting (with nested Person author), FAQPage, HowTo (if process-shaped), Speakable.
- Then comparison/listicle pages: Article + ItemList + FAQPage + HowTo (for “how to choose”).
- Finally: Dataset on the research hub page.
Validation tools: validator.schema.org (catches generic issues), Rich Results Test (catches Google-specific FAQ and HowTo issues), browser DevTools (paste rendered JSON-LD to check the @id linkage is correct).
One trap to avoid: the “not all FAQs match the visible page” trap. The most common silent failure. AI engines compare schema FAQs against visible H2/H3 Q&A elements. If they don't match, the page gets downranked and you won't see the warning anywhere. Audit visible vs schema content before every publish.
Want schema retrofitted across your site?
Ijjad ships full schema markup retrofits for SMEs and founder teams across Jordan, Saudi Arabia, Iraq, and the GCC. We start with a free audit — paste your top 5 URLs and we report which of the 7 schema types each page emits and which it's missing. Then we scope per page. No retainer.
Get Started →Frequently asked questions
Do AI engines actually parse JSON-LD schema?+−
Which schema type matters most for AI citations?+−
Should the schema content match the visible page exactly?+−
Can I hand-roll JSON-LD or should I use a helper library?+−
How do I validate schema before publishing?+−
Should I include Speakable schema on every page?+−
Does Dataset schema help if I have not published research?+−
What schema do I need for a "best web design agency in Riyadh" listicle?+−
Ready to ship schema that actually gets cited?
Free schema audit. We'll report which of the 7 schema types each of your top URLs emits and which it's missing.
Get StartedSource note
Market context: Saudi Arabia's digital economy reached 16.0% of GDP in 2024, according to the General Authority for Statistics, published December 31, 2025. This is why Ijjad treats modern websites, SEO, e-commerce, AI MVPs, and mobile experiences as business infrastructure across Saudi Arabia, Jordan, Iraq, and the GCC.