Word count, headings, paragraphs (structural dimensions):
Word count: Word count tiers are informed by Zhang et al. (2026, preprint, not yet peer-reviewed), which found that high-influence pages were on average 11.44× longer than low-influence pages. The specific tier thresholds are the tool's internal calibration.
Headings: Heading count tiers are informed by Zhang et al. (2026, preprint, not yet peer-reviewed), which found that high-influence pages had 12.50× more headings. The specific tier thresholds are the tool's internal calibration.
Paragraphs: Paragraph count tiers are informed by Zhang et al. (2026, preprint, not yet peer-reviewed), which found that high-influence pages had 5.69× more paragraphs. The specific tier thresholds are the tool's internal calibration.
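The three structural dimensions above could be counted and mapped to tiers roughly as follows. This is a minimal sketch, not the tool's implementation: it assumes Markdown-style input with `#` headings and blank-line-separated paragraphs, and the names `structural_counts`, `tier`, and `EXAMPLE_WORD_THRESHOLDS` and the threshold values are hypothetical placeholders, since the real thresholds are internal to the tool.

```python
import re

HEADING_RE = re.compile(r"^#{1,6}\s+\S")


def structural_counts(text: str) -> dict:
    """Count words, headings, and paragraphs in a Markdown-style page.

    Assumptions (illustrative only): headings are '#'-prefixed lines, and
    paragraphs are blank-line-separated blocks that do not start with a heading.
    """
    headings = [ln for ln in text.splitlines() if HEADING_RE.match(ln)]
    blocks = [b.strip() for b in re.split(r"\n\s*\n", text) if b.strip()]
    paragraphs = [b for b in blocks if not HEADING_RE.match(b)]
    words = re.findall(r"\b\w+\b", text)  # heading words are included
    return {
        "words": len(words),
        "headings": len(headings),
        "paragraphs": len(paragraphs),
    }


def tier(value: int, thresholds: tuple) -> int:
    """Return the number of thresholds the value meets (0 is the lowest tier)."""
    return sum(value >= t for t in thresholds)


# Hypothetical thresholds for illustration; not the tool's actual calibration.
EXAMPLE_WORD_THRESHOLDS = (300, 1000, 2500)
```

With these placeholder thresholds, `tier(1200, EXAMPLE_WORD_THRESHOLDS)` falls in tier 2. The same `tier` helper could be reused for heading and paragraph counts with their own threshold tuples.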
Definition sentences:
Definition sentence detection is based on Zhang et al. (2026, preprint, not yet peer-reviewed), which found that pages with high definitional content showed approximately 57% higher absorption. This is a page-level finding applied as a document-level signal: an informed inference, not a directly measured sentence-level effect.
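One way such a signal might be computed is as the share of sentences containing a definitional cue phrase. The sketch below is illustrative only: the cue list in `DEFINITION_CUES`, the naive sentence splitter, and the function names are assumptions, since the tool's actual detection rules are not described here.

```python
import re

# Hypothetical cue phrases; the tool's real detection rules may differ.
DEFINITION_CUES = re.compile(
    r"\b(?:is|are)\s+(?:a|an|the)\b"
    r"|\b(?:refers to|is defined as|are defined as|means)\b",
    re.IGNORECASE,
)


def split_sentences(text: str) -> list:
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def definition_sentence_ratio(text: str) -> float:
    """Fraction of sentences that contain at least one definitional cue."""
    sentences = split_sentences(text)
    if not sentences:
        return 0.0
    hits = sum(1 for s in sentences if DEFINITION_CUES.search(s))
    return hits / len(sentences)
```

Reporting a document-level ratio rather than per-sentence scores keeps the signal at the granularity the underlying finding supports.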
Comparative sentences:
Comparative sentence detection is based on Zhang et al. (2026, preprint, not yet peer-reviewed), which found that comparative content was associated with approximately 55% higher absorption. This is a page-level finding applied as a document-level signal.
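Comparative sentences could be flagged with a similar cue-based check. The cue words below (`than`, `versus`, `unlike`, and so on) and the function name are hypothetical choices for illustration, not the tool's documented rules; the function expects text that has already been split into sentences.

```python
import re

# Hypothetical comparative cues; word boundaries avoid matching "thank" etc.
COMPARATIVE_CUES = re.compile(
    r"\b(?:than|versus|vs|compared\s+(?:to|with)|unlike|whereas)\b",
    re.IGNORECASE,
)


def count_comparative_sentences(sentences: list) -> int:
    """Count sentences that contain at least one comparative cue."""
    return sum(1 for s in sentences if COMPARATIVE_CUES.search(s))
```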
Statistics presence:
Statistics presence is associated with approximately 61% higher absorption in Zhang et al. (2026, preprint, not yet peer-reviewed). A separate peer-reviewed study (Aggarwal et al. 2024) reported an effect of approximately +31% on citation selection, which is a related but distinct phenomenon.
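A presence check for statistics might look for percentages and multipliers in the text. This is a rough sketch under stated assumptions: the patterns in `STAT_PATTERN` and the name `has_statistics` are hypothetical, and what the tool actually counts as a statistic is not specified here.

```python
import re

# Hypothetical patterns for "a statistic"; bare numbers such as years are ignored.
STAT_PATTERN = re.compile(
    r"\d+(?:\.\d+)?\s?%"                # percentages such as 61% or 5.5 %
    r"|\d+(?:\.\d+)?\s?(?:x|×)(?!\w)"   # multipliers such as 11.44× or 3x
    r"|\d+(?:\.\d+)?\s+percent\b",      # spelled-out "percent"
    re.IGNORECASE,
)


def has_statistics(text: str) -> bool:
    """Return True if the text contains at least one statistic-like token."""
    return STAT_PATTERN.search(text) is not None
```

A boolean matches the "presence" framing of the signal; a count of matches would be a natural extension if density mattered.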