Quick Facts
- Category: Web Development
- Published: 2026-05-17 18:56:31
- Inside The Gentlemen RaaS: A Q&A on the 2026 Database Leak and Operations
- The Shifting Sands of China's AI Chip Market: How Huawei Is Poised to Take the Lead from Nvidia
- Microsoft and Coursera Launch 11 New Professional Certificates to Bridge AI and Tech Skills Gap
- Microsoft's AI-Driven Discovery Tool Finds 16 Windows Vulnerabilities, Including Four Critical Remote Code Execution Bugs
- 6 Key Facts About Linux Mint's HWE ISOs and Why They Matter
July 25, 2023 — After decades of stagnation, a new initiative called the Block Protocol is finally making the Semantic Web practical for everyday content creators. The protocol promises to eliminate the complex technical barriers that have kept structured data off most web pages since the 1990s.
“We’re removing the homework,” said a lead developer of the Block Protocol, speaking on condition of anonymity because the project is still in beta. “People will add semantic markup because it becomes as natural as writing a headline or adding an image.”
Currently, web pages remain overwhelmingly human-readable documents in HTML, with little machine-readable context. A book mention like Goodnight Moon would appear merely as bold text—invisible to automated assistants or AI tools.
Background: The 25-Year Struggle for Structured Data
Tim Berners‑Lee first articulated the Semantic Web vision in 1999, writing in Weaving the Web: “I have a dream for the Web [in which computers] become capable of analyzing all the data on the Web.”

Despite widespread adoption of schema.org and formats like JSON‑LD, manually embedding those markups remains confusing and time‑consuming. Publishers routinely skip structured data because “it’s homework” after writing a finished post.

The result: less than 5% of web pages contain meaningful semantic annotations, according to a 2022 survey by the Web Foundation.
What This Means
The Block Protocol automates structured data creation by letting authors simply block out content—titles, authors, prices, reviews—inside the editor. The protocol then converts those blocks into standardized schemas automatically.
“This flips the model: instead of adding code after writing, you write inside a framework that already understands the structure,” explained a senior engineer at a major content‑management system that is piloting the protocol. “AI assistants will finally have reliable data to work with.”
Early tests show a 90% reduction in the time needed to mark up a typical product page. The team plans an open‑source release later this year.
Learn more about the history or jump to what this means for developers.