A modest proposal: Reformat everything to make documents more palatable to AI
**TL;DR:** A modest proposal: Reformat everything to make documents more palatable to AI
---
What we know
Websites are being redesigned for consumption by AI models, and now a coalition wants to extend the trend to digital documents. The LF AI & Data Foundation, under the Linux Foundation, has formed a working group to steer the development of DocLang, an AI-friendly document format that aims to help enterprises feed their files to AI systems. The DocLang group, founded by IBM, NVIDIA, Red Hat, ABBYY, HumanSignal, and Forgis, contends that existing formats like PDF, Markdown, HTML, and LaTeX are ill-suited for AI document parsing.
In late 2024, IBM developed an open source toolkit called Docling to facilitate AI document parsing, not unlike Microsoft's MarkItDown or the Marker project. Docling provides a way to convert various file formats into structured AI-ready data. DocLang expands upon that foundation with a standard for exchanging structured output across different systems. "DocLang is designed to solve one of the foundational problems in enterprise AI: documents were built for humans, not machines," said Maxime Vermeir, VP of AI Strategy at AI automation biz ABBYY in a statement.
"By introducing a minimal, standardized, and AI-native representation of document structure, layout,
Source: The Register
Context
AI coverage on iByte separates shipped capability from roadmap talk. The practical lens is cost, access, safety, and what changes for builders and everyday users.
Why this matters
Even when details are thin, these stories matter because they signal direction: pricing, policy, platform behavior, or security posture can shift quickly once momentum builds.
What to watch next
Follow whether independent researchers or regulators validate the claims — that is often when the real scope becomes clear.
Practical takeaways
1) Separate the announcement from the shipping date. 2) Compare alternatives if pricing or terms shift. 3) Revisit the story when independent verification lands.
FAQ
**Q: Is everything in this article confirmed?** A: The summary reflects publicly reported information at publication time. Analysis sections are clearly framed as context, not new reporting.
**Q: Will iByte update this page?** A: Yes. As primary sources publish more detail, this article can be refreshed without changing the URL.
Last updated: June 16, 2026.
Additional context: early-cycle stories often look bigger in headlines than in day-to-day impact. The useful move is to identify the smallest set of facts that would change your decision, then wait for those facts to land.
Additional context: early-cycle stories often look bigger in headlines than in day-to-day impact. The useful move is to identify the smallest set of facts that would change your decision, then wait for those facts to land.
