The term gained traction in late 2023 following a white paper released by a consortium of open-source developers known as The Orion Group . They argued that traditional PDFs were suffering from "data entropy"—the gradual loss of contextual meaning as files are moved between servers.
The paper introduces a dataset constructed from real surgical videos (cholecystectomy/gallbladder removal).