arXiv cs.CV
Page image classifier fine-tuned on century-spanning archives of scanned documents for further content-specific processing
arXiv:2606.07558v2
Announce Type: replace
Abstract: Purpose: Digitization projects in the humanities produce vast, heterogeneous archives of historical documents, making manual sorting impractical at scale. This work addresses the need for an automated system to classify scanned page images based on visual content type