r/LocalLLaMA

How Baidu’s newly released Unlimited-OCR transcribes dozens of pages in one forward pass

https://i.redd.it/zjduf8zns79h1.gif

Baidu released Unlimited-OCR 2 days ago, and they claim it can transcribe dozens of pages in one forward pass. I read the research paper and decided to make a post (link if anyone's interested).

Problem they are solving

The problem it targets is basically well known. End-to-end OCR mode