Show HN: Local-first fast CPU image to text for screenshots, PDFs, webpages(github.com)
19 points | by mrkn1 2 days ago
9 comments
- KetoManx64 1 day agoWhat's the performance like compared to tesseract? I don't see tesseract mentioned anywhere in the readme, which is surprising considering that's the number one tool most go to for Image > text OCR.
- lavaman131 1 day agoThis is awesome! Been needing something like this for some research paper diagrams I've been indexing.
- abstract257 2 days agoCurious how it does on multi-page scanned PDFs vs. single screenshots? The ORT vision/decoder split is the part that usually makes or breaks CPU VLM OCR...
- krunck 2 days agoI had to extract the image from a PDF for it to work. Then run it on each page image extracted.
- abstract257 1 day agoThanks
- vivzkestrel 2 days ago- how well do you think this ll work with code? i mean take code screenshots and convert it into actual code for vscode
- mrkn1 2 days agoJust ran
and got thistextsnap "https://i.ytimg.com/vi/LBNDfxjEYlA/maxresdefault.jpg"$('.count').each(function () { $('this').prop('Counter', 0).animate({ Counter: $('this').text() }, { duration: 4000, easing: 'swing', step: 'function (now) { $('this").text(Math.ceil(now)); } }); });
- monosma 2 days agoWhat was the reason for adopting PaddleOCR? Can other OCR models be used as well?
- mrkn1 2 days agoNo reason other than their Q4 model working reasonably well and fast on my CPU laptop. Should work with any ONNX VLM model
- kouru225 2 days agoRoman alphabet only or does this work with other alphabets?
- mrkn1 2 days ago109 languages, including other alphabets.
- garrett2558 2 days agoVery cool, I'm building my own local-first product as well
- mrkn1 2 days agothank you! what is it about?
- BIGFOOT_EXISTS 2 days agoNow this is legit cool, keep up the great work.
- mrkn1 2 days agothank you!
- nabertronic 51 minutes ago[dead]
https://imgur.com/a/i2eQra8