Hi all,

I'm managing a project that needs to OCR documents in real time. We expect tens of users to be scanning and OCRing documents simultaneously, possibly 100 or more at a time. Each document has about 50 pages, and OCR for a whole document must finish in under 20 seconds. The documents will be scanned at 300 dpi. Since we are part of a large public administration, we can afford to buy very powerful servers to run Tesseract.
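To put rough numbers on these requirements, here is a minimal sizing sketch. It takes the worst case where every user submits a document at the same moment, and it assumes pages are OCRed independently with one single-threaded worker per core. The per-page OCR time (`sec_per_page`) is a placeholder, not a measured Tesseract figure; it would have to come from benchmarking our own 300 dpi scans. The function names are made up for illustration.

```python
import math


def required_throughput(users: int, pages_per_doc: int, deadline_s: float) -> float:
    """Pages per second needed if all users submit one document at once."""
    return users * pages_per_doc / deadline_s


def cores_needed(users: int, pages_per_doc: int, deadline_s: float,
                 sec_per_page: float) -> int:
    """Rough core count, assuming one single-threaded OCR worker per core
    and perfectly parallel, independent pages (an idealization)."""
    return math.ceil(required_throughput(users, pages_per_doc, deadline_s) * sec_per_page)


if __name__ == "__main__":
    # Figures from the requirements: 100 users, 50 pages each, 20 s deadline.
    print(required_throughput(100, 50, 20))  # 250.0 pages per second
    # sec_per_page=2.0 is a hypothetical value, NOT a benchmark result.
    print(cores_needed(100, 50, 20, sec_per_page=2.0))  # 500
```

Even with an optimistic per-page time, the worst case points to many cores spread over several machines rather than one very fast server, so the real question may be how well the work scales out.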
Do you have any advice on what hardware is best suited for Tesseract? I've reviewed the Intel Xeon family of processors, and I think the Xeon Platinum processors would be a good option. Apart from fast processors, which other components affect Tesseract's performance: the amount and speed of memory, or using an SSD or a RAM disk?

Thanks in advance,
Juan Carlos