Hi Zdenko,

May I know how you cut out the region and pre-processed it?
I would much appreciate it if you could guide me on this.

Thanks,
Lee

On Saturday, July 8, 2023 at 11:54:17 PM UTC+8 zdenop wrote:

> I am not sure what you mean by "I have tried setting the Region of
> Interest (ROI)", but when I cut out the region and pre-processed it as
> described in the documentation, I got the correct results:
>
> tesseract frame_1-ROI1_preprocessed.png - --psm 7
> GOH SCE YUAN
>
> tesseract frame_1-ROI2_preprocessed.png - --psm 4
> 0197782267
> 073351668
> 0197732267
>
> Zdenko
>
>
> Wed, 28 Jun 2023 at 4:26, Lee Kar Yee <leeka...@gmail.com> wrote:
>
>> Hi,
>>
>> Apologies. Kindly refer to the following.
>>
>> With the following code, I managed to draw rectangles on the regions
>> that are processed by Tesseract OCR.
>>
>> import cv2
>> import pytesseract
>>
>> pytesseract.pytesseract.tesseract_cmd = r"C:\Program Files\Tesseract-OCR\tesseract.exe"
>>
>> video = r"C:\Users\User\Downloads\FATHER\test.mp4"
>> cap = cv2.VideoCapture(video)
>> frame_count = 0
>>
>> while cap.isOpened() and frame_count < 2:
>>     ret, frame = cap.read()
>>
>>     if not ret:
>>         break
>>
>>     # Perform OCR on the entire frame
>>     text = pytesseract.image_to_string(frame, config='--psm 1 -l eng --oem 1')
>>
>>     print(text)
>>
>>     # Get the bounding box coordinates of the detected characters
>>     boxes = pytesseract.image_to_boxes(frame, config='--psm 1 -l eng --oem 1')
>>
>>     # Each line is "char left bottom right top page", measured from the
>>     # bottom-left corner, so flip y before drawing with OpenCV
>>     frame_height = frame.shape[0]
>>     for box in boxes.splitlines():
>>         _, x1, y1, x2, y2, _ = box.split(' ')
>>         x1, y1, x2, y2 = int(x1), int(y1), int(x2), int(y2)
>>         # Draw rectangles on the frame
>>         cv2.rectangle(frame, (x1, frame_height - y2), (x2, frame_height - y1), (0, 0, 255), 1)
>>
>>     # Save the frame as an image
>>     cv2.imwrite(f"frame_{frame_count}.jpg", frame)
>>
>>     frame_count += 1
>>
>> cap.release()
>> cv2.destroyAllWindows()
>>
>> And the results are as below.
>>
>> ntes F-Farm Annlicatinns Service Reiest 9 Oar
>>
>> individual Name IDK
>>
>> GOH SCE YUAN 600
>>
>>
>> nten F-Farm Annlicatinns Service Request «9 [or
>>
>> Individual Name IDK
>>
>> GOH SCE YUAN 600
>>
>> Kindly refer to the objective.jpg for what I actually intend to capture.
>>
>> Thanks,
>>
>> Lee
>>
>> On Tuesday, June 27, 2023 at 6:25:50 PM UTC+8 zdenop wrote:
>>
>>> Without an example image, nobody can help you.
>>>
>>> Zdenko
>>>
>>>
>>> Tue, 27 Jun 2023 at 12:01, Lee Kar Yee <leeka...@gmail.com> wrote:
>>>
>>>> Hi all,
>>>>
>>>> I am new to Tesseract OCR. I am trying to extract letters and
>>>> numbers from images.
>>>> These images are frames converted from an mp4 video and saved as JPG.
>>>>
>>>> With page segmentation mode 3, it works wonders at extracting
>>>> letters, but it fails to extract numbers.
>>>>
>>>> I have tried setting the Region of Interest (ROI), but it still
>>>> failed.
>>>>
>>>> Any thoughts or directions you can point me to so that I can improve it?
>>>>
>>>> Thanks,
>>>>
>>>> Lee
>>>>
>>>> --
>>>> You received this message because you are subscribed to the Google
>>>> Groups "tesseract-ocr" group.
>>>> To unsubscribe from this group and stop receiving emails from it, send
>>>> an email to tesseract-oc...@googlegroups.com.
>>>> To view this discussion on the web visit
>>>> https://groups.google.com/d/msgid/tesseract-ocr/67b453f2-7781-44a5-be05-05676d3ee5fan%40googlegroups.com
>>>> .
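
A note on the box-drawing step in the quoted code: `pytesseract.image_to_boxes` reports each character as `char left bottom right top page`, with coordinates measured from the bottom-left corner of the image, whereas OpenCV drawing functions count rows from the top. The following is a minimal sketch of that conversion only; the helper name `box_line_to_rect` and the sample box line are made up for illustration and do not come from the thread.

```python
def box_line_to_rect(line, image_height):
    """Convert one image_to_boxes line to OpenCV-style corner points.

    Each line has the form "<char> <left> <bottom> <right> <top> <page>",
    measured from the bottom-left corner of the image.
    """
    # Split from the right so the character field is left untouched.
    _char, left, bottom, right, top, _page = line.rsplit(" ", 5)
    left, bottom, right, top = int(left), int(bottom), int(right), int(top)
    # Flip the y axis: OpenCV counts rows from the top of the image.
    top_left = (left, image_height - top)
    bottom_right = (right, image_height - bottom)
    return top_left, bottom_right


# Hypothetical box line for a 100-pixel-high image.
sample_line = "G 10 20 30 50 0"
print(box_line_to_rect(sample_line, image_height=100))
```

The two returned points can be passed directly as the corner arguments of `cv2.rectangle`, with `frame.shape[0]` as the image height.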