Working through scraping text with the Tesseract OCR, the application I’m working with requires me to scroll down to capture any and all text in the window… however some cases have less text than others, which means as it proceeds to scroll down, it will inevitably come across blank space with no text and return the following error:
Is there a way to alleviate this and have the bot continue the next step even though the text it last scraps is blank? It has no problem initially scraping the first text it sees in this window because it is filled with Forms (seen as in the example, but most removed for privacy reasons)
This is the same window with text in it before it scrolls:
This is what the bot seems after it starts scrolling down the page to scrape this entire window:
Here is the window with the error and the properties
5 posts - 2 participants