Misspellings of Words

Problem: The OCR model sometimes misreads characters, especially in certain fonts or noisy images. This can result in words being misclassified or misspelled, which then causes the automation to fail when it searches for exact matches. Example:

✅ Expected Behavior
Text is correctly spelled:
✅ Hallo ✅
👍 Works with click().text("Hallo")

❌ Actual Issue
Text is misspelled
HaII0
👎 Can’t find click().text("Hallo") because of recognition errors (l -> I and o -> 0).
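The failure above can be reproduced, and tolerated, outside the automation tool. The following is a minimal sketch, assuming you have access to the recognized string and can compare it yourself. The names `CONFUSABLES`, `normalize`, and `matches_despite_confusables` are hypothetical helpers, not part of any tool's API, and the list of look-alike characters is only a starting point.

```python
# Hypothetical helpers for illustration; not part of any OCR tool's API.
# Map characters that OCR commonly confuses onto one representative.
CONFUSABLES = str.maketrans({"I": "l", "1": "l", "|": "l", "0": "o", "O": "o"})


def normalize(text: str) -> str:
    """Collapse look-alike characters, then lowercase for comparison."""
    return text.translate(CONFUSABLES).lower()


def matches_despite_confusables(expected: str, recognized: str) -> bool:
    """True if both strings are equal once look-alikes are collapsed."""
    return normalize(expected) == normalize(recognized)


print(matches_despite_confusables("Hallo", "HaII0"))  # True
print(matches_despite_confusables("Hallo", "Hello"))  # False
```

The trade-off is that genuinely different strings such as "Ill" and "lll" also become equal, so this kind of comparison suits short, known labels better than free text.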

Solutions

Text Detection Issues

1. Icon Text Merging

Problem: Sometimes the text detector/annotation tool merges an icon and its text into one element, even though they look clearly separate on screen. Example: Say you want to click just the name field or just the role field in an interface, but OCR detects the icon and the text next to it as one element:

✅ Expected Behavior
🖼️ Icon and text are detected separately:
🧑 ✅ Name ✅ 🤖 ✅ Role ✅
👍 Works with click().text("Name") or click().text("Role")

❌ Actual Issue
🖼️ Icon and text are detected together:
🧑 Name🤖 Role
👎 Can’t find click().text("Name").

Solution

2. Merged Texts

Problem: Sometimes the text detector/annotation tool merges two separate texts into one, even though they look clearly split on screen. Example: Say you want to click just the name field “Alice Johnson” or just the position field in an interface, but OCR detects them as one long string:

✅ Expected Behavior
🖼️ Text fields detected separately:
Alice Johnson | Software Engineer
👍 Works with text("Alice Johnson") or text("Software Engineer")

❌ Actual Issue
🖼️ Texts merged into one block:
Alice Johnson Software Engineer
👎 Can’t find either one on its own.
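One way to still reach a single field inside a merged block is to search for it as a substring and estimate where it sits within the block. This is a minimal sketch under assumptions: `TextBox`, `find_containing`, and `approximate_click_point` are hypothetical names, the detector is assumed to expose each block's text and bounding box, and characters are assumed to be roughly equally wide.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class TextBox:
    """Hypothetical detected text block: text plus bounding box in pixels."""
    text: str
    x: float       # left edge
    y: float       # top edge
    width: float
    height: float


def find_containing(boxes: list[TextBox], wanted: str) -> list[TextBox]:
    """Return every block whose text contains the wanted string."""
    return [box for box in boxes if wanted in box.text]


def approximate_click_point(box: TextBox, wanted: str) -> Optional[tuple[float, float]]:
    """Estimate the centre of `wanted` inside a merged block.

    Assumes roughly uniform character width, so the horizontal position
    is proportional to the character offset within the block's text.
    """
    start = box.text.find(wanted)
    if start < 0:
        return None
    centre_chars = start + len(wanted) / 2
    return (box.x + box.width * centre_chars / len(box.text),
            box.y + box.height / 2)


merged = TextBox("Alice Johnson Software Engineer", x=100, y=50, width=310, height=20)
print(find_containing([merged], "Alice Johnson"))
print(approximate_click_point(merged, "Software Engineer"))
```

With proportional fonts the estimate drifts, so it is better treated as a fallback than as a precise locator.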

Solutions

3. Text Separation

Problem: Sometimes the text detector/annotation tool separates one text into two, even though the words clearly belong together on screen. Example: Say you want to click the name “Alice Johnson” in an interface, but OCR detects it as two separate words:

✅ Expected Behavior
🖼️ Words are detected as one sentence:
Alice Johnson
👍 Works with text("Alice Johnson")

❌ Actual Issue
🖼️ Words are detected as separated texts:
Alice | Johnson
👎 Can’t find text("Alice Johnson").
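When a phrase is split into separate words, the pieces can be stitched back together by looking for neighbouring blocks on the same line. The sketch below assumes the detector exposes each block's text and bounding box. `TextBox`, `find_phrase`, and the `max_gap` threshold are hypothetical and chosen for illustration only.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class TextBox:
    """Hypothetical detected text block: text plus bounding box in pixels."""
    text: str
    x: float       # left edge
    y: float       # top edge
    width: float
    height: float


def find_phrase(boxes: list[TextBox], phrase: str, max_gap: float = 15.0) -> Optional[TextBox]:
    """Find consecutive blocks on one line that spell out `phrase`.

    Returns one block covering all matched words, or None.
    Assumes blocks on the same line share (almost) the same top edge.
    """
    words = phrase.split()
    if not words:
        return None
    # Reading order: top to bottom, then left to right.
    ordered = sorted(boxes, key=lambda b: (b.y, b.x))
    for i in range(len(ordered) - len(words) + 1):
        run = ordered[i:i + len(words)]
        if [b.text for b in run] != words:
            continue
        same_line = all(abs(b.y - run[0].y) <= run[0].height / 2 for b in run)
        close = all(nxt.x - (prev.x + prev.width) <= max_gap
                    for prev, nxt in zip(run, run[1:]))
        if same_line and close:
            left = run[0].x
            top = min(b.y for b in run)
            right = max(b.x + b.width for b in run)
            bottom = max(b.y + b.height for b in run)
            return TextBox(phrase, left, top, right - left, bottom - top)
    return None


detected = [
    TextBox("Alice", x=100, y=50, width=40, height=20),
    TextBox("Johnson", x=146, y=50, width=64, height=20),
]
print(find_phrase(detected, "Alice Johnson"))
```

The gap threshold depends on font size and screen resolution, so it would need tuning for a real interface.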

Solution

4. Vertical Text Merging

Problem: Sometimes the text detector/annotation tool merges two lines into one text, even though they clearly appear as two lines on screen. Example: Say you want to click just the name field “Alice Johnson” or just the position field in an interface, but OCR detects both lines as one text:

✅ Expected Behavior
🖼️ Texts are detected as two lines:
Alice Johnson
👍 Works with text("Alice Johnson")

❌ Actual Issue
🖼️ Texts are detected as one text:
<no words recognized>
👎 Can’t find text("Alice Johnson").
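A vertically merged block is usually noticeably taller than its neighbours, which makes it possible to flag it and cut it into line-sized strips for another recognition pass. The following is a rough sketch of that heuristic only. `split_merged_lines` and the `ratio` threshold are hypothetical, the boxes are assumed to be plain `(x, y, width, height)` tuples, and the sketch assumes most blocks on screen are single lines of similar height.

```python
from statistics import median

# (x, y, width, height) in pixels; a hypothetical representation.
Box = tuple[float, float, float, float]


def split_merged_lines(boxes: list[Box], ratio: float = 1.6) -> list[Box]:
    """Split blocks that are much taller than the typical line height.

    A block at least `ratio` times the median height is treated as
    several merged lines and cut into equally tall strips.
    Raises statistics.StatisticsError if `boxes` is empty.
    """
    typical = median(h for _, _, _, h in boxes)
    result: list[Box] = []
    for x, y, w, h in boxes:
        lines = max(1, round(h / typical)) if h >= ratio * typical else 1
        strip = h / lines
        result.extend((x, y + i * strip, w, strip) for i in range(lines))
    return result


detected = [(10, 10, 80, 20), (10, 40, 90, 20), (10, 70, 120, 42)]
for box in split_merged_lines(detected):
    print(box)
```

Each strip could then be cropped and recognized again on its own. Whether that recovers the words depends on the OCR model.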

Solution

5. Single Character not Detected

Problem: Sometimes the text detector/annotation tool does not detect single characters, even though they are clearly visible on screen. Example: Say you want to click just the character “2”, but OCR does not detect it:

✅ Expected Behavior
🖼️ Single chars are detected:
123
👍 Works with text("2")

❌ Actual Issue
🖼️ Char 2 is not detected:
123
👎 Can’t find text("2").
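If the neighbouring characters are still detected, the missing one can be located relative to them. This is a minimal sketch, assuming “1” and “3” are found, the three characters are evenly spaced, and boxes are `(x, y, width, height)` tuples. `centre` and `midpoint_between` are hypothetical helper names.

```python
# (x, y, width, height) in pixels; a hypothetical representation.
Box = tuple[float, float, float, float]


def centre(box: Box) -> tuple[float, float]:
    """Centre point of a bounding box."""
    x, y, w, h = box
    return (x + w / 2, y + h / 2)


def midpoint_between(left: Box, right: Box) -> tuple[float, float]:
    """Point halfway between the centres of two detected neighbours.

    Assumes the undetected element sits evenly spaced between them.
    """
    (lx, ly), (rx, ry) = centre(left), centre(right)
    return ((lx + rx) / 2, (ly + ry) / 2)


one = (100, 50, 10, 20)    # detected "1"
three = (140, 50, 10, 20)  # detected "3"
print(midpoint_between(one, three))  # (125.0, 60.0)
```

The resulting point can then be used as a click target in place of the text match.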

Solution

6. Text not Detected

Problem: Sometimes, for no apparent reason, the text detector/annotation tool does not detect a text, even though you can see it clearly on screen. Example: Say you want to click just the name field “Alice Johnson”, but OCR does not detect the text at all:

✅ Expected Behavior
🖼️ Text was detected:
Alice Johnson
👍 Works with text("Alice Johnson")

❌ Actual Issue
🖼️ Text wasn’t detected:
Alice Johnson
👎 Can’t find text("Alice Johnson").

Solution