Question 1

Is my image uploaded to a server?

Accepted Answer

No. OCR runs entirely in your browser using Tesseract.js, which executes the Tesseract engine as WebAssembly. Your image is never uploaded, logged, or transmitted to any server.

Question 2

How accurate is the OCR?

Accepted Answer

Accuracy depends on the image quality, text size, and language. Clean, high-resolution (300+ DPI) printed text typically achieves 95%+ accuracy. Handwritten or low-resolution text will be less accurate.
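The answer above does not say how the accuracy figure is measured. One common way to check it yourself is character-level accuracy: compare the OCR output against a hand-typed reference and count the edits needed to turn one into the other. The sketch below assumes that definition; the function names `levenshtein` and `charAccuracy` are illustrative and not part of Tesseract.js or this tool.

```typescript
// Edit distance between two strings (insertions, deletions, substitutions).
// Strings are compared by UTF-16 code unit, which is adequate for a rough check.
function levenshtein(a: string, b: string): number {
  // prev[j] = distance between the first i-1 chars of a and the first j chars of b.
  let prev: number[] = [];
  for (let j = 0; j <= b.length; j++) prev.push(j);

  for (let i = 1; i <= a.length; i++) {
    const curr: number[] = [i];
    for (let j = 1; j <= b.length; j++) {
      const cost = a[i - 1] === b[j - 1] ? 0 : 1;
      curr[j] = Math.min(
        prev[j] + 1,        // delete a character from a
        curr[j - 1] + 1,    // insert a character into a
        prev[j - 1] + cost  // substitute (or keep, when cost is 0)
      );
    }
    prev = curr;
  }
  return prev[b.length];
}

// Character accuracy in the range 0..1, relative to the reference length.
function charAccuracy(ocrText: string, reference: string): number {
  if (reference.length === 0) return ocrText.length === 0 ? 1 : 0;
  const errors = levenshtein(ocrText, reference);
  return Math.max(0, 1 - errors / reference.length);
}
```

For example, one wrong character in an 11-character reference gives an accuracy of 10/11, or roughly 91%.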

Question 3

What languages are supported?

Accepted Answer

Tesseract.js supports 100+ languages including English, Spanish, French, German, Italian, Portuguese, Chinese (Simplified and Traditional), Japanese, Korean, Arabic, Hindi, Bengali, Russian, and many more. Use the language selector to add multiple languages for mixed-content images.
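Tesseract identifies each language by a short code and, by convention, accepts several at once as codes joined with `+` (for example `eng+jpn`). How this tool's language selector builds that value is not stated here, so the following is only a sketch: `LANGUAGE_CODES` and `buildLangParam` are hypothetical names, and the table covers just the languages named above.

```typescript
// Tesseract language codes for the languages named in the answer above.
const LANGUAGE_CODES: Record<string, string> = {
  "English": "eng",
  "Spanish": "spa",
  "French": "fra",
  "German": "deu",
  "Italian": "ita",
  "Portuguese": "por",
  "Chinese (Simplified)": "chi_sim",
  "Chinese (Traditional)": "chi_tra",
  "Japanese": "jpn",
  "Korean": "kor",
  "Arabic": "ara",
  "Hindi": "hin",
  "Bengali": "ben",
  "Russian": "rus",
};

// Turn a list of selected language names into a "+"-joined code string,
// dropping duplicates while keeping the order of selection.
function buildLangParam(names: string[]): string {
  const codes: string[] = [];
  for (const name of names) {
    if (!Object.prototype.hasOwnProperty.call(LANGUAGE_CODES, name)) {
      throw new Error(`No code listed for language: ${name}`);
    }
    const code = LANGUAGE_CODES[name];
    if (codes.indexOf(code) === -1) codes.push(code);
  }
  if (codes.length === 0) throw new Error("Select at least one language");
  return codes.join("+");
}
```

Selecting English and Japanese for a mixed-content image would yield `eng+jpn` under this scheme.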

Question 4

What image formats are supported?

Accepted Answer

JPG, PNG, WEBP, BMP, and GIF. HEIC files are not directly supported in the browser, so convert them to JPG first using our HEIC to JPG converter.
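A pre-flight check along these lines can tell a user whether a file is ready for OCR or needs converting first. This is a minimal sketch, not the tool's actual validation: `checkFormat` and the `FormatCheck` type are hypothetical, and the extension fallback assumes that browsers sometimes report an empty MIME type for a file.

```typescript
type FormatCheck = "supported" | "convert-heic" | "unsupported";

const SUPPORTED_MIME = ["image/jpeg", "image/png", "image/webp", "image/bmp", "image/gif"];
const SUPPORTED_EXT = ["jpg", "jpeg", "png", "webp", "bmp", "gif"];
const HEIC_MIME = ["image/heic", "image/heif"];
const HEIC_EXT = ["heic", "heif"];

// Accepts anything shaped like a browser File (only name and type are used).
function checkFormat(file: { name: string; type: string }): FormatCheck {
  const mime = file.type.toLowerCase();
  const dot = file.name.lastIndexOf(".");
  const ext = dot >= 0 ? file.name.slice(dot + 1).toLowerCase() : "";

  // HEIC/HEIF must be converted to JPG before OCR.
  if (HEIC_MIME.indexOf(mime) !== -1 || HEIC_EXT.indexOf(ext) !== -1) {
    return "convert-heic";
  }
  // Trust the MIME type when present; fall back to the extension when it is empty.
  if (SUPPORTED_MIME.indexOf(mime) !== -1) return "supported";
  if (mime === "" && SUPPORTED_EXT.indexOf(ext) !== -1) return "supported";
  return "unsupported";
}
```

In a page, the argument would typically be the `File` object from an `<input type="file">` element or a drop event.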

Question 5

Why is the first run slow?

Accepted Answer

The first run downloads the language data (typically 1–15 MB depending on the language). After that, the data is cached by your browser, so subsequent runs are much faster.
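The download-once, reuse-afterwards behavior can be illustrated with a small caching wrapper. This is an in-memory stand-in only: the real cache lives in browser storage and persists across page loads, which this sketch does not attempt. `createCachedLoader` and the `FetchLanguageData` type are hypothetical names, and the fetcher is supplied by the caller.

```typescript
// Hypothetical fetcher: given a language code, resolves to the raw language data.
type FetchLanguageData = (code: string) => Promise<Uint8Array>;

// Wrap a fetcher so each language is downloaded at most once per session.
function createCachedLoader(fetchData: FetchLanguageData): FetchLanguageData {
  const cache: { [code: string]: Promise<Uint8Array> } = {};

  return (code: string) => {
    if (!Object.prototype.hasOwnProperty.call(cache, code)) {
      // First request for this language: start the download and remember the promise,
      // so concurrent callers share the same in-flight request.
      const pending = fetchData(code);
      cache[code] = pending;
      // If the download fails, forget it so a later call can retry.
      pending.catch(() => {
        delete cache[code];
      });
    }
    return cache[code];
  };
}
```

Calling the wrapped loader twice for the same language triggers one download; a different language triggers its own.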

Question 6

Can I use the OCR offline?

Accepted Answer

Once the language data is cached, you can use the OCR tool offline. The TryDocsy service worker caches the page and the WebAssembly bundle so the tool continues to work without an internet connection.

Question 7

Is there a file size limit?

Accepted Answer

There is no hard limit. Because processing happens on your device, the practical limit is your available memory. Most images up to 50 MB process smoothly.
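Memory use is driven more by pixel dimensions than by file size, because a compressed image has to be decoded before it can be processed. A rough estimate, assuming a decoded RGBA bitmap at 4 bytes per pixel, is sketched below. The function names and the memory budget are illustrative assumptions; the OCR engine's actual working memory will differ.

```typescript
// RGBA at 8 bits per channel.
const BYTES_PER_PIXEL = 4;

// Approximate size of the decoded bitmap in bytes.
function estimateDecodedBytes(widthPx: number, heightPx: number): number {
  return widthPx * heightPx * BYTES_PER_PIXEL;
}

// True when the decoded bitmap fits inside a caller-chosen memory budget.
function fitsInBudget(widthPx: number, heightPx: number, budgetBytes: number): boolean {
  return estimateDecodedBytes(widthPx, heightPx) <= budgetBytes;
}

// Convert bytes to mebibytes for display.
function toMiB(bytes: number): number {
  return bytes / (1024 * 1024);
}
```

An 8000 × 6000 pixel photo, for instance, decodes to 192,000,000 bytes (about 183 MiB) regardless of how small the JPG file is on disk.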

Image to Text (OCR) — Extract Text from Any Image
