farscry install-lang
farscry ships with English built in. Additional languages are downloaded on demand and stored in ~/.farscry/models/lang/.
Usage

    farscry install-lang <code>

Examples

    farscry install-lang por      # Portuguese
    farscry install-lang rus      # Russian
    farscry install-lang chi_sim  # Chinese Simplified
    farscry install-lang jpn      # Japanese

Priority languages for v0.1.0
| Code | Language | Model size |
|---|---|---|
| eng | English (built in) | ~12 MB |
| por | Portuguese | ~10-20 MB |
| rus | Russian | ~10-20 MB |
| chi_sim | Chinese Simplified | ~10-20 MB |
How it works
farscry uses language-specific PP-OCRv5 recognition models in ONNX format, not Tesseract .traineddata files.
Each language pack is:
- Downloaded from GitHub Releases via HTTPS
- Verified with SHA256 before use
- Stored in ~/.farscry/models/lang/<code>.onnx
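
The verify-then-store steps above can be sketched roughly as follows. This is an illustration in Python, not farscry's actual code: the function names (`lang_model_path`, `sha256_of`, `install_verified`) are hypothetical, the download step is left out, and the expected checksum is assumed to come from wherever the release publishes it.

```python
import hashlib
import shutil
from pathlib import Path

# Assumption: directory taken from this page; everything else is illustrative.
LANG_DIR = Path.home() / ".farscry" / "models" / "lang"


def lang_model_path(code: str, lang_dir: Path = LANG_DIR) -> Path:
    """Return where the pack for a language code would be stored."""
    return lang_dir / f"{code}.onnx"


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so a model is never read into memory at once."""
    digest = hashlib.sha256()
    with path.open("rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def install_verified(downloaded: Path, expected_sha256: str, code: str,
                     lang_dir: Path = LANG_DIR) -> Path:
    """Move a downloaded pack into place only if its checksum matches."""
    actual = sha256_of(downloaded)
    if actual != expected_sha256.lower():
        downloaded.unlink()  # discard the unverified file
        raise ValueError(f"SHA256 mismatch for {code}: got {actual}")
    lang_dir.mkdir(parents=True, exist_ok=True)
    target = lang_model_path(code, lang_dir)
    shutil.move(str(downloaded), str(target))
    return target
```

Checking the hash before the file reaches its final path means a partial or tampered download never sits where the recognizer would load it.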
Language auto-detection
When --lang is not specified, farscry uses whatlang-rs or lingua-rs for language detection. Tesseract’s built-in language detection is not used (it is inaccurate for UI screenshots).
Using a language
Explicit language

    farscry extract screenshot.png --lang por

Multi-language (mixed content)

    farscry extract screenshot.png --lang eng+por

Storage location

    ~/.farscry/
      models/
        det.onnx          <- detection model (shared)
        rec_eng.onnx      <- English recognition (default)
        lang/
          por.onnx
          rus.onnx
          chi_sim.onnx
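
To show how a --lang value and the storage layout fit together, here is a minimal Python sketch that splits a value such as eng+por and maps each code to the model file it would need. The helper names are hypothetical and the paths simply mirror the layout on this page; farscry's real lookup logic may differ.

```python
from pathlib import Path

# Assumption: paths mirror the storage layout described on this page.
MODELS_DIR = Path.home() / ".farscry" / "models"


def parse_lang_spec(spec: str) -> list[str]:
    """Split a --lang value such as "eng+por" into codes, dropping duplicates."""
    codes: list[str] = []
    for part in spec.split("+"):
        code = part.strip()
        if not code:
            raise ValueError(f"empty language code in {spec!r}")
        if code not in codes:
            codes.append(code)
    return codes


def recognition_model_path(code: str, models_dir: Path = MODELS_DIR) -> Path:
    """English is built in as rec_eng.onnx; other packs live under lang/."""
    if code == "eng":
        return models_dir / "rec_eng.onnx"
    return models_dir / "lang" / f"{code}.onnx"


def missing_languages(spec: str, models_dir: Path = MODELS_DIR) -> list[str]:
    """List the codes in a --lang value whose model file is not present yet."""
    return [code for code in parse_lang_spec(spec)
            if not recognition_model_path(code, models_dir).exists()]
```

A check like `missing_languages("eng+por")` could be run before extraction to tell the user which packs still need `farscry install-lang`.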