An Android app for Optical Character Recognition (OCR) to extract data from Vietnamese documents like passports. It uses CameraX for image capture, ML Kit for text and face recognition, and supports translation into Vietnamese, English, and Russian. Built with MVVM for maintainability.
demo.mp4
- Clone the repo:
git clone https://github.com/lnlan1810/Document-OCR.git
- Open in Android Studio and sync dependencies.
- Run on an Android device/emulator (API 26+).
- Open the app and click "Scan" to capture/select an image.
- View extracted text and face (if detected) on the result screen.
- Select a language (Vietnamese, English, Russian) to translate text.
- Copy results to the clipboard.
- Capture images via CameraX or gallery.
- Recognize text and faces using ML Kit.
- Extract passport data (name, date of birth, etc.).
- Translate text into multiple languages.
- Clean, user-friendly UI.
- Kotlin
- CameraX
- Google ML Kit
- LiveData & Coroutines
- Fork the repo.
- Create a branch (
git checkout -b feature/your-feature
). - Commit changes (
git commit -m 'Add feature'
). - Push (
git push origin feature/your-feature
). - Open a Pull Request.