10 Commits

Author SHA1 Message Date
Yury Kossakovsky
64fc0adc67 Squash merge develop into main 2025-08-29 19:46:38 -06:00
Yury Kossakovsky
34668fb817 Remove ocr_config.yaml and update OCR service command in docker-compose.yml
- Deleted the `ocr_config.yaml` file as it is no longer needed.
- Updated the OCR service command in `docker-compose.yml` to reference the pipeline directly instead of the configuration file, streamlining the setup.
2025-08-29 17:37:35 -06:00
Yury Kossakovsky
d51c9c8ff6 Add document preprocessing submodules to OCR configuration
- Introduced new `SubPipelines` section in `ocr_config.yaml` to include `DocPreprocessor` settings.
- Added submodules for document orientation classification and unwarping, enhancing the OCR pipeline's ability to process various document formats and improve accuracy.
2025-08-29 17:31:47 -06:00
Yury Kossakovsky
12e2cf8ae1 Update OCR configuration to enhance text processing capabilities
- Reorganized `ocr_config.yaml` to include a new `text_type` key and options for document preprocessing.
- Introduced `SubModules` for `TextDetection` and `TextRecognition`, specifying model names and parameters to improve the OCR pipeline's functionality and flexibility.
2025-08-29 17:29:40 -06:00
Yury Kossakovsky
c515c6c7f9 Refactor OCR configuration to nest document preprocessor settings
- Moved `doc_preprocessor` settings under the `OCR` key in `ocr_config.yaml` for better organization.
- Added `recognizer` settings to specify the language for the OCR recognizer, enhancing the configuration structure.
2025-08-29 17:21:10 -06:00
Yury Kossakovsky
0b6dba04be Add document preprocessing options to OCR configuration
- Introduced `doc_preprocessor` settings in `ocr_config.yaml` to enable document orientation classification and unwarping.
- These enhancements improve the OCR pipeline's ability to handle various document formats and orientations, increasing overall accuracy and usability.
2025-08-29 17:17:53 -06:00
Yury Kossakovsky
1417b5983e Add pipeline name to OCR configuration
- Introduced a new key `pipeline_name` in `ocr_config.yaml` to specify the OCR pipeline name as "OCR".
- This change enhances the clarity of the configuration and allows for better identification of the pipeline in the setup.
2025-08-29 16:51:07 -06:00
Yury Kossakovsky
6354e1ae1f Update OCR configuration in docker-compose.yml and add ocr_config.yaml
- Modified the OCR service command in `docker-compose.yml` to reference the new configuration file `ocr_config.yaml`.
- Introduced `ocr_config.yaml` to specify the Russian language for the OCR pipeline.
- These changes streamline the OCR service setup and enhance language support.
2025-08-29 16:44:57 -06:00
Yury Kossakovsky
455f67675e Update OCR service command in docker-compose.yml and clean up ocr_hpi.yaml
- Modified the OCR service command in `docker-compose.yml` to correctly quote the path to the HPI configuration file.
- Cleaned up the `ocr_hpi.yaml` file by removing unnecessary lines and ensuring proper formatting for the OCR pipeline configuration.
- These changes improve the clarity and functionality of the OCR service setup.
2025-08-29 16:41:11 -06:00
Yury Kossakovsky
cc13a8ac4e Add OCR HPI configuration and update docker-compose.yml
- Introduced a new configuration file, `ocr_hpi.yaml`, to define the OCR pipeline and specify the Russian language model.
- Updated the `docker-compose.yml` to mount the new configuration file and modified the OCR service command to include the configuration path.
- These changes enhance the OCR service's capabilities by allowing for customizable pipeline settings and improved language support.
2025-08-29 16:12:48 -06:00