Docling
To make full use of the machine's resources during the ingestion process, the following settings can be tuned to match the machine:
Performance Configuration
- Number of workers: Determines how many workers Docling runs. We recommend leaving it at 2, so that one worker processes large files while the other processes smaller files.
- Number of threads: Limits the capacity that our internal extraction service can use. This number should be lower than the maximum number of cores the machine has.
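As a rough illustration of the guidance above, the following sketch derives a starting point for both values from the machine's core count. The function name `suggest_performance_settings` and the dictionary keys `workers` and `threads` are hypothetical and are not the product's actual setting names. Reserving exactly one core is also an assumption, since the text only asks for a value below the core count.

```python
import os

RECOMMENDED_WORKERS = 2  # the worker count recommended in the text above


def suggest_performance_settings(total_cores=None):
    """Return a hypothetical settings dict for the two performance options.

    The key names are illustrative only; map them to the real
    configuration fields of your deployment.
    """
    if total_cores is None:
        # os.cpu_count() can return None when the count is undetermined.
        total_cores = os.cpu_count() or 1
    if total_cores < 1:
        raise ValueError("total_cores must be at least 1")
    # Stay below the core count (assumption: leave one core free).
    # A single-core machine still needs at least one thread.
    threads = max(1, total_cores - 1)
    return {"workers": RECOMMENDED_WORKERS, "threads": threads}


if __name__ == "__main__":
    print(suggest_performance_settings())
```

On a machine with more headroom requirements (for example, other services sharing the host), a lower thread value than this sketch suggests may be more appropriate.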
Optional Configuration
- Table mode: Controls the table extraction strategy:
  - accurate: Maximum precision for complex tables. Higher processing time. Default option.
  - fast: Optimized for speed with good accuracy. Lower processing time.
  - none: Disables table extraction. No processing time.
- Do cell matching: Enables matching of table cells to improve structure recognition. Recommended for complex tables. Defaults to true.
- Force full page OCR: Forces OCR processing on entire pages instead of selective regions. Use when standard extraction misses content. This significantly increases processing time. Defaults to false.
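The optional settings and their defaults can be summarized in a small sketch like the one below. The class `OptionalSettings` and its field names are hypothetical, chosen only to mirror the options described above. They are not the product's real configuration keys or Docling's own API.

```python
from dataclasses import dataclass

# The three table modes described in the list above.
TABLE_MODES = ("accurate", "fast", "none")


@dataclass(frozen=True)
class OptionalSettings:
    """Hypothetical container mirroring the documented defaults."""

    table_mode: str = "accurate"       # default per the text above
    do_cell_matching: bool = True      # default per the text above
    force_full_page_ocr: bool = False  # default per the text above

    def __post_init__(self):
        # Reject any mode other than the three documented ones.
        if self.table_mode not in TABLE_MODES:
            raise ValueError(
                f"table_mode must be one of {TABLE_MODES}, got {self.table_mode!r}"
            )


if __name__ == "__main__":
    # Trade table precision for speed, keeping the other defaults.
    print(OptionalSettings(table_mode="fast"))
```

Validating the table mode up front is a design choice in this sketch: a typo in the mode name fails immediately rather than during ingestion.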