The Rendition Server efficiently processes documents through various stations, each tailored to handle specific document types and characteristics. The processing flow is determined by the document type and its properties.
XfaConversionStation
- Purpose: Check for dynamic content (form fields) in PDF documents.
- Action: Convert dynamic content into static page elements suitable for PDF/A compliance.
- Note: Skips the check for non-PDF documents.
BornDigitalStation
- Purpose: Convert digital documents to the target format.
- Action: Converts PDF, Office, Plain-Text, XML, HTML, and email documents.
- Result: Generates a PDF/A document in the defined target format.
- Exception: Pure image documents (jpeg, tiff, png, bmp) bypass this station.
RasterStation
- Purpose: Rasterize image documents to 300dpi for OCR text recognition and compression.
- Action: Skips digital documents generated without images.
PdfRasterStation
- Purpose: Rasterize PDF documents page-wise as a fallback.
- Note: Activated if conversion in BornDigitalStation fails.
- Caution: Text on images within the PDF may be lost and requires reconstruction.
OmniPageStation
- Purpose: Perform text recognition for rasterized documents.
- Action: Recognize text in XML format.
- Exception: Skips PDF documents, preserving their existing text.
PdfOcrImagesStation
- Purpose: Examine PDF documents for images without text.
- Action: Perform text recognition for image-only content.
- Optimization: Exclude images with existing text for faster processing.
- Quality Control: Optionally discard poor-quality recognized text.
MrcRecodingStation
- Purpose: Examine and compress images in PDF documents.
MrcStationExt
- Purpose: Compose PDF pages from rasterized images or PDFs with recognized content.
- Action: Compress image documents.
PdfAnnotationStation
- Activation: Enables hyperlink analysis for recognized text in PDFs.
- Result: Generates links in the PDF document for clickable web addresses.
PdfTaggingStation
- Purpose: Add structural information for document content.
- Benefit: Enhances accessibility for visually impaired users.
- Consideration: OCR-generated text may lack structural information.
MergeAndSplitPdfStation
- Purpose: Combine pages from multiple documents or rasterized PDFs.
- Action: Merge all pages into a single document.
Themis.StructureInfoStation
- Context: Part of the Themis package tailored for the judiciary.
- Purpose: Transform non-text components in headers and footers into text elements.
- Customization: Strategy parameters control the transformation.
PageTransformationStation
- Purpose: Scale pages to defined sizes or set page margins.
FinalizeResultsStation
- Action: Verify if the generated document adheres to defined strategy parameters.
- Conclusion: All documents pass through this station at the end of the conversion process.
This comprehensive overview provides insight into the Rendition Server's intricate document processing workflow, ensuring optimal results tailored to varied document types and customer needs.
For more information, contact our Support Team at rendition_server@foxitsoftware.com.