Package io.goobi.viewer.controller.model.alto


package io.goobi.viewer.controller.model.alto
Classes for reading and enriching ALTO XML full-text data, including coordinate-based text extraction and named-entity highlighting within transcribed page content.
  • Class
    Description
    Parses an ALTO XML document and extracts its plain text content, preserving the structural hierarchy of pages, text blocks, lines, and words.
    Detects word coordinates in alto document.
    TextEnricher implementation that wraps ALTO word tokens referenced by named-entity tags in interactive HTML buttons.
    Strategy interface for enriching the text content of individual ALTO word elements.