Package io.goobi.viewer.controller.model.alto
package io.goobi.viewer.controller.model.alto
Classes for reading and enriching ALTO XML full-text data, including coordinate-based text extraction
and named-entity highlighting within transcribed page content.
-
ClassDescriptionParses an ALTO XML document and extracts its plain text content, preserving the structural hierarchy of pages, text blocks, lines, and words.Detects word coordinates in alto document.
TextEnricherimplementation that wraps ALTO word tokens referenced by named-entity tags in interactive HTML buttons.Strategy interface for enriching the text content of individual ALTO word elements.