Skip to content
English
  • There are no suggestions because the search field is empty.

2. Structural Tags

Use structural tags to divide up your documents into structural sections like paragraphs, headers, marginalia or your customised categories

Previous step: Layout Recognition


Structural tags can be used for Field Models or when you want to restrict the text recognition to certain structure types instead of recognising the whole page.

They can also be helpful when you want to tag the layout elements (text regions and lines) and export this information in the XML alongside the coordinates of each shape.  

Structural tags are centrally managed at the collection level, allowing for easy editing and utilisation by all collaborators within the collection.

Step 1:

To work with structural tags, you will first of all need to make them visible. To do this, click on the "Settings" icon located in the menu on the right side of the screen. From there, select "Image". Under the "Visibility" section, you will find two buttons: Show structure labels and Show structure colors. These buttons allow you to to show structure labels and colours and can easily switch these buttons on and off with a simple click. You can also change the label size in the "Scale" section below.Structural Tags - Image

Step 2:

If you want to add and change customised tags, click on the "Settings" icon located in the menu on the right side of the screen and go to "Tags". Here you can choose between "Structural tags" and "Textual tags" and switch tag visibility on or off by clicking on the buttons.

Regarding the default tags, it is not possible to delete or edit them.
However, you have the option to make them visible or not.

To modify or add customised structural tags, simply click on "Edit tags in collection settings" and you will be redirected to the Tags page on your desktop. This is where you can make changes to all your tags.

You can also manage the structural tags of a collection through your Transkribus desktop. Open the collection in question and on the left-side “Transkribus organiser” menu, click on Tag Manager. Then, at the top, select “Structure Tags”. Here you can add, edit and remove structural tags. 

To add a new tag, click on “Create new”, type the name, choose the colour and click “Create” to save your changes.

Click on the structural tag to edit the name or the colour, or click the “Remove” button to delete it.

Please note that all the changes done here are saved only for the collection in question (the one opened in the background). Other collection users will see and be able to use your newly added or edited structural tags.

Structure Tags-Manager

Step 3:

To assign a structural tag to a text region or line, select the shape and right-click on it: the first menu item is "Assign structure type". Click on it and choose the relevant tag to assign to the selected shape. Here you will only see the tags you have made visible in Settings. To remove a structure tag from a shape, select the shape and choose "none" from the list of tags.

To assign the same tag to more regions at once, hold CTRL and select the relevant regions, then right-click and choose the structural tag.
Structure Tags - Tagging
The structural tag information will then be exported in the XML file of the page. 

Step 4:

Moreover, it is possible to restrict text tecognition only to the text regions tagged with specific structural tags. After selecting the model, click Advanced settings and select the relevant tags. Unflag the "Delete text from other regions" option to keep the text in the other text regions.
This feature is particularly useful when you need to extract specific text from a particular text region or when you have a combination of handwritten and printed text on the same page, and you want to use separate models for each.
Structure tags - TR (1)

Relations

You can also create Relation tags between shapes by right-clicking on them. This allows you to connect layout elements that are related to each other, which can be useful for various purposes. The relational data will also be included in the XML document.

The most common type of relation is the "Article" relation. This relation is useful when an article's text is divided into multiple columns. It allows you to link the different text regions that make up the article and indicate the reading order.

To delete a Relation tag, click on the circle at the centre of the text region where the relation starts or ends and press Delete on your keyboard. You can make circles more visible by enlarging the circle size in the Settings menu.

Structure Tag - Relations

Next step: Textual Tags