Inline Annotation¶

This category encompasses (linguistic) annotation types describing a single structural element. Examples are Part-of-Speech Annotation or Lemmatisation, which often describe a single token.

These annotation types are encoded in an inline fashion in FoLiA, i.e. they appear within the structural element to which they apply (often words/tokens but not necessarily so) and make use of the hierarchical nature of XML.

FoLiA defines the following types of inline annotation:

Inline Annotation – This category encompasses (linguistic) annotation types describing a single structural element. Examples are Part-of-Speech Annotation or Lemmatisation, which often describe a single token.
- Part-of-Speech Annotation – <pos> – Part-of-Speech Annotation, one of the most common types of linguistic annotation. Assigns a lexical class to words.
- Lemmatisation – <lemma> – Lemma Annotation, one of the most common types of linguistic annotation. Represents the canonical form of a word.
- Domain/topic Annotation – <domain> – Domain/topic Annotation. A form of inline annotation used to assign a certain domain or topic to a structure element.
- Sense Annotation – <sense> – Sense Annotation allows to assign a lexical semantic sense to a word.
- Error Detection Annotation (DEPRECATED) – <errordetection> – This annotation type is deprecated in favour of Observation Annotation and only exists for backward compatibility.
- Subjectivity Annotation (DEPRECATED) – <subjectivity> – This annotation type is deprecated in favour of Sentiment Annotation and only exists for backward compatibility.
- Language Annotation – <lang> – Language Annotation simply identifies the language a part of the text is in. Though this information is often part of the metadata, this form is considered an actual annotation.