FoLiA: Format for Linguistic Annotation - Documentation and Reference GuideΒΆ
version: 2.5.2
Abstract
FoLiA, an acronym for Format for Linguistic Annotation, is a data model and file format to represent digitised language resources enriched with linguistic annotation, e.g. linguistically enriched textual documents or transcriptions of speech. The format is intended to provide a standard for the storage and exchange of such language resources, including corpora and to promote interoperability amongst Natural Language Processing tools that use the format.
- Introduction
- Metadata
- Set Definitions (Vocabulary)
- Annotation Types
- Content Annotation
- Higher-order Annotation
- Inline Annotation
- Span Annotation
- Structure Annotation
- Token Annotation
- Division Annotation
- Paragraph Annotation
- Head Annotation
- List Annotation
- Figure Annotation
- Vertical Whitespace
- Linebreak
- Sentence Annotation
- Event Annotation
- Quote Annotation
- Note Annotation
- Reference Annotation
- Table Annotation
- Part Annotation
- Utterance Annotation
- Entry Annotation
- Term Annotation
- Definition Annotation
- Example Annotation
- Hidden Token Annotation
- Subtoken Annotation
- Text Markup Annotation
- Foreign Annotation
- Querying
- Form
- Implementations
- Guidelines