The most common CATMA terms briefly explained. (Note that the “CATMA Modules” section is organized logically rather than alphabetically.)
CATMA Modules
Project
CATMA’s Project module lets you manage, for a particular project, the resources (documents and annotation collections, tagsets) as well as the members and their roles. In the gray drawer on the left you can view project activities.
Tags
CATMA’s Tags module allows for the creation and editing of tagsets. Here you can create as many tags, subtags and properties with values as you like.
Annotate
CATMA’s Annotate module is made for close reading and manual annotation of text. Here you can highlight text passages, choose tags from a tagset and thus create annotations. In the gray drawer on the left you can choose from all tagsets and annotation collections that are part of your project.
Analyze
CATMA’s Analyze module is made for the analysis and visualization of text and annotation data. Here you can realize quantitative text analytical processes, run queries with the help of Build Query, search for your annotations, view keywords in their context (KWIC), select aspects that you want to visualize, or semi-automatically annotate your text.
General Terms
Annotations
You can add as many annotations to a text as you want. An annotation is created by applying a tag to some part of the text. Every user creates individual annotations when annotating a specific text and saves them in an annotation collection.
Annotation Collection
Annotations are user- and text-specific. You can think of annotation collections as “containers” for annotations. Each user saves their annotations in an annotation collection that is linked to a particular document.
Build Query
The Build Query function in the Analyze module helps you to create a query using a wizard interface. It is particularly useful if you are not yet familiar with CATMA’s query language. The function asks you some questions about what you are looking for and consequently builds the query for you.
Markup
Whenever you add something to a text (in CATMA by annotating it), you create what is sometimes called markup.
Project
CATMA has a project-centered architecture. In a project you upload your texts, create or upload tagsets and annotation collections, and manage project members and their roles.
Property
You can assign properties to tags, which enable you to differentiate or refine their meaning. Properties offer the possibility to extend the declaratively organized tag-based annotation by a scalar concept of qualitative evaluations, i.e. by categories that can appear on different levels of a tagset. Each property can have multiple property values.
Property Value
You can assign values to properties. They can be numerical or contain words.
Query
With queries you analyze a text or a text corpus. They range from simple to complex. CATMA has its own query language. Build Query can help you if you are not familiar with the query language.
Tag
A tag is a descriptive label that you can assign (as annotations) to any piece of text loaded into CATMA. Tags are grouped in tagsets and can form a hierarchy.
Tagset
You can think of tagsets as “containers” for tags. A CATMA project can contain multiple tagsets, which can also be exported and re-used in other projects.
Types of (Predefined) Queries
For a complete overview of CATMA’s query language, please see this page.
Comments
The comments query allows you to find all comments (comment=”%”), or only those containing particular keywords or phrases (e.g. comment=”discuss”).
Taglist
The taglist query allows you to find all annotations (tag=”%”), or only those using particular tags or subtags (e.g. tag=”/style/repetition”).
Taglist with Properties
The property query allows you to find all annotations together with their properties and values (property=”%”), or only those using particular properties (e.g. property=”gender”) and/or with particular values (e.g. property=”gender” value=”female”). Property queries are often combined with tag queries (e.g. tag=”/character” property=”gender”).
Wildcard
The wildcard query allows you to search for word beginnings (e.g. wild=”dig%”), word endings (e.g. wild=”%tal”) or words containing some sequence of letters (e.g. wild=”%it%”).
Wordlist
The wordlist query, which actually searches based on the frequency of words (freq>0), is an easy way to find all words appearing in one or multiple texts.
Types of Visualizations
These are the different visualization options provided by CATMA’s Analyze module.
Distribution Chart
This can be used to visualize the distribution of one or more words, phrases, or tags throughout one or more texts.
DoubleTree
This can be used to visualize keywords in their contexts, similar to KWIC, but in a more interactive and exploratory way.
KWIC (KeyWord In Context)
This can be used to show one or more words, phrases or annotations together with the contexts they appear in (i.e. the words that appear before and after in the source text) in a tabular format.
Wordcloud
You can create wordclouds of words, phrases or tags.