T-LAB 10.2 - ON-LINE HELP - T-LAB Tools for Text Analysis

Words and Lemmas

Any text analysis software first identifies the so-called raw forms, that is, the strings of characters separated by blank spaces. Then, according either to its own algorithms or to the categories used by specialists, the software recognizes lexemes, key-words, etc.
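
As a minimal illustration of what "raw forms" means here, the following Python sketch splits a text on blank spaces and counts the resulting strings. It is not T-LAB's own procedure; the function name `raw_forms` and the sample sentence are assumptions made for this example only.

```python
from collections import Counter

def raw_forms(text: str) -> list[str]:
    """Return the strings separated by blank spaces (hypothetical helper)."""
    # str.split() with no argument splits on any run of whitespace.
    # Punctuation stays attached to the string it touches.
    return text.split()

sample = "The cat sat. The cats sit."
forms = raw_forms(sample)

# Count how often each raw form occurs.
counts = Counter(forms)
```

Note that "cat" and "cats" remain distinct raw forms, and "sat." keeps its full stop: grouping and cleaning such strings is the job of the later recognition steps.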

For every lexical unit present in the corpus database, T-LAB tables provide two types of information:

· the first, named "word", contains the lexical units (single words or multi-words) transcribed as the "strings" recognized by the software;

· the second, named "lemma", contains the labels (or tags) used for grouping and classifying the lexical units.

Depending on the case, a lemma can be:

- the result of the automatic lemmatization process;
- an item of a "customized dictionary";
- a category grouping synonyms;
- a content analysis category;
- etc.
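
The relation between the "word" and "lemma" columns can be pictured as a simple lookup table. The sketch below is only an illustration under stated assumptions: the table `WORD_TO_LEMMA`, its entries, and the function `group_by_lemma` are hypothetical and do not reflect T-LAB's internal data or file formats.

```python
from collections import defaultdict

# Hypothetical word -> lemma table; each entry mimics one of the cases above.
WORD_TO_LEMMA = {
    "is": "be",                    # result of automatic lemmatization
    "was": "be",
    "car": "vehicle",              # category grouping synonyms
    "automobile": "vehicle",
    "happy": "POSITIVE_EMOTION",   # content analysis category
}

def group_by_lemma(words: list[str]) -> dict[str, list[str]]:
    """Group words under their lemma label (hypothetical helper)."""
    groups = defaultdict(list)
    for word in words:
        # Assumption: a word with no entry keeps its own form as its label.
        lemma = WORD_TO_LEMMA.get(word, word)
        groups[lemma].append(word)
    return dict(groups)

grouped = group_by_lemma(["is", "was", "car", "automobile", "happy", "tree"])
```

Here several different "words" share one "lemma" label, which is what allows occurrences to be counted and compared at the level of the label rather than of the individual string.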