T-LAB 10.2 - ON-LINE HELP - T-LAB Tools for Text Analysis

T-LAB 10.2 - ON-LINE HELP

T-LAB

Introduction

What T-LAB does and what it enables us to do

Requirements and Performances

Corpus Preparation

Corpus Preparation

Structural Criteria

Formal Criteria

File

Import a single file...

Prepare a Corpus (Corpus Builder)

Open an existing project

Settings

Automatic and Customized Settings

Dictionary Building

Co-occurrence Analysis

Word Associations

Co-Word Analysis and Concept Mapping

Comparison between Word pairs

Sequence and Network Analysis

Co-occurrence Toolkit

Thematic Analysis

Thematic Analysis of Elementary Contexts

Modeling of Emerging Themes

Thematic Document Classification

Dictionary-Based Classification

Texts and Discourses as Dynamic Systems

Comparative Analysis

Specificity Analysis

Correspondence Analysis

Multiple Correspondence Analysis

Cluster Analysis

Singular Value Decomposition

Lexical Tools

Text Screening / Disambiguations

Corpus Vocabulary

Multi-Word List

Word Segmentation

Other Tools

Variable Manager

Advanced Corpus Search

Classification of New Documents

Key Contexts of Thematic Words

Export Custom Tables

Import-Export Identifiers list

Glossary

Association Indexes

Cluster Analysis

Corpus and Subsets

Correspondence Analysis

Elementary Context

Frequency Threshold

Key-Word (Key-Term)

Lexie and Lexicalization

Occurrences and Co-occurrences

Poles of Factors

Primary Document

Thematic Nucleus

Variables and Categories

Words and Lemmas

www.tlab.it

Normalization

In T-LAB, corpus normalization has the double goal of:

a) allowing correct word detection as raw forms;

b) solving some ambiguity cases.

This means that T-LAB, in the first place, carries out a number of processes on the file under analysis: blank space in excess elimination, apostrophe marking, space addition after punctuation marks, capital letter reduction, etc.

Secondly, T-LAB marks a set of strings recognized as proper nouns; then converts the sequences of row forms recognized as multiwords in unitary strings, in order to use them in that form during the analysis process ("in terms of" and "point of view" become respectively "in_terms_of" and "point_of_view").

These operation parameters cannot be modified by the user.

In order to have a correct recognition of raw forms, in the normalization routine, T-LAB uses the following marks:

, ; : . ! ? ' " ( ) < > + / = [ ] { }