T-LAB 10.2 - ON-LINE HELP - T-LAB Tools for Text Analysis

T-LAB Home

T-LAB 10.2 - ON-LINE HELP

T-LAB

Introduction

What T-LAB does and what it enables us to do

Requirements and Performances

Corpus Preparation

Corpus Preparation

Structural Criteria

Formal Criteria

File

Import a single file...

Prepare a Corpus (Corpus Builder)

Open an existing project

Settings

Automatic and Customized Settings

Dictionary Building

Co-occurrence Analysis

Word Associations

Co-Word Analysis and Concept Mapping

Comparison between Word pairs

Sequence and Network Analysis

Concordances

Co-occurrence Toolkit

Thematic Analysis

Thematic Analysis of Elementary Contexts

Modeling of Emerging Themes

Thematic Document Classification

Dictionary-Based Classification

Texts and Discourses as Dynamic Systems

Comparative Analysis

Specificity Analysis

Correspondence Analysis

Multiple Correspondence Analysis

Cluster Analysis

Singular Value Decomposition

Lexical Tools

Text Screening / Disambiguations

Other Tools

Advanced Corpus Search

Classification of New Documents

Key Contexts of Thematic Words

Export Custom Tables

Editor

Import-Export Identifiers list

Glossary

Correspondence Analysis

Lexie and Lexicalization

MDS

Occurrences and Co-occurrences

Variables and Categories

Words and Lemmas

Bibliography

TF-IDF

This measure, proposed by G. Salton (1989), allows us to evaluate the weight of a term (lexical unit) within a document (context unit).

Its formula is the following:

w i,j = tf i,j x idf i (Term Frequency x Inverse Document Frequency)

Where:

tf i,j = number of occurrences of i (term) in j (document)
df i = number of documents containing i
N = total number of documents

Term Frequency (tf i,j ) value can be normalized as follows:

tf i,j = tf i,j / Max (f i,j )

where Max (f i,j ) is the maximum frequency of i(any term) in the j (document).