T-LAB Home
T-LAB 10.2 - ON-LINE HELP Prev Page Prev Page
T-LAB
Introduction
What T-LAB does and what it enables us to do
Requirements and Performances
Corpus Preparation
Corpus Preparation
Structural Criteria
Formal Criteria
File
Import a single file...
Prepare a Corpus (Corpus Builder)
Open an existing project
Settings
Automatic and Customized Settings
Dictionary Building
Co-occurrence Analysis
Word Associations
Co-Word Analysis and Concept Mapping
Comparison between Word pairs
Sequence and Network Analysis
Concordances
Co-occurrence Toolkit
Thematic Analysis
Thematic Analysis of Elementary Contexts
Modeling of Emerging Themes
Thematic Document Classification
Dictionary-Based Classification
Texts and Discourses as Dynamic Systems
Comparative Analysis
Specificity Analysis
Correspondence Analysis
Multiple Correspondence Analysis
Cluster Analysis
Singular Value Decomposition
Lexical Tools
Text Screening / Disambiguations
Corpus Vocabulary
Stop-Word List
Multi-Word List
Word Segmentation
Other Tools
Variable Manager
Advanced Corpus Search
Classification of New Documents
Key Contexts of Thematic Words
Export Custom Tables
Editor
Import-Export Identifiers list
Glossary
Analysis Unit
Association Indexes
Chi-Square
Cluster Analysis
Coding
Context Unit
Corpus and Subsets
Correspondence Analysis
Data Table
Disambiguation
Dictionary
Elementary Context
Frequency Threshold
Graph Maker
Homograph
IDnumber
Isotopy
Key-Word (Key-Term)
Lemmatization
Lexical Unit
Lexie and Lexicalization
Markov Chain
MDS
Multiwords
N-grams
Naïve Bayes
Normalization
Occurrences and Co-occurrences
Poles of Factors
Primary Document
Profile
Specificity
Stop Word List
Test Value
Thematic Nucleus
TF-IDF
Variables and Categories
Words and Lemmas
Bibliography
www.tlab.it

Key Contexts of Thematic Words


According to the type of input, this T-LAB can be used for two different purposes:

A) to extract lists of meaningful context units (i.e. elementary contexts or short documents) which allow us to deepen the thematic value of specific key terms;
B) to extract the context units which are the most similar to sample texts chosen by the user.

Here are some explanations for the two above cases.

Case (A)

Unlike Concordances, which allows the extraction of all elementary contexts in which the selected key words are just present (occurrences), and unlike Word Associations, which allows the extraction of all elementary contexts in which the selected key words are in matching pairs (co-occurrences), this tool allow us to extract the elementary contexts in which each selected key word is associated with other words (multiple co-occurrences) defining its thematic field.

It works in the following way:

1- the user chooses a thematic word "X" (see "Muslim" below);
2- T-LAB proposes a list of words (max. 50) whose co-occurrence values with "X" are the most significant;
3- the user can remove irrelevant items from the list provided (just double click each item);
4- after clicking 'Extract Key Contexts' T-LAB assumes that the user list is a query vector and computes its association indexes (i.e. cosine coefficients) with all the elementary contexts of the corpus or of the selected corpus subset.

The output provided, both in HTML and TXT format, contain a list of the most significant key-contexts of "X", listed according to the descending order of their association indexes.

The 1-4 steps can be reiterated for "n" thematic words.

Case (B)

It works in the following way:

1- the user copy/paste a text (Max 5,000 characters) in the appropriate box;
2- after clicking the 'extract key contexts' button, T-LAB transforms the input text into a query vector and computes its association indexes (i.e. cosine coefficients) with all the elementary contexts of the corpus or of the selected corpus subset
.

The output provided, both in HTML and TXT format, contain a list key-contexts which are the most similar to the input text.

N.B.: In such a case the similarity measure doesn't take into account multi-words the strings of which, either with or without the underscore ('_') character, do not correspond to the analysed text.

The above 1-2 steps can be reiterated for "n" sample texts.
.