www.tlab.it
Occurrences and Co-occurrences
Occurrences are quantities obtained by counting how many times
(frequencies) a single lexical unit (LU) occurs within a corpus or
within the context units (CU) into which the corpus is subdivided.
Their distribution can be represented in contingency tables as
follows:
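As a hedged illustration (not T-LAB's own output), a small table of this kind can be built in a few lines of Python. The corpus, the context-unit labels and the selection of lexical units below are invented for the example, and splitting on whitespace is a simplifying assumption about how lexical units are identified.

```python
from collections import Counter

# Toy corpus: each entry is one context unit (CU); the texts are invented.
context_units = {
    "CU1": "the cat sat on the mat",
    "CU2": "the dog sat",
    "CU3": "cat and dog",
}

# Lexical units (LU) of interest; a hypothetical selection.
lexical_units = ["cat", "dog", "sat", "the"]

# Count every whitespace-separated token inside each CU.
counts = {cu: Counter(text.split()) for cu, text in context_units.items()}

# Contingency table: rows = lexical units, columns = context units.
occurrence_table = {
    lu: {cu: counts[cu][lu] for cu in context_units} for lu in lexical_units
}

# Occurrences of each LU within the whole corpus (row totals).
corpus_totals = {lu: sum(row.values()) for lu, row in occurrence_table.items()}

for lu in lexical_units:
    print(lu, occurrence_table[lu], "total:", corpus_totals[lu])
```

Each row shows how one lexical unit is distributed across the context units, and the row total is its number of occurrences in the corpus.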
Co-occurrences are quantities obtained by counting how many times two or
more lexical units are present together in the same elementary contexts (EC).
Their distribution can be represented in tables such as
the following:
With a simple transformation, the rectangular "A" type table can be
converted into a "B" type table (square and symmetrical), in which each
cell gives, for a pair of lexical units, the number of their
co-occurrences, that is, the total number of elementary contexts in
which both are present.
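The transformation can be sketched as follows. The layout of the "A" type table is an assumption here: one row per elementary context, one column per lexical unit, with 1 for presence and 0 for absence. The data are invented. Under that assumption the "B" type table is the product of the transposed table with itself.

```python
# Type "A" table (assumed layout): one row per elementary context (EC),
# one column per lexical unit (LU); 1 = present, 0 = absent. Invented data.
lexical_units = ["cat", "dog", "sat"]
table_a = [
    [1, 0, 1],  # EC1
    [0, 1, 1],  # EC2
    [1, 1, 0],  # EC3
    [1, 1, 1],  # EC4
]

n = len(lexical_units)

# Type "B" table: entry [i][j] counts the ECs in which
# LU i and LU j are both present (equivalent to A-transposed times A).
table_b = [
    [sum(row[i] * row[j] for row in table_a) for j in range(n)]
    for i in range(n)
]

for lu, row in zip(lexical_units, table_b):
    print(lu, row)
```

The result is square and symmetrical. In this sketch the diagonal holds the number of elementary contexts that contain each lexical unit, which is a by-product of the computation rather than a co-occurrence in the strict sense.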
In T-LAB, text analysis is mostly carried out by studying the
relationships among occurrences and co-occurrences, either through
specific association indexes or through multidimensional statistical
techniques such as cluster analysis and correspondence analysis.
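The text does not say which association indexes are meant. As one common example, chosen here only for illustration, the cosine coefficient relates the co-occurrences of two lexical units to the number of elementary contexts in which each appears. The function name and the counts below are hypothetical.

```python
import math


def cosine_association(co_ij: int, n_i: int, n_j: int) -> float:
    """Cosine coefficient for two lexical units.

    co_ij: number of ECs in which both units are present
    n_i, n_j: number of ECs in which each unit is present
    """
    if n_i == 0 or n_j == 0:
        return 0.0  # a unit that never occurs has no association
    return co_ij / math.sqrt(n_i * n_j)


# Invented counts: units present in 4 and 9 ECs, together in 3 of them.
score = cosine_association(3, 4, 9)
print(score)  # 3 / sqrt(36) = 0.5
```

The value ranges from 0 (the units never appear together) to 1 (they always appear in exactly the same elementary contexts).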