Ana Luísa Varani Leal

Márcia Schmaltz
Chao Chi Meng
Chiu Lai Fan
Luís Felipe Rodrigues

In Computational Linguistics, work on textual analysis mainly attempts to explain the behavior of, and the relations among, linguistic components identified in the textual structure. Such research concentrates on morphological and syntactic structures. Little research has focused on semantic analysis, in which the text itself is the concrete object of study. Textual analysis makes it possible to examine and justify the processes involved in the constitution of a text. Textual coherence, which emerges from the rhetorical relations that occur in the textual configuration, is the representative element of the discourse structure and must therefore be analyzed and understood in global terms. To this end, it is necessary to consider all textual levels related to the process of signification. To explore textual coherence, we propose and develop a methodology for the analysis of discourse themes. The proposal was evaluated on its ability to identify the localized processes and their relations in the constitution of the subject, with the aim of automatically producing a macro-proposition and a macro-structure. The results are promising in terms of precision and recall.
Screenshots of the AuTema-Dis I system
Figure 1: Data input.

Figure 2: Parser output.

Figure 3: Segmentation.

Figure 4: Rhetorical relations.

Figure 5: Generated macroproposition.

