Public Groupactive 2 years, 1 month ago
This project CorpusRedEs aims at building an annotated corpus of Computer-Mediated Communication in Spanish. The corpus will gather texts from different cybergenres or socio-technical modes of CMC, including the diatopic varieties of Spanish as well as several domains. The annotation of the macrostructure of texts is based on the TEI-XML standard adapted to CMC, in order to favor the interoperability between platforms and the easy recovery of data by users. In this sense, we suggest that the posting element considered in other projects for the segmentation of CMC interaction units (see the TEI Special Interest Group on CMC), may be enriched with further elements and attributes used for the annotation of spoken language corpora, for an accurate description of the interactional dynamics within these texts.