October 2018

Corpora in scientific research

A corpus is a “reasoned” set of texts governed by an internal coherence. It can be built in different ways and structured using several methods and approaches. Corpora raise a number of practical and epistemological questions concerning their typologies, structures and modes of organization, but also the ways in which the methodological paradigms used to analyze and exploit them are shaped. Enriched by continual digital advances, corpora have quickly become the subject of dedicated standards proposed by research communities within the Digital Humanities movement.