Osnovne teorijsko-metodološke karakteristike korpusne lingvistike


  • Tanja Samardžić


This paper reviews the key notions within the corpus lingusitsics approach. Texts are considered to be scientifically observable, quantifiable language units. Corpus is a sample of language. Continuous scales are seen as a possible instrument for the quantification of linguistic data. Context plays a significant role in the identification and descritpion of linguistic units. Reocurrence, co-occurence and frequency are seen as possible evidence for linguistic facts. A patter is a generalized linguistic unit based on this evidence. The idiom principle is seen as primary when denoting strategy and the open choice principle.


Barnbook, G. 2002. Language of Definitions. Amsterdam: John Benjamins Publishing Company.

Biber, D. 1988. Variation across Speech and Writing. Cambridge: Cambridge University Press.

Combley, R. et al. 2002. Collins COBUILD New Student's Dictionary. Glasgow: HarperCollins.

Daniellsson, P. 2001. The Automatic Identification of Meaningful Units in Language. Sprakdata, Department of Swedish, Goteborg University (PhD Dissertation).

Francis, G. et al. 1996. Collins COBUILD Grammar Patterns 1: Verbs. London: HaprerCollins.

Hunston, S. and G. Francis. 1999. Pattern Grammar: A Corpus-driven Approach to the Lexical Grammar of English. Amsterdam: John Benjamins Publishing Company.

Mel'cuk, I. 1998. Collocations and Lexical Functions. In A. P. Cowie (ed.) Phraseology, Theory, Analysis, and Applications. Oxford: Oxford University Press.

Stubbs, M. 2001. Words and Phrases - Corpus Studies of Lexical Semantics. Oxford: Blackwell.




How to Cite

Samardžić, T. (2021). Osnovne teorijsko-metodološke karakteristike korpusne lingvistike. Philologia, 2(1), 15–18. Retrieved from http://philologia.org.rs/index.php/ph/article/view/382



Nauka o jeziku/Linguistics