Today"s Corpus Linguistics: Some Open Questions


František Čermák
Today's Corpus Linguistics: Some Open Questions

International Journal of Corpus Linguistics 7,2, 2003, 265-282.


In general, the paper is concerned with problems of methodology. Against this background, situation of today´s corpora is being discussed and some fields are being pointed at as being in a far from satisfactory shape. Place of corpora in linguistics is briefly looked at, suggesting that structuralist tradition might be using them most. Problems of annotation and ways, less (statistical) or more succesful (rule-based), are noted. Here, some of the most serious shortcomings, such as multi-word units or status of language units in general that computational linguists should deal with, is listed. In a more general direction, implications and status of paradigmatics and syntagmatics is discussed, too, with a considerable and critical attention paid to ontologies.