A Poem on Corpus Linguistics

From Corpus4u wiki

Jump to: navigation, search

The following poem was accidentally discovered on a table among the debris at the end of Nobel Symposium. A range of stylistic research techniques (including those of forensic linguistics) have been used in order to establish the identity of its originator. While these efforts have not led to conclusive results, the editor suspects that the cryptic pseudonym “A.K. (Anna Kerr-Luther)” holds the key to the poet’s identity. Since this poem captures – succinctly, accurately and elegantly – the spirit of the whole Symposium, he takes the liberty of it as an “Afterword” in the Proceedings.

The Purpora of Corpora: How to Dispose of the Body or The Corpus-Maker’s (Di-)Lemma

by A.K. (Anna Kerr-Luther)

A base of misty tulle
Beyond the bank of trees?
Go plant a modal hedge
Lest your annealing freeze.
-- But as for language processing, mechanical or muscular,
There is nothing can improve upon a database corpuscular!
In London or in Lund,
From Birmingham to Brown,
No adjectives escapes,
They never lose a noun.
-- But you cannot make a dictionary, notional or rational,
Without a Bank of Collocates and Converse International!
A Hidden Markoff Process
Lurks beneath your text,
To guard the crossing gates
From the wor(l)d to the next.
 -- But you run (or take) the risk that your results are merely tentative
Unless you base your corpus on a sample representative!
Don’t try a demonstration,
Just register your name.
If not “I saw, I conquered”
You can always say “ICAME”.
-- Where Elf, the Sacred River, runs (with conference facilities)
Through caverns that are measureless as global probabilities!
Beware the Svart(vik) Boxes
With their electronic brains –
They’ll have you tagged and parsed
In referential chains.
-- But if you feel constrained to have your grammar realisticall,
You particles based properly, your lexis authentisticall,
Don’t rely on introspection, on the magic and the mystical,
Don’t hope to find a software pack of strategies heuristical.
Just build yourself a corpus (or some corpora) statistical.

Source: Svartvik, Jan (ed.). 1992. Directions in Corpus Linguistics: Proceedings of Nobel Symposium 82, Stockholm, 4-8 August 1991. Berlin and New York: Mouton de Gruyter (pp: 471-474).

Personal tools