corpus
English
[edit]Etymology
[edit]Borrowed from Latin corpus (“body”). Doublet of corpse, corps, and riff.
Pronunciation
[edit]- (Received Pronunciation) IPA(key): /ˈkɔːpəs/
- (General American) IPA(key): /ˈkɔɹpəs/
Audio (General Australian): (file) - Rhymes: -ɔː(ɹ)pəs
- Hyphenation: cor‧pus
Noun
[edit]corpus (plural corpora or corpuses or corpusses or (proscribed) corpi)
- A collection of written or spoken texts.
- 2011, Patrick Spedding, James Lambert, “Fanny Hill, Lord Fanny, and the Myth of Metonymy”, in Studies in Philology, volume 108, number 1, page 113:
- No one suggests that Browning intended to mean vagina when he wrote “owls and bats, / Cowls and twats,” because the context does not allow for it, nor does the greater context of the Browning corpus.
- (specifically, linguistics) Such a collection in form of an electronic database used for linguistic analyses.
- Synonyms: digital corpus, text corpus
- 2007, Mihail Mihailov, Hannu Tommola, “Compiling Parallel Text Corpora: Towards Automation of Routine Procedures”, in Wolfgang Teubert, editor, Text Corpora and Multilingual Lexicography (Benjamins Current Topics; 8), Amsterdam: John Benjamins Publishing Company, →ISBN, page 60:
- Text corpora are being used in most current lexicographic projects. Applied linguistic research is another field where text corpora are welcome as an inexhaustible source of empirical information, a polygon for testing various linguistic tools – spell-checkers, OCRs, machine translation systems, NLP systems, etc.
- 2008, Anabel Borja, “Corpora for Translators in Spain. The CDJ-GITRAD Corpus and the GENITT Project.”, in Gunilla [M.] Anderman, Margaret Rogers, editors, Incorporating Corpora: The Linguist and the Translator, Clevedon, North Somerset: Multilingual Matters, →ISBN, page 248:
- Comparable corpora are made up of texts in different languages that may be related in various ways, but are not translations of each other. They may have nothing in common at all, or be on the same subject, of the same genre, or from the same chronological period, etc.
- 2013, “Introduction”, in Gerry Knowles, Briony Williams, L[ita] Taylor, editors, A Corpus of Formal British English Speech: The Lancaster/IBM Spoken English Corpus, Abingdon, Oxon.; New York, N.Y.: Routledge, →ISBN, page 1:
- The Lancaster/IBM Spoken English Corpus began in September 1984 as part of a research project into the automatic assignment of intonation […] The original design of the corpus was determined by the need to provide data for research into speech synthesis. As a result, unlike most other corpora currently being used in the computational linguistics field, the SEC exists in several forms. […] However, whatever the original motivation for compiling a corpus, it quickly becomes an object of interest in its own right. New users find it valuable for applications for which it was not designed.
- 2014, Giuseppina Balossi, “Corpus Approaches to the Study of Language and Literature”, in A Corpus Linguistic Approach to Literary Language and Characterization: Virginia Woolf's The Waves (Linguistic Approaches to Literature; 18), Amsterdam: John Benjamins Publishing Company, →ISBN, page 41:
- A corpus approach is a useful methodology for observing, describing and interpreting the stylistic features of language in literary and non-literary texts.
- 2018, James Lambert, “A multitude of ‘lishes’: The nomenclature of hybridity”, in English World-Wide[1], page 4:
- Today, computer databases and corpora infinitely increase the ease of this type of research, but the collecting process remains essentially the same.
- (physics) A structure of a special character or function in the animal body.
- the corpus of the uterus
- (uncommon) A collection or body of objects with similar characteristics.
- Synonyms: collection; see also Thesaurus:body
- 1998, Dimitǎr Draganov, “New Coin Types of Hadrianopolis”, in Ulrike Peter, editor, Stephanos Nomismatikos: Edith Schönert-Geiss zum 65. Geburtstag (Griechisches Münzwerk), Berlin: Akademie Verlag, →ISBN, page 221:
- About a hundred years ago in Germany, the publishing of corpuses of the ancient Greek coinages was started. […] The significance of those, and some other corpuses is exclusive, because they allowed an enormous amount of numismatic material kept in museum and private collections all over the world, to be studied and systematized.
- 2014, Margaret Darling, Barbara Precious, “Introduction”, in A Corpus of Roman Pottery from Lincoln (Lincoln Archaeological Studies; 6), Oxford: Oxbow Books, →ISBN, page 1:
- An assessment in 1991 proposed publication of the results of this work in three stages: […] secondly, a corpus of the Roman pottery to present the type series and to discuss the fabrics and forms recovered, […]
- (archaic) The body of a man or animal.
Usage notes
[edit]- Of the plurals, corpora is the most common one.[1]
Derived terms
[edit]Related terms
[edit]Translations
[edit]
|
|
See also
[edit]See also
[edit]References
[edit]- James A. H. Murray et al., editors (1884–1928), “Corpus”, in A New English Dictionary on Historical Principles (Oxford English Dictionary), volume II (C), London: Clarendon Press, →OCLC, page 1012: “1. The body of a man or animal. / […] / 2. Phys. A structure of a special character or function in the animal body […]”
Anagrams
[edit]Basque
[edit]Etymology
[edit]Pronunciation
[edit]Noun
[edit]corpus inan
- corpus (a collection of writings)
Declension
[edit]| indefinite | singular | plural | proximal plural | |
|---|---|---|---|---|
| absolutive | corpus | corpusa | corpusak | corpusok |
| ergative | corpusek | corpusak | corpusek | corpusok |
| dative | corpusi | corpusari | corpusei | corpusoi |
| genitive | corpusen | corpusaren | corpusen | corpuson |
| comitative | corpusekin | corpusarekin | corpusekin | corpusokin |
| causative | corpusengatik | corpusarengatik | corpusengatik | corpusongatik |
| benefactive | corpusentzat | corpusarentzat | corpusentzat | corpusontzat |
| instrumental | corpusez | corpusaz | corpusez | corpusotaz |
| innesive | corpusetan | corpusean | corpusetan | corpusotan |
| locative | corpusetako | corpuseko | corpusetako | corpusotako |
| allative | corpusetara | corpusera | corpusetara | corpusotara |
| terminative | corpusetaraino | corpuseraino | corpusetaraino | corpusotaraino |
| directive | corpusetarantz | corpuserantz | corpusetarantz | corpusotarantz |
| destinative | corpusetarako | corpuserako | corpusetarako | corpusotarako |
| ablative | corpusetatik | corpusetik | corpusetatik | corpusotatik |
| partitive | corpusik | — | — | — |
| prolative | corpustzat | — | — | — |
Further reading
[edit]- “corpus”, in Euskaltzaindiaren Hiztegia [Dictionary of the Basque Academy] (in Basque), Euskaltzaindia [Royal Academy of the Basque Language]
Catalan
[edit]Etymology
[edit]Borrowed from Latin corpus. Doublet of cos.
Pronunciation
[edit]Noun
[edit]corpus m (invariable)
- corpus (a collection of writings)
Further reading
[edit]- “corpus”, in Diccionari de la llengua catalana [Dictionary of the Catalan Language] (in Catalan), second edition, Institute of Catalan Studies [Catalan: Institut d'Estudis Catalans], April 2007
Dutch
[edit]Etymology
[edit]Borrowed from Latin corpus. Doublet of corps and korps.
Pronunciation
[edit]Noun
[edit]corpus n (plural corpora or corpussen, diminutive corpusje n)
Usage notes
[edit]The word retained the original Latin neuter gender. It is one of the few Dutch words ending on -us that is not masculine.
Derived terms
[edit]Descendants
[edit]- → Indonesian: korpus
Further reading
[edit]- “corpus” in Woordenlijst Nederlandse Taal – Officiële Spelling, Nederlandse Taalunie. [the official spelling word list for the Dutch language]
French
[edit]Etymology
[edit]Unadapted borrowing from Latin corpus (“body”). Doublet of corps.
Pronunciation
[edit]Audio (Switzerland, Lausanne): (file)
Noun
[edit]corpus m (invariable)
- (linguistics) a corpus, a body of texts
Further reading
[edit]- “corpus”, in Trésor de la langue française informatisé [Digitized Treasury of the French Language], 2012.
Latin
[edit]Etymology
[edit]From Proto-Italic *korpos, from Proto-Indo-European *krépos (“body”), from the root *krep-. Equivalent to the Proto-Germanic neuter noun *hrefaz (“body, torso”), whence e.g. Old High German href, Old Dutch ref, Old English hrif (> English riff).[1]
Pronunciation
[edit]- (Classical Latin) IPA(key): [ˈkɔr.pʊs]
- (modern Italianate Ecclesiastical) IPA(key): [ˈkɔr.pus]
- Hyphenation: cor‧pus
Noun
[edit]corpus n (genitive corporis); third declension
- (anatomy) body, person (person when used to mean "human body", e.g., "on one's person")
- c. 65 AD, Seneca Minor, Epistulae Morales ad Lucilium, Epistula XCII
- Nemo liber est qui corpori servit.
- No one is free who is a slave to the body.
- Nemo liber est qui corpori servit.
- c. 65 AD, Seneca Minor, Epistulae Morales ad Lucilium, Epistula XCII
- substance, material (physical, perceptible to the senses)
- Synonym: rēs
- the flesh of an animal's body
- a corpse
- the trunk or shaft of something
- (figuratively) the wood under the bark of a tree
- (Medieval Latin) a corpus (collection of writings by a single author or addressing a certain topic)
- (metonymic) person, individual
- 27 BCE – 25 BCE, Titus Livius, Ab Urbe Condita 3.56:
- qui liberum corpus in servitutem addixisset
- who as a free person might have been sentenced to slavery
- qui liberum corpus in servitutem addixisset
- (metonymic) a frame, body, system, structure, community, corporation
- (New Latin, physics) a physical object, a body (as in a celestial body)
Declension
[edit]Third-declension noun (neuter, imparisyllabic non-i-stem).
| singular | plural | |
|---|---|---|
| nominative | corpus | corpora |
| genitive | corporis | corporum |
| dative | corporī | corporibus |
| accusative | corpus | corpora |
| ablative | corpore | corporibus |
| vocative | corpus | corpora |
Derived terms
[edit]Related terms
[edit]Descendants
[edit]- Insular Romance:
- Balkano-Romance:
- Megleno-Romanian: corp
- Italo-Dalmatian:
- Rhaeto-Romance:
- Gallo-Italic:
- Gallo-Romance:
- Ibero-Romance:
- Borrowings:
- → Basque: gorputz
- → Catalan: corpus
- → English: corpus
- → Esperanto: korpo
- Ido: korpo
- → German: Korpus
- → Romanian: corp
- → Italian: corpus
- → Middle High German: korper, körper, körpel
- German: Körper
- → Old Irish: corp
- → Portuguese: corpus
- → Proto-Brythonic: *korf
- → Russian: ко́рпус (kórpus)
- → Spanish: corpus
References
[edit]- ^ De Vaan, Michiel (2008), “corpus, -oris”, in Etymological Dictionary of Latin and the other Italic Languages (Leiden Indo-European Etymological Dictionary Series; 7), Leiden, Boston: Brill, →ISBN, pages 137-8
Further reading
[edit]- “corpus”, in Charlton T. Lewis and Charles Short (1879), A Latin Dictionary, Oxford: Clarendon Press
- “corpus”, in Charlton T. Lewis (1891), An Elementary Latin Dictionary, New York: Harper & Brothers
- "corpus", in Charles du Fresne du Cange’s Glossarium Mediæ et Infimæ Latinitatis (augmented edition with additions by D. P. Carpenterius, Adelungius and others, edited by Léopold Favre, 1883–1887)
- “corpus”, in Gaffiot, Félix (1934), Dictionnaire illustré latin-français, Hachette.
- Carl Meißner; Henry William Auden (1894), Latin Phrase-Book[2], London: Macmillan and Co.
- to spread over the whole body: per totum corpus diffundi
- bodily strength: vires corporis or merely vires
- a good constitution: firma corporis constitutio or affectio
- sensual pleasure: voluptates (corporis)
- to refresh oneself, minister to one's bodily wants: corpus curare (cibo, vino, somno)
- to devote oneself body and soul to the good of the state: totum et animo et corpore in salutem rei publicae se conferre
- the free men are sold as slaves: libera corpora sub corona (hasta) veneunt (B. G. 3. 16. 4)
- wounds (scars) on the breast: vulnera adverso corpore accepta
- to spread over the whole body: per totum corpus diffundi
- “corpus”, in William Smith et al., editor (1890), A Dictionary of Greek and Roman Antiquities, London: William Wayte. G. E. Marindin
- Sihler, Andrew L. (1995), New Comparative Grammar of Greek and Latin, Oxford, New York: Oxford University Press, →ISBN
Anagrams
[edit]Portuguese
[edit]Alternative forms
[edit]Etymology
[edit]Unadapted borrowing from Latin corpus. Doublet of corpo and cós.
Pronunciation
[edit]
Noun
[edit]corpus m (plural corpora or (nonstandard) corpus)
- (linguistics) corpus (collection of writings)
Usage notes
[edit]This term is prescriptively spelled without an accent and italicized; the adapted spelling córpus is nonstandard.
Further reading
[edit]- “corpus”, in Michaelis Dicionário Brasileiro da Língua Portuguesa (in Portuguese), São Paulo: Editora Melhoramentos, 2015–2025
- “corpus”, in Dicionário infopédia da Lingua Portuguesa (in Portuguese), Porto: Porto Editora, 2003–2025
- “corpus”, in Dicionário da Língua Portuguesa (in Portuguese), Lisboa: Academia das Ciências de Lisboa, 2001–2025
- “corpus”, in Dicionário Priberam da Língua Portuguesa (in Portuguese), Lisbon: Priberam, 2008–2025
- “corpus”, in Dicionário Aulete Digital (in Portuguese), Rio de Janeiro: Lexikon Editora Digital, 2008–2025
Romanian
[edit]Etymology
[edit]Borrowed from Latin corpus. Doublet of corp.
Pronunciation
[edit]Noun
[edit]corpus n (plural corpusuri)
Declension
[edit]| singular | plural | ||||
|---|---|---|---|---|---|
| indefinite | definite | indefinite | definite | ||
| nominative-accusative | corpus | corpusul | corpusuri | corpusurile | |
| genitive-dative | corpus | corpusului | corpusuri | corpusurilor | |
| vocative | corpusule | corpusurilor | |||
Further reading
[edit]- “corpus”, in DEX online—Dicționare ale limbii române (Dictionaries of the Romanian language) (in Romanian), 2004–2025
Sardinian
[edit]Etymology
[edit]From Latin corpus, from Proto-Italic *korpos, from Proto-Indo-European *krépos ~ *krépesos, derived from the root *krep- (“body”). Compare English riff.
Pronunciation
[edit]Noun
[edit]corpus m (plural corpos)
- (anatomy) body (physical structure of a human or animal)
- tènnere unu corpus atlèticu ― to have an athletic body
- body (fleshly or corporeal nature of a human)
- body (any physical object or material thing)
- Cale si siat corpus est sugetu a sa fortza de gravidade ― Any body is subject to gravitational force
- body, corpse
- body (organisation, company or other authoritative group)
Spanish
[edit]Etymology
[edit]Borrowed from Latin corpus, possibly through the intermediate of English corpus, according to the RAE.[1] Doublet of the inherited cuerpo.
Pronunciation
[edit]Noun
[edit]corpus m (plural corpus)
- corpus (a collection of writings)
References
[edit]- ^ “corpus”, in Diccionario de la lengua española [Dictionary of the Spanish Language] (in Spanish), online version 23.8, Royal Spanish Academy [Spanish: Real Academia Española], 10 December 2024
Further reading
[edit]- “corpus”, in Diccionario de la lengua española [Dictionary of the Spanish Language] (in Spanish), online version 23.8, Royal Spanish Academy [Spanish: Real Academia Española], 10 December 2024
- English terms derived from Proto-Indo-European
- English terms derived from the Proto-Indo-European root *krep-
- English terms borrowed from Latin
- English terms derived from Latin
- English doublets
- English 2-syllable words
- English terms with IPA pronunciation
- English terms with audio pronunciation
- Rhymes:English/ɔː(ɹ)pəs
- Rhymes:English/ɔː(ɹ)pəs/2 syllables
- English lemmas
- English nouns
- English countable nouns
- English nouns with irregular plurals
- English terms with quotations
- en:Linguistics
- en:Physics
- English terms with usage examples
- English terms with uncommon senses
- English terms with archaic senses
- Basque terms derived from Latin
- Basque 2-syllable words
- Basque terms with IPA pronunciation
- Rhymes:Basque/orpus̺
- Rhymes:Basque/orpus̺/2 syllables
- Rhymes:Basque/us̺
- Rhymes:Basque/us̺/2 syllables
- Basque lemmas
- Basque nouns
- Basque terms spelled with C
- Basque inanimate nouns
- Catalan terms borrowed from Latin
- Catalan terms derived from Latin
- Catalan doublets
- Catalan terms with IPA pronunciation
- Catalan lemmas
- Catalan nouns
- Catalan countable nouns
- Catalan indeclinable nouns
- Catalan masculine nouns
- Dutch terms borrowed from Latin
- Dutch terms derived from Latin
- Dutch doublets
- Dutch terms with IPA pronunciation
- Dutch terms with audio pronunciation
- Dutch lemmas
- Dutch nouns
- Dutch irregular nouns
- Dutch nouns with plural in -en
- Dutch neuter nouns
- French terms borrowed from Latin
- French unadapted borrowings from Latin
- French terms derived from Latin
- French doublets
- French 2-syllable words
- French terms with IPA pronunciation
- French terms with audio pronunciation
- French lemmas
- French nouns
- French countable nouns
- French indeclinable nouns
- French masculine nouns
- fr:Linguistics
- Latin terms derived from the Proto-Indo-European root *krep-
- Latin terms derived from Proto-Italic
- Latin terms inherited from Proto-Italic
- Latin terms derived from Proto-Indo-European
- Latin terms inherited from Proto-Indo-European
- Latin 2-syllable words
- Latin terms with IPA pronunciation
- Latin lemmas
- Latin nouns
- Latin third declension nouns
- Latin neuter nouns in the third declension
- Latin neuter nouns
- la:Anatomy
- Medieval Latin
- Latin metonyms
- Latin terms with quotations
- New Latin
- la:Physics
- Latin words in Meissner and Auden's phrasebook
- Portuguese terms borrowed from Latin
- Portuguese unadapted borrowings from Latin
- Portuguese terms derived from Latin
- Portuguese doublets
- Portuguese 2-syllable words
- Portuguese terms with IPA pronunciation
- Rhymes:Portuguese/ɔʁpus
- Rhymes:Portuguese/ɔʁpus/2 syllables
- Rhymes:Portuguese/ɔɾpuʃ
- Rhymes:Portuguese/ɔɾpuʃ/2 syllables
- Portuguese terms with homophones
- Portuguese lemmas
- Portuguese nouns
- Portuguese countable nouns
- Portuguese nouns with multiple plurals
- Portuguese masculine nouns
- pt:Linguistics
- Romanian terms borrowed from Latin
- Romanian terms derived from Latin
- Romanian doublets
- Romanian terms with IPA pronunciation
- Romanian lemmas
- Romanian nouns
- Romanian countable nouns
- Romanian neuter nouns
- Sardinian terms derived from Latin
- Sardinian terms derived from Proto-Italic
- Sardinian terms derived from Proto-Indo-European
- Sardinian terms with IPA pronunciation
- Sardinian lemmas
- Sardinian nouns
- Sardinian masculine nouns
- sc:Anatomy
- Sardinian terms with usage examples
- Spanish terms borrowed from Latin
- Spanish terms derived from Latin
- Spanish terms derived from English
- Spanish doublets
- Spanish 2-syllable words
- Spanish terms with IPA pronunciation
- Rhymes:Spanish/oɾpus
- Rhymes:Spanish/oɾpus/2 syllables
- Spanish lemmas
- Spanish nouns
- Spanish countable nouns
- Spanish masculine nouns
