About: Hapax legomenon is a research topic. Over the lifetime, 135 publications have been published within this topic receiving 1052 citations. The topic is also known as: hapax.
TL;DR: The study of lexis: the groundwork Nodes and collocates Spans Frequency German ad hoc compounds Lexis and linguistic theory The interaction of lex is and grammar Semantic preference and semantic prosody Semantic reversal Conclusion.
Abstract: Lexis and Creativity in Translation: Contents Acknowledgements Introduction 1. Is 'linguistics' singular or plural? Introduction Chomskyan linguistics Chomsky and translation theory Firthian linguistics Firth and translation theory The postmodern critique of linguistics in translation studies Conclusion 2. The soft option: corpus linguistics Introduction Corpus linguistics Corpora: a brief history Corpora: users and uses Corpora and neo-Firthian linguistics Corpus processing Global statistics Word lists Keyword lists Clusters Concordancing Conclusion 3. Turning corpus linguistics on its head:corpus-based translation studies Introduction Descriptive translation studies Norms, universals, and laws of translation Corpora in translation studies Monolingual single and comparable corpora Parallel corpora Bilingual and multilingual comparable corpora Normalization in translation Advantages and limitations of corpora in translation studies Conclusion 4. A word about words Introduction The word 'word' Word formation Compounding Derivation Structural semantics Collocation Conflicting definitions of collocation Beginning the study of lexis: the groundwork Nodes and collocates Spans Frequency German ad hoc compounds Lexis and linguistic theory The interaction of lexis and grammar The idiom principle and the open-choice principle Semantic preference and semantic prosody Semantic reversal Conclusion 5. The how of it: creating and using a parallel corpus Introduction Issues in corpus compilation Representativeness Sampling strategies Random sampling vs stratified approaches Internal vs external criteria Text selection Full texts vs texts extracts The German-English Parallel Corpus of Literary Texts (GEPCOLT) Sampling frame and text selection Data capture, editing and mark-up Corpus alignment and bilingual concordancing Multiconcord Comparative data The Mannheim Corpora The British National Corpus Extracting instances of lexical creativity from GEPCOLT Hapax legomena Writer-specific forms Unusual collocations The node AUGE Clusters Evaluating the creativity of translations in GEPCOLT 140 Conclusion 6. Lonely words: creative hapax legomena and writer-specific forms Introduction Hapax Legomena Creative orthography Creative derivation Complex verbal nouns Compounds Wordplay Anaphoric relations Semantic preference Semantic prosody Creative imagery Co-ordinating and copulative compounds Summary statistics and discussion Writer-specific forms Conclusion 7. Two left eyes: creative collocations in GEPCOLT Introduction Exploitations of collocational norms Decomposed compounds Lexical cohesion Other unusual collocations Repeated idiosyncrasies Summary statistics and discussion Conclusion Appendix 1: Works included in the German-English Parallel Corpus of Literary Texts (GEPCOLT) Appendix 2: Sample Header Appendix 3: Creative Hapax Forms in the German Subcorpus of GEPCOLT and their Translations into English References Index
TL;DR: In this article, a corpus-based investigation of speech, thought and writing presentation in English narrative texts is presented, focusing on the use of clustering techniques to reveal semantic relations between words.
Abstract: Preface. I. CORPUS DEVELOPMENT: METHODS AND ISSUES. Kristine HASUND: Protecting the innocent: The issue of informants' anonymity in the COLT corpus Error! Bookmark not defined. Jukka KERANEN: The Corpus of early English correspondence: Progress report. II. CORPUS ANALYSIS: SYNCHRONIC AND DIACHRONIC STUDIES. Gunnar BERGH, Aimo SEPPANEN & Joe TROTTA: Language corpora and the Internet: A joint linguistic resource. Pieter de HAAN: How 'native-like' are advanced learners of English? Christine JOHANSSON and Christer GEISLER: Pied Piping in Spoken English. Goran KJELLMER: Why is nrather dlifficult rather difficult? On changes in English initial consonant clusters. Magnus LEVIN: On concord with collective nouns in English. Magnus LJUNG: 'It is believed that he is dead'. Unspecified source attribution in news texts. Christian MAIR: Man/woman which... - Last of the old, or first of the new? Manfred MARKUS: A-adjectives (asleep etc.) in postnominal position: Etymology as a cause of word order (corpus-based). Minna NEVALA: By him that loves you: Address forms in letters written to 16th-century social aspirers. Arja NURMI: Periphrastic DO and the language of social aspirers: Evidence from the Corpus of Early English Correspondence. Nelleke OOSTDIJK: Language use in a restricted domain. Antoinette RENOUF and R. Harald BAAYEN: Aviating among the hapax legomena: Morphological grammaticalisation in current British newspaper English. Hakan RINGBOM: High-frequency verbs in the ICLE Corpus. Andrea SAND: First findings from ICE-Jamaica: The verb phrase. Anne WICHMANN: Using intonation to create conversational space: Projecting topics and turns. Martin WYNNE, Mick SHORT and Elena SEMINO: A corpus-based investigation of speech, thought and writing presentation in English narrative texts. III. CORPUS LINGUISTICS RESULTS: CREATION OF RESOURCES AND TOOLS. Bas AARTS, Gerald NELSON and Justin BUCKLEY: The Internet grammar of English: New horizons in grammar pedagogy. Alex COLLIER: Identifying diachronic change in semantic relations. Mike PACEY: The use of clustering techniques to reveal semantic relations between words. Pam PETERS: In quest of international English: Mapping the levels of regional divergence.
TL;DR: Three methods of measuring productivity of word formation are proposed in this text: productivity tests (open-ended and judgment tasks), dictionary comparison, and the ratio of "hapax legomena" to tokens in corpora.
Abstract: Morphological productivity is the likelihood of a morphological pattern being used or comprehended in new word formation. Three methods of measuring productivity of word formation are proposed in this text: productivity tests (open-ended and judgment tasks), dictionary comparison (newer with older dictionaries, supplements with earlier versions), and the ratio of "hapax legomena" to tokens in corpora. Processes which score highly by all three criteria can be regarded as productive. The model is examined in light of data from Israeli Hebrew, which as a Semitic language offers a rich array of discontinuous and linear derivation patterns. The Hebrew data also support the claims that in essence, lexical formation is semantically based; that it is constrained by a requirement for distinctiveness; and that it may vary significantly with the type of derivation base.
TL;DR: This paper proposes an empirical method of description of lexical richness byaveraging measures on multiple chunks of text of a standard length within a literary work or corpus, and reveals the possibility of significant variance of these measures of vocabulary among works of a single authorss corpus.
Abstract: The measure of the lexical richness of literary texts as a tool in thecomparative analysis of literary style has been hampered by the problem ofthe inequality of text lengths within and between literary corpora Thispaper proposes an empirical method of description of lexical richness byaveraging measures on multiple chunks of text of a standard lengthwithin a literary work or corpus A workss average vocabulary richness,average portion of hapax legomenaof the corpus from which it derives,and average repetition of frequently appearing vocabulary may thencharacterize that work relative to other works partitioned along withit This method reveals the possibility of significant variance of thesemeasures of vocabulary among works of a single authorss corpus and warnsagainst the notion of some absolute authorial stylistic character Weapply this method of vocabulary averaging to the corpora of threeplaywrights from classical antiquity whose works are chronologicallyrankable: Euripides, Aristophanes, and Terence We look for trends in vocabulary richness over time, which we posit functions as anindicator of progressively changing authorial ability or inclination This method then holds the potential of predicting datesfor undateable or tenuously dated works within a corpus of otherwisesecurely dated texts From the results derived, a relatively late date forthe composition of the redrafted version ofAristophaness Clouds appearslikely; we predict an early composition date for the redraft of TerencessHecyra (and thus are inclined to think that the playwright did verylittle redrafting); and finally we findEuripidess Electra andSupplices exhibiting vocabulary characteristics of extremely latecomposition and we predict dates much later than those assigned based onmetrical considerations