Journal Article10.1515/CLLT-2015-0030
Log-likelihood and odds ratio: Keyness statistics for different purposes of keyword analysis
120
TL;DR: This study compares the use of log-likelihood (LL), a probability statistic, and odds ratio (OR), an effect size statistic, for keyword identification and argues that the two methods produce different keywords applicable to research focusing on different purposes.
read more
Abstract: Abstract Keyword analysis is used in a range of sub-disciplines of applied linguistics from genre analyses to critically-oriented studies for different purposes ranging from producing a general characterization of a genre to identifying text-specific ideological issues. This study compares the use of log-likelihood (LL), a probability statistic, and odds ratio (OR), an effect size statistic, for keyword identification and argues that the two methods produce different keywords applicable to research focusing on different purposes. Through two case studies, keyword analyses of advance fee scams against the British National Corpus and research articles in applied linguistics against research articles from other academic disciplines, we show that both the LL and OR keywords concern the aboutness of the corpus, but differ in their specificity and pervasiveness through the corpus. LL highlights words which are relatively common in general use serving genre purposes, whereas OR highlights more specialized words serving critically-oriented purposes. Methodological and practical contributions to keyword analysis are discussed.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Using corpus methods to analyze modal verbs in government science communication on Twitter
TL;DR: In this paper , a corpus-based approach is presented to analyze how modal verbs convey scientific information to the public on Twitter, and the results showed that the frequent use of the modal verb can by all three SGAs suggests an emphasis on the ability or permission of specific agents to take or complete actions without conveying a sense of obligation or necessity.
1
Finding social (mis)alignment in older adult and opioid health policy implementation with corpus-assisted discourse analysis
TL;DR: In this paper , the authors compare linguistic features, keywords and collocations between policy texts and agents' talk to find a complex, socially mediated relationship between priorities and stances in official documents and the enacting agents, especially regarding the causes and effects of the opioid epidemic.
1
The Key to the City: Using Digital Tools to Understand Tablet Provenience
Sara Brumfield
- 26 May 2019
TL;DR: Text mining, the practice of deriving information from blocks of text using pattern recognition or trend analysis, has already been applied to corpora ranging from Shakespeare to Twitter and has significant potential for revealing new levels of data in cuneiform texts.
Framing the stateless children in sabah: an examination through corpus analysis
Daron Benjamin Loo,Linda Lagason +1 more
TL;DR: In this article , the framing of stateless children in Sabah by conducting a corpus analysis of news articles published online in 2019 was analyzed using AntConc, and five keywords were identified based on their keyness level, which was determined through a comparison with the top 5,000 words from the Corpus of Contemporary American English.
Panning for gold: Comparative analysis of cross-platform approaches for automated detection of political content in textual data
Mykola Makhortykh,Ernesto de León,Aleksandra Urman,Teresa Gil‐López,Clara Christner,Maryna Sydorova,Silke Adam,Michaela Maier +7 more
TL;DR: This study compares automated content analysis techniques for detecting German-language political content across platforms, evaluating dictionary, supervised machine learning, and deep learning methods on three validation datasets with varying levels of noise.
1
References
Categorical Data Analysis
TL;DR: In this article, categorical data analysis was used for categorical classification of categorical categorical datasets.Categorical Data Analysis, categorical Data analysis, CDA, CPDA, CDSA
15.1K
•Book
An introduction to categorical data analysis
Alan Agresti
- 01 Jan 1990
TL;DR: In this paper, the authors present a tour of categorical data analysis for Contingency Tables and Logit and Loglinear models for contingency tables, as well as generalized linear models for Matched Pairs.
7.9K
Categorical data analysis
TL;DR: In this article, the authors present a generalized linear model for categorical data, which is based on the Logit model, and use it to fit Logistic Regression models.
5.8K
•Journal Article
Accurate methods for the statistics of surprise and coincidence
TL;DR: The basis of a measure based on likelihood ratios that can be applied to the analysis of text is described, and in cases where traditional contingency table methods work well, the likelihood ratio tests described here are nearly identical.
•Book
Research Methods in Applied Linguistics
Zoltán Dörnyei
- 29 Dec 2007
TL;DR: In this article, a very practical and accessible book that offers a comprehensive overview of research methodology in applied linguistics by describing the various stages of qualitative and quantitative investigations, from collecting the data to reporting the results.
2.1K