Journal Article10.1016/j.acorp.2021.100012
Measuring the frequency of the academic formulas list across corpora: A case study based in TED talks and Yale lectures
1
TL;DR: The authors measured the frequency of occurrence of the Academic Formulas List (AFL; Simpson-Vlach and Ellis, 2010) across academic lectures (OYCLC) and an academic-adjacent corpus of TED talks (TTC).
read more
Abstract: Measuring lists of lexis across corpora is a well-established method in corpus linguistics. This article takes a novel approach and measures the frequency of occurrence of the Academic Formulas List (AFL; Simpson-Vlach and Ellis, 2010) across academic lectures (OYCLC) and an academic-adjacent corpus of TED talks (TTC). Frequency of occurrence is measured at three levels: overall inter- and intra-corpus variation; the composition of representation, to see which formulas are represented; and an investigation of the behaviour of formulas within texts. The corpora were found to be significantly different from each other in terms of overall representation with a medium effect size. The greatest difference concerned referential expressions and the smallest difference concerned stance expressions. In terms of intra-corpus variation the AFL was found to occur less often in the humanities and most often in the natural sciences for both corpora. The composition of coverage revealed Zipfian distributions for the AFL, with both corpora presenting a similar set of high frequency formulas within each group category. A combined ratio and minimum frequency measure identified salient formulas to each corpus. Concerning formula behaviour, differences were found between the corpora concerning the use of the same formulas. Pedagogic and methodological implications are discussed in the conclusion.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Collocations, Corpora and Language Learning
Paweł Szudarski
- 23 Jun 2023
TL;DR: The authors provide a systematic overview and synthesis of corpus-based research into collocations focusing on the learning and use of collocations by second language (L2) users, highlighting the importance of collocation as a key notion within the field of corpus linguistics.
8
References
A New Academic Word List
TL;DR: The AWL contains 570 word families that account for approximately 10.0% of the total words (tokens) in academic texts but only 1.4% of total words in a fiction collection of the same size.
2.7K
An Academic Formulas List: New Methods in Phraseology Research
Rita Simpson-Vlach,Nick C. Ellis +1 more
TL;DR: The Academic Formulas List (AFL) as discussed by the authors is an empirically derived, pedagogically useful list of formulaic sequences for academic speech and writing, comparable with the Academic Word List (Coxhead 2000), called the AFL.
Is There an "Academic Vocabulary"?
Ken Hyland,Polly Tse +1 more
TL;DR: This article examined the value of the term by using Coxhead's (2000) Academic Word List (AWL) to explore the distribution of its 570 word families in a corpus of 3.3 million words from a range of academic disciplines and genres.
680
A New Academic Vocabulary List
Dee Gardner,Mark Davies +1 more
TL;DR: The Academic Vocabulary List (AVL) as discussed by the authors is derived from a 120-million-word academic sub-corpus of the 425-millionword Corpus of Contemporary American English (COCA) and is used in English language education.
679
The Processing of Formulaic Language
Kathy Conklin,Norbert Schmitt +1 more
TL;DR: The research surveyed in this chapter strongly supports the position that there is an advantage in the way that native speakers process formulaic language compared to nonformulaic language.