Journal Issue10.1002/ASI.V59:9
A balanced approach to health information evaluation: A vocabulary-based naïve Bayes classifier and readability formulas
TL;DR: A vocabularly-based, naive Bayes classifier to distinguish between three difficulty levels in text is developed, indicating that vocabulary usage is frequently appropriate in text considered too difficult by readability formula evaluations.
read more
Abstract: Since millions seek health information online, it is vital for this information to be comprehensible. Most studies use readability formulas, which ignore vocabulary, and conclude that online health information is too difficult. We developed a vocabularly-based, naive Bayes classifier to distinguish between three difficulty levels in text. It proved 98% accurate in a 250-document evaluation. We compared our classifier with readability formulas for 90 new documents with different origins and asked representative human evaluators, an expert and a consumer, to judge each document. Average readability grade levels for educational and commercial pages was 10th grade or higher, too difficult according to current literature. In contrast, the classifier showed that 70-90% of these pages were written at an intermediate, appropriate level indicating that vocabulary usage is frequently appropriate in text considered too difficult by readability formula evaluations. The expert considered the pages more difficult for a consumer than the consumer did. © 2008 Wiley Periodicals, Inc.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Measuring a journal's input rhythm based on its publication–reference matrix
TL;DR: This article creates a journal's publication–reference matrix (p–r matrix) and defines the rR′ indicator, which is used to measure the so-called input rhythm of a journal, and presents and analyzes the input rhythms of the Journal of the American Society for Information Science and Technology and of theJournal of Documentation.
6
•Proceedings Article
$N$-gram Fragment Sequence Based Unsupervised Domain-Specific Document Readability
Shoaib Jameel,Xiaojun Qian,Wai Lam +2 more
- 01 Dec 2012
TL;DR: This work develops a novel unsupervised framework for computing domain-specific document readability and investigates an automatic sequential n-gram determination scheme that aids in capturing appropriate n- gram fragments which are semantically associated with the document’s theme and cohesive with the context.
6
•Proceedings Article
LEARNING ABOUT SOCIAL ACCOUNTING IN THE KNOWLEDGE SOCIETY - A wiki-Webquest
Carmen-Pilar Martí-Ballester,Soledad Moya-Gutiérrez,Diego Prior-Jiménez +2 more
- 01 Jan 2009
TL;DR: This paper examines how new communication and collaboration technologies can influence university students learning process for the area of accounting and specifically for social accounting.
6
Institutional Reflections on Organizational Corruption Control: The Case of FIFA
TL;DR: In this paper, the authors apply and expand on a typology of organizational corruption control to analyze the various mechanisms used to address corruption within the Fédération Internationale de Football Association (FIFA).
6
Development and evaluation of a multifaceted magazine image categorization model
TL;DR: The model is a practical categorization tool that may be used in workplaces, such as magazine editorial offices, and may also serve to guide the development of computational methods for image understanding, selection of concepts for automatic detection, and approaches to support browsing and exploratory image search.
6
References
A Coefficient of agreement for nominal Scales
TL;DR: In this article, the authors present a procedure for having two or more judges independently categorize a sample of units and determine the degree, significance, and significance of the units. But they do not discuss the extent to which these judgments are reproducible, i.e., reliable.
Emergence of Scaling in Random Networks
TL;DR: A model based on these two ingredients reproduces the observed stationary scale-free distributions, which indicates that the development of large networks is governed by robust self-organizing phenomena that go beyond the particulars of the individual systems.
39.1K
•Book
The Structure of Scientific Revolutions
Thomas S. Kuhn
- 01 Jan 1962
TL;DR: The Structure of Scientific Revolutions as discussed by the authors is a seminal work in the history of science and philosophy of science, and it has been widely cited as a major source of inspiration for the present generation of scientists.
36.9K
•Book
Nonparametric statistics for the behavioral sciences
Sidney Siegel
- 01 Jan 1956
TL;DR: This is the revision of the classic text in the field, adding two new chapters and thoroughly updating all others as discussed by the authors, and the original structure is retained, and the book continues to serve as a combined text/reference.
36.3K
•Book
Judgment Under Uncertainty: Heuristics and Biases
Amos Tversky,Daniel Kahneman +1 more
- 01 Jan 1974
TL;DR: The authors described three heuristics that are employed in making judgements under uncertainty: representativeness, availability of instances or scenarios, and adjustment from an anchor, which is usually employed in numerical prediction when a relevant value is available.