Measurement in STEM education research: a systematic literature review of trends in the psychometric evidence of scales

doi:10.1186/s40594-023-00430-x

Open Access•Journal Article•10.1186/s40594-023-00430-x

Danka Maric, +3 more

- 02 Jun 2023

- International Journal of STEM Education

- Vol. 10, Iss: 1

12

TL;DR: In this article, the authors identify characteristics, trends, and gaps in measurement in Science, Technology, Engineering, and Mathematics (STEM) education research, focusing on the psychometric development of scales developed on college/university students for the context of post-secondary STEM education.

Abstract:

Background: The objective of this systematic review is to identify characteristics, trends, and gaps in measurement in Science, Technology, Engineering, and Mathematics (STEM) education research.

Methods: We searched across several peer-reviewed sources, including a book, similar systematic reviews, conference proceedings, one online repository, and four databases that index the major STEM education research journals. We included empirical studies that reported on the psychometric development of scales developed on college/university students for the context of post-secondary STEM education in the US. We excluded studies examining scales that ask about specific content knowledge and contain fewer than three items. Results were synthesized using descriptive statistics.

Results: Our final sample included N = 82 scales across N = 72 studies. The majority of participants in the sampled studies were female and White, most scales were developed in an unspecified STEM/science and engineering context, and the most frequently measured construct was attitudes. Internal structure validity emerged as the most prominent validity evidence, with exploratory factor analysis (EFA) and confirmatory factor analysis (CFA) being the most common. Reliability evidence was dominated by internal consistency evidence in the form of Cronbach’s alpha, with other forms being scarcely reported, if at all.

Discussion: Limitations include focusing only on scales developed in the United States and in post-secondary contexts, which limits the scope of the systematic review. Our findings demonstrate that when developing scales for STEM education research, many types of psychometric properties, such as differential item functioning, test–retest reliability, and discriminant validity, are scarcely reported. Furthermore, many scales report only internal structure validity (EFA and/or CFA) and Cronbach’s alpha, which alone are not sufficient evidence. We encourage researchers to look towards the full spectrum of psychometric evidence both when choosing scales to use and when developing their own. While constructs such as attitudes and disciplines such as engineering were dominant in our sample, future work can fill in the gaps by developing scales for disciplines such as geosciences and examining constructs such as engagement, self-efficacy, and perceived fit.
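Since the abstract identifies Cronbach’s alpha as the dominant form of reliability evidence, a minimal sketch of how that coefficient is commonly computed may help readers unfamiliar with it. The function name `cronbach_alpha` and the response data are hypothetical and are not taken from the paper; the sketch assumes the standard formula alpha = k/(k-1) * (1 - sum of item variances / variance of total scores), with sample variances.

```python
from statistics import variance


def cronbach_alpha(responses):
    """Cronbach's alpha for a list of respondent rows, each holding k item scores.

    Hypothetical helper for illustration; uses sample variances (n - 1).
    """
    if len(responses) < 2:
        raise ValueError("need at least two respondents")
    k = len(responses[0])
    if k < 2:
        raise ValueError("need at least two items")
    if any(len(row) != k for row in responses):
        raise ValueError("every respondent must answer the same number of items")

    # Variance of each item (column) across respondents.
    item_variances = [variance(column) for column in zip(*responses)]
    # Variance of each respondent's summed scale score.
    total_variance = variance([sum(row) for row in responses])
    if total_variance == 0:
        raise ValueError("total scores have zero variance; alpha is undefined")

    return (k / (k - 1)) * (1 - sum(item_variances) / total_variance)


# Made-up responses from five respondents to a three-item scale.
example = [
    [2, 3, 3],
    [4, 4, 5],
    [1, 2, 2],
    [5, 5, 4],
    [3, 3, 4],
]
print(round(cronbach_alpha(example), 3))
```

A high value here only indicates that the items covary strongly. As the abstract argues, it does not by itself establish test–retest reliability, discriminant validity, or the absence of differential item functioning.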

Citations

Couplet scoring for research based assessment instruments

Michael Vignal, +3 more

- 06 Jul 2023

TL;DR: Couplet scoring, as discussed by the authors, uses the couplet as an alternative unit of assessment, where a couplet is essentially an item viewed and scored through the lens of a specific assessment objective (AO).

2

Journal Article•10.1021/acs.jchemed.4c00258

Evaluation of the Open-Ended Green Chemistry Generic Comparison (GC)² Prompt for Probing Student Conceptions about the Greenness of a Chemical Reaction

Krystal Grieger, +1 more

- 27 Jun 2024

- Journal of Chemical Education

TL;DR: This study evaluates the Green Chemistry Generic Comparison (GC)² prompt's ability to assess student conceptions about green chemistry, finding it sensitive for detecting knowledge gains and suitable for measuring student understanding of green chemistry principles.

1

Journal Article•10.20527/jipf.v8i1.9042

STEM-Based Science E-Module: Is It Effective to Improve Students' Creative Thinking Skills?

Wulan Octi Pratiwi, +2 more

- 04 Mar 2024

- Jurnal ilmiah pendidikan fisika

TL;DR: STEM-based e-modules are effective in improving elementary school students' creative thinking skills about electrical energy.

1

Journal Article•10.20511/pyr2023.v11n3.1868

Propiedades psicométricas de las escalas de competencias investigativas: una revisión sistemática

Calixto Tapullima-Mori, +6 more

- 31 Dec 2023

- Propósitos y Representaciones

TL;DR: This systematic review examines the psychometric properties of 11 investigative competence scales published between 2014 and 2023, finding adequate factorial validity and high internal consistency, suggesting their effectiveness in evaluating and developing research skills in university students.

1

Book Chapter•10.4018/978-1-6684-7813-4.ch008

Designing and Implementing a Globally Focused Interdisciplinary STEM Program

Moe Debbagh Greene, +2 more

- 12 Apr 2024

- Advances in educational technologies and...

TL;DR: Designing and implementing a globally-focused interdisciplinary STEM program in preservice teacher education emphasizes collaborative learning, experiential learning, and leveraging instructional technology to enhance student learning.

References

Journal Article•10.1187/CBE.18-04-0064

One Size Doesn’t Fit All: Using Factor Analysis to Gather Validity Evidence When Using Surveys in Your Research

Eva Knekta, +2 more

- 01 Mar 2019

- CBE- Life Sciences Education

TL;DR: This article reviews the aspects of validity that researchers should consider when using surveys and focuses on factor analysis, a statistical method that can be used to collect an important type of validity evidence.

453

Journal Article•10.1002/SCE.21522

What are we talking about when we talk about STEM education? A review of literature

Tobías Martín-Páez, +3 more

- 01 Jul 2019

- Science Education

404

Journal•10.18260/3-1-1153

Advances in Engineering Education

25 Oct 2022

352

Journal Article•10.3389/FPSYG.2015.01064

Editorial: Measurement Invariance.

Rens van de Schoot, +8 more

- 28 Jul 2015

- Frontiers in Psychology

TL;DR: The first formal treatment of the different forms of measurement invariance (MI) and their consequences for the validity of multi-group and multi-time comparisons is attributable to Meredith (1993); a more recent book by Millsap (2011) provides a general systematic treatment of the topic.

293

Journal Article•10.1002/JEE.20121

Measuring Undergraduate Students' Engineering Self‐Efficacy: A Validation Study

Natasha A. Mamaril, +4 more

- 01 Apr 2016

- Journal of Engineering Education

TL;DR: In this article, the authors evaluated the factor structure, validity, and reliability of general and skill-specific engineering self-efficacy measures created for use with undergraduate engineering students, and found evidence for the reliability, validity, and predictive utility of the engineering self-efficacy scales.

249