Exploring educational standard alignment: in search of 'relevance'
Source
International Conference on Digital Libraries
Proceedings of the 8th ACM/IEEE-CS joint conference on Digital libraries
Pittsburgh, PA, USA
SESSION: Education
Pages 57-65
Year of Publication: 2008
ISBN: 978-1-59593-998-2
Authors
René Reitsma, Oregon State University, Corvallis, OR, USA
Byron Marshall, Oregon State University, Corvallis, OR, USA
Michael Dalton, Oregon State University, Corvallis, OR, USA
Martha Cyr, Worcester Polytechnic Institute, Worcester, MA, USA
Bibliometrics
Downloads (6 Weeks): n/a, Downloads (12 Months): n/a, Citation Count: 2
ABSTRACT
The growing availability of online K-12 curriculum is increasing the need for meaningful alignment of this curriculum with state-specific standards. Promising automated and semi-automated alignment tools have recently become available. Unfortunately, recent alignment evaluation studies report low inter-rater reliability, e.g., 32% with two raters and 35 documents. While these results are in line with studies in other domains, low reliability makes it difficult to accurately train automatic systems and complicates comparison of different services. We propose that inter-rater reliability of broadly defined, abstract concepts such as 'alignment' or 'relevance' must be expected to be low due to the real-world complexity of teaching and the multidimensional nature of the curricular documents. Hence, we suggest decomposing these concepts into less abstract, more precise measures anchored in the daily practice of teaching. This article reports on the integration of automatic alignment results into the interface of the Teach Engineering collection and on an evaluation methodology intended to produce more consistent document relevance ratings. Our results (based on 14 raters × 6 documents) show high inter-rater reliability (61-95%) on less abstract relevance dimensions while scores on the overall 'relevance' concept are (as expected) lower (64%). Despite a relatively small sample size, regression analysis of our data resulted in an explanatory (R² = .75) and statistically stable (p-values < .05) model for overall relevance as indicated by matching concepts, related background material, adaptability to grade level, and anticipated usefulness of exercises. Our results suggest that more detailed relevance evaluation which includes several dimensions of relevance would produce better data for comparing and training alignment tools.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
2 Devaul, H., Diekema, A. R., Ostwald, J. (2007) Computer-assisted Assignment of Educational Standards Using Natural Language Processing. Paper presented at the Annual Meeting of the National Science Digital Library (NSDL). Arlington, VA.
4 Gateway (2007). NSDL:ASN Achievement Standards Network. Available: http://www.thegateway.org/asn.
5 Greenwood, J. D. (1982) On the Relation Between Laboratory Experiments and Social Behavior: Causal Explanation and Generalization. Journal for the Theory of Social Behavior. 12. 225-250.
7 Lagoze, C., Kraft, D. B., Payette, S., Jesuroga, S. (2005) What Is a Digital Library Anymore, Anyway? Beyond Search and Access in the NSDL. D-Lib Magazine. 11.
8 Lazarsfeld, P. F. (1954) A Conceptual Introduction to Latent Structure Analysis. In: Lazarsfeld, P. F. (Ed.) Mathematical Thinking in the Social Sciences. Free Press. New York. NY.
9 Levins, R., Lewontin, R. C. (1980) Dialectics and Reductionism in Ecology. Synthese. 43. 47-78.
11 Noble, W. G. (1981) Gibsonian Theory and the Pragmatist Perspective. Journal for the Theory of Social Behavior. 11. 65-85.
12 Saracevic, T. (1975) Relevance: A Review of the Literature and a Framework for Thinking on the Notion in Information Science. Journal of the American Society for Information Science. 26(6). 321-343.
15 Van der Smagt (1985) Definieren en Relateren in Sociaal Wetenschappelijk Onderzoek. (Definitions and Relations in Social Science Research). Dissertation. University of Nijmegen, The Netherlands.
CITED BY 2
Steven Bethard, Philipp Wetzer, Kirsten Butcher, James H. Martin, Tamara Sumner, Automatically characterizing resource quality for educational digital libraries, Proceedings of the 9th ACM/IEEE-CS joint conference on Digital libraries, June 15-19, 2009, Austin, TX, USA
INDEX TERMS
Primary Classification: K. Computing Milieux; K.3 COMPUTERS AND EDUCATION; K.3.1 Computer Uses in Education
Subjects: Computer-assisted instruction (CAI)
General Terms: Design, Experimentation, Human Factors, Measurement, Reliability, Theory
Keywords: context-specific measurement, curriculum-standard alignment, digital library, inter-rater reliability, reification, relevance, social science theory