Comparing Linear Equating and Equipercentile Equating Methods Using Random Groups Design (Pages:227-241)

Author :  

Year-Number: 2013-Volume 5, Issue 1
Language : null
Konu : null

Abstract

In this study, raw scores which taken from 9th grade 2009 ÖBBS Form C of Social Sciences were equated to 2009 ÖBBS Form A of Social Sciences with linear equating and three different (unsmoothed, presmoothed and postsmoothed) equipercentile equating methods. The random equating error of different equating methods was compared by Root Mean Squared Difference (RMSD) index and mean of bootstrap standard errors (MBSE). The results indicated that there was a linear relationship between equivalent scores of Form A and raw score of Form C, and Form C was easier than Form A along the score scale. It was seen that results based different equipercentile equating method were consistent with each other, there was a nonlinear relationship between them and test difficulty varied along the score scale. Finally, although the least MBSE and RMSD coefficients were got from linear equating because of nonlinear relationship between raw scores and equivalent scores, concluded that the most appropriate equating method was postsmoothed equipercentile equating which had relatively the lowest random equating error among equipercentile equating methods.

Keywords

Abstract

In this study, raw scores which taken from 9th grade 2009 ÖBBS Form C of Social Sciences were equated to 2009 ÖBBS Form A of Social Sciences with linear equating and three different (unsmoothed, presmoothed and postsmoothed) equipercentile equating methods. The random equating error of different equating methods was compared by Root Mean Squared Difference (RMSD) index and mean of bootstrap standard errors (MBSE). The results indicated that there was a linear relationship between equivalent scores of Form A and raw score of Form C, and Form C was easier than Form A along the score scale. It was seen that results based different equipercentile equating method were consistent with each other, there was a nonlinear relationship between them and test difficulty varied along the score scale. Finally, although the least MBSE and RMSD coefficients were got from linear equating because of nonlinear relationship between raw scores and equivalent scores, concluded that the most appropriate equating method was postsmoothed equipercentile equating which had relatively the lowest random equating error among equipercentile equating methods.

Keywords


  • Akhun, İ. (1984). İki korelasyon katsayısı arasındaki manidarlığın test edilmesi. Ankara Üniversitesi Eğitim Fakültesi Dergisi. 17, 1-7.

  • Angoff, W. H. (1984). Scales, norms and equivalent scores. New Jersey: Educational Testing Service.

  • Baykul, Y. (1996). İstatistik: Metodlar ve uygulamalar (3. Baskı). Ankara: Anı Yayıncılık.

  • Braun, H. I, & Holland, P. W. (1982). Observed- score test equating: A mathematical analysis of some ETS equating procedures. In P. W. Holland&D.B. Rubin (Eds.), Test equating (pp. 9-49). New York: Academic Press.

  • Butler, O. D., & Hanson, B. A. (1997). Examination of presmoothing and postsmoothing methods in equating a direct writing asssessment. Reports-Evaluative. http://www.eric.ed.gov/PDFS/ED412239.pdf

  • Büyüköztürk, Ş. (2007). Sosyal bilimler için veri analizi el kitabı (8. Baskı). Ankara: Pegem A Yayıncılık.

  • Crocker, L.,& Algina, J. (1986). Introduction to classical&modern test theory. New York: Harcourt Brace Jovanovich College Publishers.

  • Dorans, J. N., & Holland, P. W. (2000). Population invariance and the equitability of tests: Basic theory and the linear case. Journal of Measurement, 37, 281-306.

  • Eğitim, Araştırma ve Geliştirmesi Daire Başkanlığı (EARGED). (2010). Ortaöğretim ÖBBS raporu 2009. Ankara, Milli Eğitim Bakanlığı.

  • Hanson, B. (2004). Equating Error: A Program for Computing Equating Error Using the Bootstrap (Version 2.0) [ ]. Iowa, Lindquist Center S.

  • Hanson, B. A., Zeng, L., & Colton, D. (1994). A comprison of presmoothing and postsmoothing methods in equipercentile equating. ACT Report Series, 94-4, American College Testing: Iowa City.

  • Jaeger, R. M. (1981). Some explatory indices for selection of a test equating method. Journal of Educational Measurement, 18(1), 23-38.

  • Kolen, M. J. (1984). Effectiveness of analytic smoothing in equipercentile equating. Journal of Educational Statistics, 9(1), 24-44.

  • Kolen, M. J. (1988). An NCME instructional module on traditional equating methodology. Educational Measurement: Issues and Practice, 7, 29-36.

  • Kolen, M. J. (1991). Smoothing methods for estimating test score distributions. Journal of Educational Measurement, 28(3), 257-282.

  • Kolen, M. J., & Brennan, R. L. (1995). Test equating methods and practices. New York: Springer.

  • Kolen, M. J., & Brennan, R. L. (2004). Test equating, scaling, and linking: Methods and practices (2nd. ed.). New York: Springer.

  • Kolen, M.J.,& Whitney, D. R. (1982). Comparison of four procedures for equating the tests general educational development. Journal of Educational Measurement, 19(4), 279-293.

  • Liu, C. (2011). A comparison of statistics for selecting smoothing paramaters for loglinear presmoothing and cubic spline post smoothing under a random groups design (Unpublished doctoral dissertation). Available from Iowa Research Online. (UMI No. 1013).

  • Livingston, S. A. (1993). Small-sample equating with log-linear smoothing. Journal of Educational

  • Livingston, S. A. (2004). Equating test scores (Without IRT). Educational Testing Service.

  • Moses, T., & Holland, P. (2007). Kernel and traditional equipercentile equating with degrees of presmoothing (ETS Research Rep. No. RR-07-15). Princeton, NJ: ETS.

  • Petersen, N. S. (2007). Equating: best practices and challenges to best practices. N.J. Dorans, M.Pommerich &

  • Skaggs, G. (2005). Accuracy of random groups equating with very small samples. Journal of Educational Measurement, 42 (4), 309-330.

  • Woldbeck, T. (1998, April). Basic concepts in modern methods of test equating. Paper presented at the annual meeting of the Southwest Psychological Association, New Orleans.

  • Zhu, W. (1998). Test equating: What, why, how? Research Quarterly for Exercise and Sport, 69(1), 11-23.

                                                                                                                                                                                                        
  • Article Statistics