educational system, but present tests have important limitations. The computer is a very general tool for constructing environments which may improve the quality of assessment and the kind of information available to students, teachers, parents, society and school personnel. We should invest in a serious effort to design and develop computer-based assessment systems that reflect the educational goals we value.
Birenbaum, M., & Tatsuoka, K. K. ( 1987). "Openended versus multiple-choice response formats It does make a difference for diagnostic purposes". Applied Psychological Measurement, 11, 385-395.
Brown, J. S., & Burton, R. B. ( 1978). "Diagnostic models for procedural bugs in basic mathematical skills". Cognitive Science, 2, 155-192.
Brown, J. S., & VanLehn, K. ( 1980). "Repair theory: A generative theory of bugs in procedure skills". Cognitive Science, 4, 379-426.
Bunderson, R. V., & Forehand, G. ( 1989). ETS internal document. Princeton, NJ: Educational Testing Service.
Bunderson, C., Inouye, D. K., & Olsen, J. B. ( 1989). "The four generations of computerized educational measurement". In R. L. Linn (Ed.), Educational Measurement, 3rd edition. New York: American Council on Education/Macmillan.
Davis, R. B. ( 1984). "Learning mathematics. The cognitive science approach to mathematics education". Norwood, NJ: Ablex.
di A. A. Sessa ( 1982, January-March). "Unlearning aristotelian physics: A study of knowledge- based learning". Cognitive Science, 6, (1), 37-75.
Erlwanger, S. H. ( 1973, Autumn). "Benny's conception of rules and answers in IPI mathematics". Journal of Children's Mathematics Behavior, 1, ( 2), 7-26.
Frederiksen, N. ( 1994, March). "The real test bias: influences of testing on teaching and learning", American Psychologist, 193-202.
Gitomer, D. H., & Yamamoto, K. ( 1989, April). Using embedded cognitive task analysis in assessment. Paper presented at the annual meeting of the American Educational Research Association, San Francisco.
Hirsch, E. D., Jr. ( 1987). Cultural literacy: What every American needs to know. Boston: Houghton Mifflin.
Hoffman, B. ( 1962). The tyranny of testing. New York: Crowell-Collier.
Holden, C. ( 1989, May). "Computers make slow progress in class". Science, 244, 906.
Lipson, J. ( 1988). Testing in the service of learning: Learning assessment systems that promote educational excellence and equality. Assessment in the service of learning. Proceedings of the 1967 invitational conference. Princeton, NJ: Educational Testing Service.
Lundeberge, M. A., & Fox, P. W. ( 1989, March). Integrating laboratory and classroom findings on test epectancy. Paper presented at the annual meeting of the American Educational Research Association, San Francisco, CA.
Masters, G. N., & Mislevy, R. J. ( 1988). New views of student learning: implications for educational measurement. Unpublished memo.
McCloskey, M., Caramazza, A., & Green, B. ( 1980). "Curvilinear motion in the absence of external forces: Naive beliefs about the motion of objects". Science, 210, 1139-1141.
National Council of Teachers of Mathematics ( 1989). Curriculum and evaluation standards for school mathematics. Reston, VA: NCTM.
Rosnick, P., & Clement, J. ( 1980, Autumn). "Learning without understanding: The effect of tutoring strategies on algebra misconceptions". Journal of Mathematical Behavior, 3, ( 1), 3-27.
Schmitt, A. P., & Crocker, L. ( 1981, April). Improving examinee performance on multiple-choice tests. Paper presented at the annual meeting of the American Educational Research Association, Los Angeles.
Snow, R. E. ( 1987). "Aptitude complexes". In R. E. Snow & M. J. Farr (Eds.) Aptitude learning and instruction, vol. 3. Hillside, NJ: Lawrence Erlbaum Associates.
Snow, R. E., & Peterson, P. L. ( 1985). Cognitive analyses of tests: Implications for redesign. In S. E. Embretson (Ed.), Test design: Developments in psychology and psychometrics. New York: Academic Press.
Tatsuoka, K. K. ( 1983, Winter). "Rule space: An approach for dealing with misconceptions based on item response theory". Journal of Educational Measurement, 20, ( 4), 345-354.
University of Chicago School Mathematics Project ( 1989 and 1990). "Z. Usiskin & S.L. Senk" (Series Directors). Glenview, IL: Scott, Foresman.
VanLehn, K. ( 1982, Summer). "Bugs are not enough: Empirical studies of bugs, impasses, and repairs in procedure skills". Journal of Mathematical Behavior, 3, ( 2), 3-71.
Wainer, H., & Kiely, G. L. ( 1987). "Item clusters and computerized adaptive tests: A case for testlets". Journal of Educational Measurement, 24, ( 3), 185-201.
Wainer, H., Wadkins, J.RJ., & Rogers, A. ( 1983). Was them one distractor too many? Program Statistics Research Technical Report No. 83-