Access to Computer-Based Testing for Students with Disabilities


NCEO Synthesis Report 45

Published by the National Center on Educational Outcomes

Prepared by:

Sandra J. Thompson  • Martha L. Thurlow  • Rachel F. Quenemoen  • Camilla A. Lehr

June 2002


Any or all portions of this document may be reproduced and distributed without prior permission, provided the source is cited as:

Thompson, S. J., Thurlow, M. L., Quenemoen, R. F., & Lehr, C. A. (2002). Access to computer-based testing for students with disabilities (Synthesis Report 45). Minneapolis, MN: University of Minnesota, National Center on Educational Outcomes. Retrieved [today's date], from the World Wide Web: http://education.umn.edu/NCEO/OnlinePubs/Synthesis45.html


Executive Summary

Called the “next frontier in testing,” computer-based testing is being promoted as the solution to many of states’ testing problems. With pressure to find more cost effective and less labor intensive approaches to testing, states are seeing computer-based testing as a way to address the increasingly challenging prospect of assessing all students in a state at nearly all grades. Computer-based testing is viewed with optimism as an approach that will make testing less expensive in the long run, and that will produce better assessments of the wide range of students who must now be included in state and district assessments.

Unfortunately, most states and testing companies have not specifically considered the needs of students with disabilities as they pursue computer-based testing. Often, the approach has simply been to take the paper and pencil test and put it onto a computer. This is not enough. Poor design elements on the paper test will transfer to the screen, and there will be additional challenges created by the move as well, challenges that may reduce the validity of the assessment results and possibly exclude some groups from participation in the assessment.

This paper recognizes both the opportunities created by the new frontier of computer-based testing, but also identifies the challenges. Research findings and accommodations considerations are also addressed, with the end result being a process and considerations for the initial transformation of paper/pencil assessments to inclusive computer-based testing.

The recommended process for a good transformation of a paper and pencil test to computer-based testing assumes first that the principles of universally designed assessments have been followed. Then, the five step that are recommended (and discussed in the paper) are:

Step 1. Assemble a group of experts to guide the transformation.

Step 2. Decide how each accommodation will be incorporated into the computer-based test.

Step 3. Consider each accommodation or assessment feature in light of the constructs being tested.

Step 4. Consider the feasibility of incorporating the accommodation into the computer-based test.

Step 5. Consider training implications for staff and students.

The paper also presents initial considerations for common accommodations within the categories of timing/scheduling, presentation, response, and setting.


Overview

On January 8, 2002, President Bush signed the reauthorization of the Elementary and Secondary Education Act into law as the “No Child Left Behind Act of 2001.” This Act requires states to have annual assessments in place in reading and mathematics for all students in grades three through eight by the end of the 2005-2006 school year, with science assessments added by the beginning of the 2007-2008 school year. Only nine states currently administer standards-based tests in both subjects across grades three through eight (Quality Counts, 2002), setting an unprecedented opportunity for states to enhance the participation of all students as they build and improve their assessment systems. Increased requirements within the law for itemized score analyses, disaggregation within each school and district by gender, racial and ethnic group, migrant status, English proficiency, disability, and income will challenge states to create new and more efficient ways to administer, score, and report assessment results.

Computer-based testing has been called the “next frontier in testing” as educators, testing companies, and state departments quickly work to transform paper/pencil tests into technology-based formats (Trotter, 2001). These efforts have occurred in a variety of ways and for a variety of tests. For example, some educators have transferred all of their classroom quizzes and tests into a computer-based format. The paper/pencil version of the Graduate Record Exam™ has been replaced with a computerized version that is administered across a variety of locations. NCS Pearson has developed eMeasurement™ Services—a suite of tools that delivers tests and their results electronically.1 As a result of these advances, states are facing pressure to create computer-based large-scale assessments (Russell, 2002). Some states are investigating the possibility of computerized adaptive testing for their statewide assessments, where the difficulty level of questions are presented and adjusted based on whether students’ responses are correct. According to Bennett (1998), “Whereas there is certainly a concerted move toward technology-based large-scale tests, there is no question that this assessment mode is still in its infancy. Like many innovations in their early stages, today’s computerized tests automate an existing process without reconceptualizing it to realize the dramatic improvements that the innovation could allow. Thus, these tests are substantively the same as those administered on paper” (p. 3).

With the dramatic increase in the use of the Internet over the past few years, and with it, the considerable potential of online learning (Kerrey & Isakson, 2002), assessment will need to undergo a complete transformation to keep pace. According to the Web-based Education Commission, “Perhaps the greatest barrier to innovative teaching is assessment that measures yesterday’s learning goals…Too often today’s tests measure yesterday’s skills with yesterday’s testing technologies—paper and pencil” (p. 3).

Experts suggest that the Internet will be used to develop tests and present items through dynamic and interactive stimuli such as audio, video, and animation (Lewis, 2001). Given this momentum, it is not surprising that there is a trend toward investigating and incorporating the Internet as the testing medium for statewide assessments. Bennett (2001) stated, “The trend is clear: the infrastructure is quickly falling into place for Internet delivery of assessment to schools, perhaps first in survey programs like NAEP (National Assessment of Educational Progress) that require only a small participant sample from each school, but eventually for inclusive assessments delivered directly to the desktop” (p. 10).

As the trend toward computer-based testing moves forward, it is important to focus carefully on the requirements of the newly enacted No Child Left Behind Act of 2001, and on the assessment participation requirements in the 1997 reauthorization of the Individuals with Disabilities Education Act. In addition, a 1996 Department of Justice Policy Ruling states that Titles II and III of the Americans with Disabilities Act requires State and local governments to provide effective communication whenever they communicate through the Internet. The Office for Civil Rights discussed the provision of effective communication:

The issue is not whether the student with the disability is merely provided access, but the issue is rather the extent to which the communication is actually as effective as that provided to others. Title II [of the Americans with Disabilities Act of 1990] also strongly affirms the important role that computer technology is expected to play as an auxiliary aid by which communication is made effective for persons with disabilities (Pages 1-2, 1996 Letter; 28 C.F.R. 35.160 (a)).

In further clarification, the Office for Civil Rights lists three basic components of effective communication: “timeliness of delivery, accuracy of the translation, and provision in a manner and medium appropriate to the significance of the message and the abilities of the individual with the disability” (Page 1, 1997 Letter). This clarification presents a significant and timely responsibility in the design of computer-based testing.

For the full benefits of computer-based testing to be realized, a thoughtful and systematic process to examine the transfer of existing paper/pencil assessments must occur. It is not enough to simply transfer test items from paper to screen. Not only will poor design elements on the paper test transfer to the screen, additional challenges may result that reduce the validity of the assessment results and possibly exclude some groups of students from assessment participation.

This paper presents factors to consider in the design of computer-based testing for all students, including students with disabilities and students with limited English proficiency. We begin with the opportunities and challenges presented by this “new frontier” in testing, and then explore research about effective universally designed assessments and technology-based accommodations, and relate this knowledge to computer-based testing design features. Finally, we present a process and consideration for the initial transformation of paper/pencil assessments to inclusive computer-based testing.


Opportunities

Several advocates have articulated the positive merits of computer-based testing. Some of the advantages over paper/pencil tests that have been cited include: efficient administration, preferred by students, self-selection options for students, improved writing performance, built-in accommodations, immediate results, efficient item development, increased authenticity, and the potential to shift focus from assessment to instruction. This section describes each of these prospective opportunities.

 

Efficient Administration

Computer-based tests can be administered to individuals or small groups of students in classrooms or computer labs, eliminating timing issues caused by the need to administer paper/pencil tests in large groups in single sittings. Different students can take different tests simultaneously in the same room.

 

Preferred by Students        

In an evaluation of testing experience, students overwhelmingly preferred computerized testing to paper/pencil testing (Brown & Augustine, 2001). Most students, regardless of group or ability, believed that the computer was easier, faster, and more fun. Students also responded that using a computer helped concentration by presenting only one question at a time. A recent survey on computer use by students with disabilities in Germany (Ommerborn & Schuemer, 2001) found several more advantages than disadvantages to computer use.

Brown-Chidsey and Boscardin (1999) interviewed students with learning disabilities and found that the computer helped them deal with limitations that often interfered with the completion of their work. The researchers concluded, “Students’ beliefs about computers are likely to shape the extent to which instructional technology enhances their achievement” (Brown-Chidsey, Boscardin, & Sireci, 1999, p. 4). A study at the Boston College Center for the Study of Testing, Evaluation, and Assessment (Trotter, 2001) found, “Students who are accustomed to writing on computers tend to do better on computerized tests than on paper exams. Conversely, students who don’t use computers often to write tend to do better when they complete their tests on paper” (p. 3).

 

Self-Selection Options for Students

Students have the option to choose features on computer-based tests, including format features and built-in accommodations. For example, Calhoon et al. found that “teachers are unlikely to provide a reader to meet student needs because teachers prefer test accommodations that require little individualization and do not require curricular or environmental modifications” (p. 272). Other recent work on accommodations for English Language Learners (Anderson, Liu, Swierzbin, Thurlow, & Bielinski, 2000; Liu, Anderson, Swierzbin, & Thurlow, 1999) has shown that students may not want to use certain accommodations (e.g., headphones to have instructions read in English, bilingual dictionaries) unless they are provided in specific ways. Teachers have reported that students with learning disabilities may opt not to use certain accommodations at certain times because they are not seen as helpful. Having the ability to self-select a technology-based reader or other tool may provide students access to a necessary accommodation that may not be offered currently, due to issues of convenience.

 

Improved Writing Performance

As computers become more common in schools, many of today’s students are accustomed to using computers in their daily work. Students write and calculate on computers as easily and with more speed and efficiency than previous generations could on paper. Research has shown that writing on computers leads students to write more and revise more than writing with paper/pencil (Daiute, 1985; Morocco & Neuman, 1986). Paper/pencil tests that require writing may underestimate the writing ability of students who have grown accustomed to writing on computers (Russell & Haney, 1997). In a survey of computer use by students with disabilities in Germany, Ommerborn and Schuemer (2001) found that the greatest advantage to students was the ease in which computers allowed them to write essays. Several of the students surveyed said that it was very difficult for them to write by hand.

 

Built-in Accommodations

Computer technology has been touted as a tool that can be used to empower students with disabilities (Goldberg & O’Neill, 2000). Specifically, computer-based testing has been viewed as a vehicle to increase the participation of students with disabilities in assessment programs. For example, the windows operating system supports a great variety of adaptive devices (e.g., screen readers, Braille displays, screen magnification, self-voicing Web browsers). According to Greenwood and Rieth (1994), the primary strength of computer-based testing is its “potential for removing traditional barriers to the inclusion of persons with disabilities in the assessment process through adaptations and accommodations as well as through new forms” (p. 110).

Computer-based testing can provide flexibility in administration for students with various learning styles. For example, the National Research Council (NRC, 2001) found computer-based testing to be effective for students who perform better visually than with text, are not native English speakers, or are insecure about their capabilities. According to NRC, “Technology is already being used to assess students with physical disabilities and other learners whose special needs preclude representative performance using traditional media for measurement” (p. 286).

Standardization of accommodated assessment administrations can be facilitated by computer-based testing. According to Brown-Chidsey and Boscardin (1999), “Using a computer to present a test orally controls for standardization of administration and allows each student to complete the assessment at his/her own pace” (p. 2). Brown and Augustine (2001) cited educator appreciation of a computer’s ability to present items over and over, in both written and verbal form, without the need for a non-standard (and sometimes impatient) human reader. Several studies have shown the positive effects of providing a reader for math tests (see Calhoon, Fuchs & Hamlett, 2000; Fuchs, Fuchs, Eaton, Hamlett, & Karns, 2000; Tindal, Heath, Hollenbeck, Almond, & Harniss, 1998).

With the use of audio and video built into computer-based tests, specialized testing equipment such as audiocassette recorders and VCRs could become obsolete (Bennett, Goodman, Hessinger, Ligget, Marshall, Kahn, & Zack, 1999). According to Bennett (1995), “Test directions and help functions would be redundantly encoded as text, audio, video, and Braille, with the choice of representation left to the examinee. The digital audio would allow for spoken directions, whereas the video could present instruction in sign language or speech-readable form. Among other things, these standardized presentations should reduce the noncomparability associated with the uneven quality of human readers and sign-language interpreters” (p. 10).

Finally, just as the use of accommodations on paper/pencil tests has increased awareness and use of accommodations in the classroom, so can opportunities to use the built-in accommodation features of computer-based tests encourage and increase the use of those features in classroom and other environments. For example, Williams (2002) believes, “It is possible that new developments in speech recognition technology could increase opportunities for individual reading practice with feedback, as well as collecting assessment data to inform instructional decision making” (p. 41). In addition, most computer-based tests have built-in tutorials and practice tests. These tutorials provide students with both opportunities for familiarizing themselves with the software and immediate feedback (Association of Test Producers, 2000).

 

Immediate Results

One of the major drawbacks of state testing on paper has been the long wait for results because of the need to distribute, collect, and then scan test booklets/answer forms and hand score open-response items and essays. Students tested in the spring often do not receive their results until fall—nor do their teachers or schools. The results of computer-based tests can be available immediately, providing schools with diagnostic tools to use for improved instruction, and states with information to guide policy. Even open-ended items can be scored automatically, greatly reducing cost and scoring time (Thompson, 1999). According to a report by the National Governors Association (2002), cost savings can result from “the elimination of printing and shipping activities when paper testing ceases” (p. 7).

 

Efficient Item Development

As computer-based testing becomes more developed, item development will be more efficient, higher quality, and less expensive (National Governors Association, 2002). Bennett (1998) believes that at some point items might be generated electronically, with items matched to particular specifications at the moment of administration. “Test design will also be the focal point for responding to diversity. The effects of different test designs on minority group members, females, …will be routinely simulated in deciding what skills and which task formats to use in large-scale assessments” (Bennett, 1998, p. 9). According to Russell (2002), “already, some testing programs are experimenting with ways to generate large banks of test items via computer algorithms with the hope of saving the time and money currently required to produce test items manually” (p. 65). Baker (2002) cited several research efforts that have significantly advanced the progress of schema or template-based, multiple-choice development and test management systems (see Bejar, 1995; Bennett, 2002; Chung, Baker, & Cheak, 2001; Chung Klein, Herl & Bewley, 2001; Gitomer, Steinbert, & Mislevy, 1995; Mislevy, Steinbert, & Almond, 1999).

 

Increased Authenticity

Computers allow for increased use of “authentic assessments”—responses can be open-ended rather than just relying on multiple choice. According to Bennett (1998), the next generation of computer-based tests will be “qualitatively different from those of the first generation. This difference will be evident in the test questions (and, in some cases, the characteristics they measure), as well as in development, scoring, and administrative processes” (p. 4, see Table 1). Bennett notes that many Americans are now receiving their news from TV and the World Wide Web, with the expectation that students will increasingly be able to process information from a variety of sources, not just from print. Bennett also suggests that response formats will shift dramatically, perhaps including problems in which a student is not expected to find the best answer, but a reasonable one within certain constraints.

Table 1. Three Generations of Large-Scale Educational Assessment

Generation

Key Characteristics

First-Generation Computer-Based Tests

(Infrastructure Building)

Primarily serve institutional needs

Measure traditional skills and use test designs and item formats closely resembling paper-based tests

Take limited advantage of technology

Next-Generation Computer-Based Tests

(Qualitative Change)

Primarily serve institutional needs

Use new item formats (including multimedia and constructed response), automatic item generation, automatic scoring, and electronic networks to make performance assessment an integral program component; measure some new constructs

Allow customers to interact with testing companies entirely electronically

Generation “R” Test (Reinvention)

Serve both institutional and individual purposes

Integrated with instruction via electronic tools so that performance is sampled repeatedly over time; designed according to cognitive principles

Use complex simulations, including virtual reality, that model real environments and allow more natural interaction with computers

Adapted from: Bennett, R.E. (1998). Reinventing assessment: Speculations on the future of large-scale educational testing. Princeton, NJ: Policy Information Center, Educational Testing Service.

 

Shifts Focus from Assessment to Instruction

Bennett (1998) believes that eventually large-scale assessment will join with instruction. “Decisions like certification of course mastery, graduation eligibility, and school effectiveness will no longer be based largely on one examination given at a single time but will also incorporate information from a series of measurements” (p. 11). “By virtue of moving assessment into the curriculum, the locus of the debate over performance differences must logically shift from the accuracy of assessment to the adequacy of instruction” (p. 12). Bennett continues this line of thought in a 2001 article, “When well-constructed tests closely reflect the curriculum, group differences should become more an issue of instructional inadequacy than test inaccuracy. As attention shifts to the adequacy of instruction, the ability to derive meaningful information from test performance becomes more critical” (p. 2).


Challenges

Despite the potential advantages offered by computer-based testing, there remain several challenges, especially in the transition from paper/pencil assessments. First of all, the use of technology cannot take the place of content mastery. No matter how well a test is designed, or what media are used for administration, students who have not had an opportunity to learn the material tested will perform poorly. Students need access to the information tested in order to have a fair chance at performing well. Hollenbeck, Tindal, Harniss, and Almond (1999) strongly caution that the use of a computer, in and of itself, does not improve the overall quality of student writing. They, and other researchers, continue to find significantly lower mean test scores for students with disabilities than for their peers without disabilities. Other challenges that must be overcome in order for computer-based testing to be effective include: issues of equity and skill in computer use, added challenges for some students, technological challenges, security of online data, lack of expertise in designing accessible Web pages, and prohibitive development cost.

 

Issues of Equity and Skill in Computer Use

Concerns continue to exist in the area of equity, where questions are asked about whether the required use of computers for important tests puts some students at a disadvantage because of lack of access, use, or familiarity (Trotter, 2001). Concerns include unfamiliarity with answering standardized test questions on a computer screen, using buttons to search for specific items, and indecision about whether to use traditional tools (e.g., hand held calculator) vs. computer-based tools. According to Wissick and Gardner (2000), “Students will not take advantage of help options or use navigation guides if they require more personal processing energy than they can evoke” (p. 38).

A survey on computer use by students with disabilities in Germany (Ommerbon & Schuemer, 2001) found the cost of acquiring and using a computer as the greatest barrier, with the second being a lack of training opportunities. Students who needed assistive technology cited high cost and lack of information as barriers to increased computer use.

The gap in access to technology—sometimes referred to as the “Digital Divide”—is continuing to grow. According to Bolt and Crawford, authors of Digital Divide (2000, p. 98):

While over 80 percent of families with incomes of $100,000 or more have computers at home, only about 25 percent of those households with annual incomes under $30,000 have home access to computers. Demographically, this means that the digital revolution is in full swing in America’s wealthy suburbs and affluent sections of cities and towns, while in some of our poorest areas, it is a phenomenon that is at best heard about on television. The gap has widened considerably for computer ownership among racial minorities when compared with European-Americans. In the context of the overall racial digital divide, a low-income European-American child is three times more likely to have internet access than his or her African-American counterpart, and four times as likely as a Latino family in the same socioeconomic category.

 

Added Challenges for Some Students

Some research questions whether the medium of test presentation affects the comparability of the tasks students are being asked to complete. Here are some findings that show added difficulty for some students.

  • Computer-based testing places more demands on certain skills such as typing, using multiple screens to recall a passage, mouse navigation, and the use of key combinations (Bennett, 1999; Ommerborn & Schuemer, 2001).
  • Some people become more fatigued when reading text on a computer screen than on paper (Allan et al, 2001; Mourant, Lakshmanan, & Chantadisai, 1981).
  • Long passages may be more difficult to read on computer screen (Haas & Hayes, 1986).
  • The inability to see an entire problem on screen at one time is challenging because some items require scrolling horizontally and vertically to get an entire graphic on the page (Hollenbeck, Tindal, Harniss, & Almond, 1999).
  • Few teachers use computers in math instruction, or spreadsheets, so students do not know how to “think on the monitor” (Trotter, 2001).
  • Graphic user surfaces present considerable obstacles to students with visual impairments (Ommerborn & Schuemer, 2001).

 

Technological Challenges

Computers and the Internet do not always work the way we want them to. The word “crash” has taken on a whole new meaning in our technology-oriented world. An issue brief of the National Governors Association listed some of the problems: “testing sessions may be interrupted, proceed so slowly as to interfere with student performance, or encounter difficulties in machine operation or telecommunications that cause data to be lost entirely. Unlike a paper-and-pencil testing system, keeping a computerized system functioning requires significant technical expertise, which many schools lack” (p. 7). Burk (1999) argued, “Computerized testing for students with disabilities is viable but only with appropriate equipment, staff preparation, and student preparation” (p. 6). Some researchers, like Hamilton, Klein, and Lorie (2001), question whether an infrastructure currently exists that can support the use of computers by large numbers of students. They also question the quality of the hardware, especially with our constant evolution of technology, and whether there is sufficient training for staff who must help with administration and technological difficulties that may be encountered. Also, the test program may be device-dependent; for example, there may be a difference in contrast between monitors and speed of the computer. A test presented online may default to the computer’s font, print size, and background color. Graphics may become distorted on small screens, reducing standardization of the assessment presentation. According to a report by the National Governors Association (2002, p. 7):

The reality of statewide computerized testing is that equipment will vary from one school to the next and, sometimes, from one machine to the next within the same school. Similarly, the speed of the Internet connection may differ across schools or within the same school by time of day. The result of these variations is that one student may take a test on a small-screen monitor running at low resolution, thereby requiring repeated scrolling to read comprehension passages. Because of the Internet connection, that student may have to wait five seconds before the next passage is displayed. In contrast, another student may be able to see not only the entire passage but also the questions on the same single screen, with no wait between passages. It is known that such variations can affect performance, but it is not known how to adjust for them in test results.

A constant challenge is ongoing entry of new Web browsers and new versions of existing browsers. In addition, HTML and document converters are constantly being developed and modified. Unfortunately, several features may not be universally accessible and advancements in assistive technology are usually several steps behind new Internet components and tools. For example, using an eye pointing device may increase the time needed to position each eye pointing frame, leading to increased fatigue, boredom, and inattention by the test-taker (Haaf, Duncan, Skarakis-Doyle, Carew, & Kapitan, 1999). As computer-based testing becomes a reality across states and districts, it is important to ensure that the new technology either improves accessibility or is compatible with existing assistive computer technology.

 

Security of Online Data

Critics question whether online data are secure. In a report by the National Governors Association (2002), security issues related to protecting test questions and ensuring the confidentiality of student data in a computerized system were compared to those encountered with conventional tests and were found to be conceptually similar. Differences were found in mechanisms to accomplish breaches and protect against them. For example, test questions and student data could be stolen from central servers or from local computers. This can be minimized through technical design that encrypts questions and student records and through the careful use of passwords.

 

Lack of Ability to Design Accessible Web Pages

According to WebAIM, (Web Accessibility in Mind, an initiative of the Center for Persons with Disabilities at Utah State University, 2001), there are 27.3 million people with disabilities who are limited in the ways they can use the Internet: “The saddest aspect of this fact is that the know-how and the technology to overcome these limitations already exist, but they are greatly under-utilized, mostly because Web developers simply do not know enough about the issue to design pages that are accessible to people with disabilities. Unfortunately, even some of the more informed Web developers minimize the importance of the issue, or even ignore the problem altogether” (p. 1).

 

Prohibitive Development Cost

Development expenses listed in a report by the National Governors Association (2002) include: “central hardware to deliver the test over the Internet, local telecommunications hardware, machines in schools for students to take the tests on, and test authoring and delivery software. Labor expenses include costs for entering questions into the testing software, assuring quality in the test’s operation, extracting student records from the test database and translating the information into a form suitable for analysis, and servicing the technology that runs the system. There are also ongoing connection charges” (p. 7). The National Governors Association recommends that states form consortia, cooperative agreements, or buying pools in order to reduce the costs of “test questions, telecommunications equipment, computer hardware, testing software, and equipment maintenance” (p. 9).


Universally Designed Computer-based Tests

Universal design is defined by the Center for Universal Design (1997) as “the design of products and environments to be usable by all people, to the greatest extent possible, without the need for adaptation or specialized design.” The Assistive Technology Act of 1998 (PL 105-394) addresses universal design through this definition:

The term ‘universal design’ means a concept or philosophy for designing and delivering products and services that are usable by people with the widest possible range of functional capabilities, which include products and services that are directly usable (without requiring assistive technologies) and products and services that are made usable with assistive technologies.

A recent report on the application of universal design to large-scale assessments (Thompson, Johnstone, & Thurlow, 2002) found that good basic design, whether on paper or technology-based, increases access for everyone, and poor design can have detrimental effects for nearly everyone. Many accessibility issues relate to content and design features, with content defined as subject matter on the page while design is defined as the organization or arrangement of objects and information on the page.

 

Content

An important function of well-designed assessments is that they actually measure what they are intended to measure. Test developers need to carefully examine what is to be tested and design items that offer the greatest opportunity for success within those constructs. Just as universally designed architecture removes physical, sensory, and cognitive barriers to all types of people in public and private structures, universally designed assessments need to remove all non-construct-oriented cognitive, sensory, emotional, and physical barriers.

Assessment instructions need to be easy to understand, regardless of a student’s experience, knowledge, language skills, or current concentration level. Directions and questions need to be in simple, clear, and understandable language. It is important for designers of computer-based tests to strive for content that is understandable and navigable. According to WebAIM (2001), “this includes not only making the language clear and simple, but also providing understandable mechanisms for navigating within and between pages” (p. 8).

 

Design Features

Legibility is the physical appearance of text; the way shapes of letters and numbers enable people to read text “quickly, effortlessly, and with understanding” (Schriver, 1997, p. 252). Though a great deal of research has been conducted in this area, the personal opinions of editors often prevail (Bloodsworth, 1993; Tinker, 1963). Bias results from items that contain physical features that interfere with a student’s focus on or understanding of the construct an item is intended to assess. Format dimensions can include contrast, type size, spacing, typeface, leading, justification, line length/width, blank space, graphs and tables, illustrations, and response formats (see Table 2).

Table 2. Characteristics of Maximum Legibility

Dimension Maximum Legibility Characteristics
Contrast

Black type on matte pastel or off-white paper is most favorable for both legibility and eye strain.

Type Size

Large type sizes are most effective for young students who are learning to read, students with visual difficulties, and individuals with eye fatigue issues.

Spacing

The amount of space between each character can affect legibility. Spacing needs to be wide between both letters and words. Fixed-space fonts seem to be more legible for some readers than proportional-spaced fonts.

Leading

Leading, the amount of vertical space between lines of type, must be enough to avoid type that looks blurry and has a muddy look. The amount needed varies with type size (for example, 14-point type needs 3-6 points of leading).

Typeface

Standard typeface, using upper and lower case, is more readable than italic, slanted, small caps, or all caps.

Justification

Unjustified text (with staggered right margin) is easier to see and scan than justified text – especially for poor readers.

Line Length

Optimal length is about 4 inches or 8 to 10 words per line. This length avoids reader fatigue and difficulty locating the beginning of the next line, which causes readers to lose their place.

Blank Space

A general rule is to allow text to occupy only about half of a page. Blank space anchors text on the paper and increases legibility.

Graphs and Tables

Symbols used on graphs need to be highly discriminable. Labels should be placed directly next to plot lines so that information can be found quickly and not require short-term memory.

Illustrations

When used, an illustration should be directly next to the question for which it is needed. Because illustrations create numerous visual and distraction challenges, and may interfere with the use of some accommodations (such as magnifiers), they should be used only when they contain information being assessed.

Response Formats

Response options should include larger circles (for bubble response tests), as well as multiple other forms of response.

From Thompson, Johnstone, & Thurlow, 2002.

 

It is important to maintain these aspects of universal design when converting paper/pencil tests to computer-based tests. Poor design on paper will result in poor design on a screen. In addition to the universal design elements described above, computer-based testing can offer several additional features that can increase the accessibility of assessments for all students, including students with disabilities and English language learners. According to WebAIM (2001), “Everyone benefits from well-designed Web sites, regardless of cognitive capabilities. In this context, ‘well-designed’ can be defined as having a simple and intuitive interface, clearly worded text, and a consistent navigational scheme between pages” (p. 8). These features also need to take into account variations in technology available in schools across a district or state, and the other challenges described in the previous section.

The provision of navigation tools and orientation information in pages can maximize access for all users. However, there are users who cannot access visual clues such as image maps, scroll bars, side-by-side frames, or graphics. Some users lose contextual information because they are accessing a page one word at a time through speech synthesis or braille. Ommerborn and Schuemer (2001, p. 21) conducted a survey of German students with disabilities and found that:

Being able to use various ways of sending commands within a programme not only helps people with specific handicaps, but also renders working with a computer much more comfortable for all users with their different preferences and skills…Multimedia products addressing several senses or allowing the user to choose between visual and acoustic information not only makes access easier for people with impaired senses but also makes the product altogether more attractive.


Assistive Technology

Even though items on universally designed assessments will be accessible for most students, there will still be some students who continue to need accommodations, including assistive technology. According to Bowe (2000), “One big advantage of universal design is that it minimizes the need, on the part of people with disabilities, for assistive technology devices and services” (p. 25). Items are biased when they do not allow for adaptation for use with assistive technology that is needed to facilitate use of the student’s primary means of communication. Computer-based tests need to be accessible for a variety of forms of assistive technology (e.g., key guards, specialized keyboards, trackballs, screen readers, screen enlargers) for students with physical or sensory disabilities. Bowe (2000) stated, “If a product or service is not usable by some individual, it is the responsibility of its developers to find ways to make it usable, or, at minimum, to arrange for it to be used together with assistive technologies of the user’s choice” (p. 27). Appendix A describes several resources to assist assessment developers in increasing access to assistive technology.

It is important to note that making computer-based testing amenable to assistive technology does not mean that students will automatically know what to do. Educators, especially special educators, need to be competent in technology knowledge and use. According to Lahm and Nickels (1999), “Educators must become proactive in their technology-related professional development because teacher education programs have only recently begun addressing the technology skills of their students” (p. 56). The Knowledge and Skills Subcommittee of the Council for Exceptional Children’s (CEC) Professional Standards and Practice Standing Committee has developed a set of 51 competencies for assistive technology that cross 8 categories, along with knowledge and skills statements for each category (see Lahm & Nickels, 1999).

 

Laws Governing Assistive Technology

The use of assistive technology is defined in the Individuals with Disabilities Education Act (IDEA 97), the Rehabilitation Act of 1997, and is implied in the Americans with Disabilities Act (ADA). IDEA 97 defines assistive technology as “any item, piece of equipment, or product system…that is used to improve the functional capabilities of individuals with disabilities; and any service that directly assists an individual in the selection, acquisition, or use of an assistive technology device.” An “assistive technology device” is further defined as “any item, piece of equipment, or product system, whether acquired commercially off the shelf, modified, or customized, that is used to increase, maintain, or improve the functional capabilities of a child with a disability” (20 U.S.C. 1401(1)).

The Rehabilitation Act (reauthorized in 1997) requires institutions receiving federal funds to have accessible Web sites. Similarly, the Americans with Disabilities Act (ADA) requires covered entities to furnish appropriate auxiliary aids and services where necessary to ensure effective communication with individuals with disabilities, unless doing so would result in a fundamental alteration to the program or service or in an undue burden (See 28 C.F.R. 36.303; 28 C.F.R. 35.160). Auxiliary aids include taped texts, Brailled materials, large print materials, captioning, and other methods of making audio and visual media available to people with disabilities. Titles II and III of the ADA require State and local governments and the business sector to provide effective communication whenever they communicate through the Internet. In order to specifically address the needs of people with visual disabilities, an ADA policy ruling determined that a text format rather than a graphical format assures accessibility to the Internet for individuals using screen readers. Without special coding, a text browser will only display the word “image” when it reads a graphic image, and if the graphic is essential to navigating the site (e.g., navigational button or arrow) or if it contains important information (e.g., table or image map) the user can get stuck and not be able to move or understand the information provided.

 

Assistive Technology Resources

There are several resources available to increase the accessibility of computer-based testing for students with disabilities. These resources are found primarily in the area of general Web content. Chishold, Vanderheiden, and Jacobs (1999) offer guidelines on how to make Web content accessible to people with disabilities. They are quick to point out that following these guidelines can also make Web content more available to all users, including those who use voice browsers, mobile phones, automobile-based personal computers, and other technology. The guidelines, found in Table 3, explain how to make multimedia content more accessible to a wide audience. For more information about Web accessibility, visit http://www.webaim.org, the official Web site of Web Accessibility in Mind (WebAIM). Several additional resources can be found in Appendix A.

 

Table 3. Web Content Accessibility Guidelines

21 December 2001: The Authoring Tool Accessibility Guidelines Working Group has released the first public Working Draft of Authoring Tool Accessibility Guidelines "Wombat". The guidelines are for developers who wish to design authoring tools that produce accessible Web content and who wish to create accessible authoring interfaces)

Guideline 1. Provide equivalent alternatives to auditory and visual content.
Provide content that, when presented to the user, conveys essentially the same function or purpose as auditory or visual content.

Guideline 2. Don't rely on color alone.
Ensure that text and graphics are understandable when viewed without color.

Guideline 3. Use markup and style sheets and do so properly.
Mark up documents with the proper structural elements. Control presentation with style sheets rather than with presentation elements and attributes.

Guideline 4. Clarify natural language usage.
Use markup that facilitates pronunciation or interpretation of abbreviated or foreign text.

Guideline 5. Create tables that transform gracefully.
Ensure that tables have necessary markup to be transformed by accessible browsers and other user agents.

Guideline 6. Ensure that pages featuring new technologies transform gracefully.
Ensure that pages are accessible even when newer technologies are not supported or are turned off.

Guideline 7. Ensure user control of time-sensitive content changes.
Ensure that moving, blinking, scrolling, or auto-updating objects or pages may be paused or stopped.

Guideline 8. Ensure direct accessibility of embedded user interfaces.
Ensure that the user interface follows principles of accessible design: device-independent access to functionality, keyboard operability, self-voicing, etc.

Guideline 9. Design for device-independence.
Use features that enable activation of page elements via a variety of input devices.

Guideline 10. Use interim solutions.
Use interim accessibility solutions so that assistive technologies and older browsers will operate correctly.

Guideline 11. Use W3C technologies and guidelines.
Use W3C technologies (according to specification) and follow accessibility guidelines. Where it is not possible to use a W3C technology, or doing so results in material that does not transform gracefully, provide an alternative version of the content that is accessible.

Guideline 12. Provide context and orientation information.
Provide context and orientation information to help users understand complex pages or elements.

Guideline 13. Provide clear navigation mechanisms.
Provide clear and consistent navigation mechanisms -- orientation information, navigation bars, a site map, etc. -- to increase the likelihood that a person will find what they are looking for at a site.

Guideline 14. Ensure that documents are clear and simple.
Ensure that documents are clear and simple so they may be more easily understood.


Computerized Adaptive Testing

In computerized adaptive testing, a student responds to an item, which is followed by more difficult items if the student responded correctly, or easier items if the student responded incorrectly (Hamilton, Klein, & Lorié, 2001). Through this process, a student’s performance level is determined. According to Hamilton, Klein and Lorié (2001), “each response leads to a revised estimate of the student’s proficiency and a decision either to stop testing or to administer an additional item that is harder or easier than the previous one” (p. 12).

The advantages cited for computerized adaptive testing include short and efficient administration time, with the computer selecting the next item immediately after an item is completed. A proficiency level is determined through the completion of fewer items than a test in which students respond to every item on the test. According to McBride (1985), “A well-constructed adaptive test attains a specified level of measurement precision in about half the length of time a conventional test would require to reach the same level. This is attributable to the adaptive feature; by tailoring the choice of questions to match the examinee’s ability, the test bypasses most questions that are inappropriate in difficulty level and contribute little to the accurate estimation of the test-taker’s ability” (p. 26).

However Stone and Lunz (1994) found that the inability of students taking computerized adaptive tests to review items and alter their responses may affect the quality of measurement. Students cannot select the order in which they respond to items, or leave some items blank.

There is some research that suggests that students who change earlier answers may improve their scores by a small margin (Gerson & Bergstrom, 1995; Stocking, 1996). There is also concern that some students may respond to early items wrong on purpose to get easier questions (Wainer, 1993).

The use of computerized adaptive tests for large-scale assessments has come under scrutiny by federal officials who question whether “levels” testing meets accountability requirements of Title I (Olson in Education Week, 2002). Levels testing, which has been defined as testing at a student’s instructional level rather than at his or her grade level, relies on overlapping levels within a single grade level, and common items among the levels. Computerized adaptive testing goes beyond the need for separate booklets by using a variety of complex algorithms that allows the student to move among different “levels” more freely, based on performance (Quenemoen, Thurlow, & Bielinski, in press).

 

Process for Developing Inclusive Computer-based Tests

The transformation of traditional paper/pencil tests to inclusive computer-based tests takes careful and thorough work that includes the collaborative expertise of many people. As discussed earlier in this paper, in order for the full benefits of computer-based testing to be realized, a thoughtful and systematic process to examine the transfer of existing paper/pencil assessments must occur. It is not enough to simply transfer test items from paper to screen. Not only will poor design elements on the paper test transfer to the screen, additional challenges may result in reducing the validity of assessment results. Some of the challenges traditionally present with accommodations could be minimized through universally designed computer-based tests, while others might remain or present even greater challenges. Here are some steps to follow in addressing these transformation issues.

Step 1. Assemble a group of experts to guide the transformation. This group needs to include experts on assessment design, accessible Web design, universal design, and assistive technology, along with state and local assessment and special education personnel. Table 4 contains a worksheet to use when gathering this group.

Table 4. Assemble a Group of Experts to Guide the Development of Computer-based Tests.

Type of Expert

Names/Positions

Assessment design experts

 

Accessible Web design experts

 

Universal design experts

 

Assistive technology experts

 

State assessment personnel

 

State special education personnel

 

Local assessment personnel

 

Local special education personnel

 

 

Step 2. Decide how each accommodation will be incorporated into the computer-based test. Examine each possible accommodation in light of computer-based administration. Some of the traditional paper/pencil accommodations will no longer be needed (e.g., marking responses on test form rather than on answer sheet), while others will become built-in features that are available to every test-taker. Some accommodations will be more difficult to incorporate than others, requiring careful work by test designers and technology specialists. The standards and guidelines for accessible Web design found in Appendices B, C, and D should be used when building in these features.

Step 3. Consider each accommodation or assessment feature in light of the constructs being tested. For example, what are the implications of the use of a screen reader when the construct being measured is reading, or the use of a spellcheck when achievement in spelling is being measured as part of the writing process? As the use of speech recognition technology permeates the corporate world, constructs that focus on writing on paper without the use of a dictionary or spellchecker may become obsolete and need to be reconsidered.

Step 4. Consider the feasibility of incorporating the accommodation into computer-based tests. Questions about the feasibility of the accommodation may require review by technical advisors, or members of a policy/budget committee, or may require short-term solutions along with long term planning. According to the Technology Act of 1998 (§ 1194.2 Application):

(a) When developing, procuring, maintaining, or using electronic and information technology, each agency shall ensure that the products comply with the applicable provisions of this part, unless an undue burden would be imposed on the agency.

(1) When compliance with the provisions of this part imposes an undue burden, agencies shall provide individuals with disabilities with the information and data involved by an alternative means of access that allows the individual to use the information and data.

Construct a specific plan for building in features that are not immediately available, in order to keep them in the purview of test developers. Extensive pilot testing needs to be conducted with a variety of equipment scenarios and accessibility features.

Step 5. Consider training implications for staff and students. The best technology will be useless if students or staff do not know how to use it. Careful design of local training and implementation needs to be part of the planning process. Special consideration needs to be given to the computer literacy of students and their experience using features like screen readers. Information about the features available on computer-based tests needs to be marketed to schools and available to IEP teams to use in planning a student’s instruction and in preparation for the most accessible assessments possible. Practice tests that include these features need to be available to all schools year around. This availability presents an excellent opportunity for students whose schools have previously been unaware of or balked at the use of assistive technology.


Considerations

Most states have a list of possible or common accommodations for students with disabilities within the categories of timing/scheduling, presentation, response, and setting (Thurlow, Lazarus, & Thompson, 2002). Some states also list accommodations specifically designed for students with limited English proficiency (Rivera, Stansfield, Scialdone, & Sharkey, 2000).

 

Presentation Accommodations

The list of accommodations in Table 5 is an expanded list of presentation accommodations generated to address the needs of students with a variety of accommodation needs—including students with disabilities, students with limited English proficiency, students with both disabilities and limited English proficiency, and students who do not receive special services, but have a variety of unique learning and response styles and needs. For each accommodation, relevant considerations are provided in the table. The three columns to the right of the Considerations Column represent:

  • A built-in feature of universally designed computer-based tests (available for self-selection by any student)
  • The need for this accommodation is not affected by computer-based testing
  • A new or different accommodation may be needed for computer-based testing

 Following Table 5 is a summary of considerations for each of the presentation accommodations.

Table 5. Presentation Accommodations

Accommodation

Considerations for Computer-based Tests

1*
Built in

2*
Not affected

3*
Other  accom. needed

Large print and magnification

 

Capacity for any student to self-select print size or magnification

Graphics and text-based user interfaces have different challenges

Scrolling issues

Determination of optimal print size (e.g., default set at 14 pt) and size of graphics to reduce need for large print or magnification

Variations in screen size

Effects of magnification on graphics and tables

X

 

X

Instructions simplified/clarified

 

Instructions designed for maximum simplicity/clarity

Capacity for student to self-select alternate versions of instructions in written or audio format

Capacity to have instructions repeated as often as student chooses

Variable audio speed

X

 

 

Audio presentation of instructions and test items

 

 

Capacity for any student to self-select audio (screen reader) presentation of instructions (all students wear ear/headphones)

Graphics and text-based user interfaces have different challenges

Capacity to repeat instructions and items as often as student chooses

Variable audio speed

Audio presentation must be high quality

X

 

 

Instructions and test items presented in sign language

 

 

Capacity for student to self-select alternate versions of instructions in written format

Capacity for student to self-select signed versions of instructions and test items (Note: some words may not be easily translated into sign language)

Graphics and text-based user interfaces have different challenges

Not feasible to read lips on video

X

 

 

Instructions and test items presented in a language other than English

 

 

 

Capacity for student to self-select alternate language versions of test items in written or audio format

Beware of the speed at which some languages are produced, they may take more space than English

Machine translation capability

Graphics and text-based user interfaces have different challenges

Capacity for pop-up translation

Variable audio speed

X

X

 

Braille

 

Use of screen reader that converts text into synthesized speech or Braille

Alternative tags for images

Graphics and text-based user interfaces have different challenges

Students using Braille may require extra time

 

X

X

Highlighter and place holding templates Capacity for any student to self-select highlighter

Graphics and text-based user interfaces have different challenges

Clear instructions for use of highlighter

X

 

 

Graphics or images that supplement text Careful selection of images

Alternative text or "alt tags" for images

Graphics and text-based user interfaces have different challenges

Avoidance of complex backgrounds or wallpaper that may interfere with the readability of overlying text

Tactile graphics or three-dimensional models may be needed for images

 

 

X

Paper/pencil test format Students who are not computer literate

Students who need accommodations that are not available on computer-based assessments