Empirical Support for Accommodations Most Often Allowed in State Policy


NCEO Synthesis Report 41

Published by the National Center on Educational Outcomes

Prepared by:

Martha Thurlow • Sara Bolt

November 2001


Any or all portions of this document may be reproduced and distributed without prior permission, provided the source is cited as:

Thurlow, M., & Bolt, S. (2001). Empirical support for accommodations most often allowed in state policy (Synthesis Report 41).Minneapolis, MN: University of Minnesota, National Center on Educational Outcomes. Retrieved [today's date], from the World Wide Web: http://education.umn.edu/NCEO/OnlinePubs/Synthesis41.html


Executive Summary

The need for research findings on the effects of assessment accommodations has been lamented for several years. This need was heightened in importance as IDEA 97 became law and required that students with disabilities be included in assessments and provided with needed assessment accommodations. In 1999 Tindal and Fuchs developed a comprehensive review of the literature on test changes. The complexity of the many studies included in that review made it evident that in the future a searchable data base was needed to cull the information for addressing specific accommodations, specific groups of students, specific ages, or combinations of these and other factors. NCEO launched a searchable database of research on accommodations in September, 2001 (http://education.umn.edu/NCEO/AccomStudies.htm), with plans to update the research included in the accommodations database every three months.

This report is a compilation of information collected from the searchable database on the accommodations that are most often allowed in state policies: Braille editions, computer/machine response, dictate response to scribe, extended time, interpreter for instructions, large print edition, mark answers in test booklet, read aloud, test direction clarifications, and test breaks. The summary of research reveals that there are not yet simple or conclusive answers to questions about the effects of specific accommodations. It is important to continue to document what the research tells us, and to analyze the specific contexts of the studies, their methodologies, and their findings.


Research Findings Needed

The reauthorization of the Individuals with Disabilities Education Act in 1997 (IDEA 97) requires that students with disabilities be included in state and district assessments, with appropriate accommodations as necessary. Accommodations are changes in assessment materials or procedures that address aspects of students’ disabilities that may interfere with the demonstration of their knowledge and skills on standardized tests. Accommodations attempt to eliminate barriers to meaningful testing, thereby allowing for the participation of students with disabilities in state and district assessments.

Currently there is limited consensus on what constitutes an “appropriate” accommodation. Although providing accommodations for sensory or physical disabilities (e.g., Braille, large print, etc.) has rarely been questioned, accommodations for students with disabilities that specifically affect cognitive functioning (e.g., read the test to the student, extended time, etc.) have been considered more controversial due to beliefs that these accommodations may alter the construct the test is intended to measure (Phillips, 1994). This controversy is evident from analyses of state policies of testing accommodations. Thurlow, House, Boys, Scott, and Ysseldyke (2000) found that although there has been an increase in the number of states with accommodation policies, states continue to vary in terms of which accommodations they will either allow on statewide tests or treat in the same way as “standard” test administrations.

Complicating this issue is the fact that empirical research on the effects of specific testing accommodations is still quite limited (Fuchs, Fuchs, Eaton, Hamlett, & Karns, 2000). In 1999, Tindal and Fuchs completed a comprehensive review of literature on testing accommodation research. Although they identified 114 studies on test changes overall, these studies have not provided conclusive support for many specific accommodations. Also, many of the studies that were included in their review did not address the use of specific accommodations.

Given the controversy surrounding the use of accommodations as evidenced by state policy analysis (Thurlow et al., 2000), it is important to know that the most frequently allowed accommodations are supported by a solid research base. The purpose of this document is to summarize information currently available on 10 of the most frequently allowed testing accommodations in state policies. The intent of this summary is to provide information on what the research currently indicates, and to provide decision makers with a general overview of common accommodations in order to help in making “appropriate” accommodation decisions.


Method

In 1999 NCEO gathered, organized, and reported various aspects of state accommodation policies (Thurlow et al., 2000). Five main categories of accommodations were identified: presentation, presentation equipment, scheduling, response, and setting accommodations. State policy information was organized according to each of these main categories, and by each of the more frequently mentioned individual accommodations (read aloud, extended time, dictated response, etc.). The report also included information on states that allowed specific accommodations only under certain conditions (e.g., on certain test content areas, for specific grades of students, etc.). For the current synthesis, the number of states allowing accommodations both with and without such limitations was tallied. Information for the top 12 accommodations that are allowed in state policies is included in Table 1.

Table 1. Most Frequently Allowed Accommodations in State Policies

Accommodation

 Number of State Allowing Accommodation

 With or Without Limitations

Without Limitations

Individual Administration

44

44

Dictate Response to Proctor/Scribe

43

32

Small Group Administration

41

41

Large Print

40

38

Braille

38

33

Extended Time

37

32

Interpreter for Instructions

36

34

Read/Reread/Simplify/Clarify Directions

35

31

Computer/Machine Response

34

28

Read Aloud

34

4

Write in Test Booklet

33

28

Testing with Breaks

33

28

It is important to note that these are not necessarily the most frequently used accommodations, they are simply the most frequently allowed in state policies. Because two of the first three accommodations listed are not considered to be highly controversial (individual and small group administrations), and may actually be used during standardization procedures, these accommodations were deleted for purposes of this synthesis of accommodation research. For the remaining 10 accommodations, survey and empirical research was collected and summarized.

The Tindal and Fuchs (1999) report provided initial guidance on what research studies to include. Updated searches of ERIC were conducted to identify more recent research. Research on accommodations used for K-12 students in large-scale assessments was the primary focus. However, for several of the accommodations, limited research of this type was identified and so other research (accommodations for college students, intellectual assessments, etc.) was also included. The 10 accommodations are presented here in alphabetical order.


Braille Edition of the Test

Braille editions of tests are developed to accommodate students with blindness or significant visual impairments. Sixty-three possible dot combinations that can be read tactually form the basis for the Braille code, which is now universally accepted as the graphic symbol for blind readers (Barraga, 1983).

 

Explanation

Academic tests are not designed to measure students’ sensory abilities. However, if students with blindness are required to take a regular-print version of the test, their performance will be more representative of their visual disability than of their academic abilities. By taking a Braille version of a test, students with blindness are able to demonstrate their true academic abilities. Phillips (1994) pointed out that providing accommodations to students with sensory deficits has traditionally been a common practice.

 

State Use

The Braille accommodation is allowed by 33 of the 48 states that have statewide assessments (Thurlow et al., 2000). Five additional states allow this accommodation with limitations. Some states allow the Braille accommodation only on certain assessments and others may not allow the accommodation simply because a Braille version of the test has not yet been developed.

 

Empirical Research

Despite the fact that the Braille accommodation has been widely accepted, studies have shown that students using a Braille edition of a test may have trouble with certain types of items. Bennett, Rock, and Kaplan (1987) found that math items involving special symbols and tally systems were differentially difficult for students taking a Braille edition of the Stanford Achievement Test (SAT). In addition, Bennett, Rock, and Novatkoski (1989) found that differential item functioning existed for the Braille edition of the SAT math section, specifically among items that included figures in the stimulus and items for which spatial estimation was considered helpful in eliminating two of the options. Similarly, Coleman (1990) found that Braille readers had more difficulty with length measurement tasks than regular-print readers. A statewide stakeholder focus group in Texas suggested that tactual versions of print diagrams and pictures may convey very different information to students than visual diagrams and pictures, thus challenging the validity of the Braille version of the test. Although certain types of test items have been found to be more difficult for Braille readers, it appears that overall test scores are not greatly affected by this.

In addition to having difficulty with certain test items, students taking a Braille assessment may need extra time to complete the test. According to Phillips (1994), reading in Braille is a slower process than reading print materials. Braille shorthand can help speed up the process; however, Barraga (1983) suggests that extensive abbreviation can actually make it more difficult for students to recognize words in an unfamiliar context. Wright and Wendler (1994) found that the majority of students who were given twice the normal amount of time were able to finish the test.

 

Controversy

The Braille accommodation is typically regarded as a change that maintains the validity of the test. There is little controversy about whether this accommodation should be allowed. However, even though the accommodation is allowed, the scores from Braille editions of a test often are not aggregated or reported in the same way as other scores. Analyses of test performance at the item level also suggest that certain types of items are differentially difficult for Braille readers.

 

Recommendations

    Braille assessments should be developed for tests that currently do not have a Braille edition and should be offered to students with severe visual impairments who use Braille.

    Items that are difficult to translate into a Braille version of the test should be avoided during initial item development (e.g., picture items, tally items, etc.).

    Students using a Braille version of the test should be given extended time to complete the test, perhaps as much as double the time.


Computer/Machine Response

Students are using computers more than ever before. Similarly, computerized assessments are becoming more widely used. Because computerized assessments often involve both computer presentation and computer response, it is often difficult to disentangle the impact of the computer response mode on test administration.

 

Explanation

Many students complete writing assignments via computer. Students who are more familiar with completing assignments on computer may be more comfortable with completing a test in this mode. Because tests are intended to measure the outcomes of instruction, rationale for using this accommodation when students use computers in the classroom is clear. Also, many students with physical impairments that limit their abilities to respond with paper-and-pencil may not be able to demonstrate their true knowledge and skills unless a test is administered in an alternate format, such as via computer. Thus, computer/machine response has been studied for both writing assessments and assessments in other subject areas.

 

State Use

Of the 48 states with statewide assessment programs, 28 allow the computer/machine response accommodation. Six additional states allow it in some situations and prohibit it in others. States that limit its use commonly only allow this accommodation on certain assessments, or only allow it if all students in the classroom have the opportunity to take the test via computer (Thurlow et al., 2000).

 

Survey Research

Jayanthi, Epstein, Polloway, and Bursuck (1996) sent surveys to 708 general education teachers to obtain information on educators’ perceptions and use of a variety of accommodations. Results of the survey (401 respondents) indicated that of the 24 accommodations studied, allowing a word processor was somewhat difficult for elementary and middle school teachers, but a bit easier for high school teachers to provide. Overall, 8.3% of teachers surveyed reported using this accommodation, and teachers reported finding this accommodation somewhat helpful.

 

Empirical Research

Several studies have examined the use of computerized assessments for students in grades K-12. For this analysis, our primary focus was computer and word-processor response accommodations, rather than general computer administrations. Studies that examined student preferences related to the computer/machine response accommodations also are discussed.

Writing assessments. Eight studies were identified that examined the effects of having students complete writing tasks via computer. The majority of these studies involved middle-school students, and some included students with disabilities.

One of these studies showed students performing better under a computer response mode. Russell and Haney (1997) had two groups of middle school students complete an open-ended assessment on paper that involved questions in a variety of subject areas. Then, one group completed a performance writing assessment (which required extended written response) on paper, and one group completed it on computer. Paper-and-pencil test responses for the second test were then converted into computer responses. Results indicated that groups scored similarly on the open ended assessment, but that students using a computer for the performance writing assessment scored significantly better than those responding with paper-and-pencil (effect size = .94).

Other studies have suggested that positive effects of the computer response mode are mediated by student efficiency in keyboard use. In a later study, Russell (1999) again had middle school students answer open-ended test questions in different subject areas, some students using a paper-and-pencil format and others responding in the computer format. Results indicated that for students who keyboard at a speed of 20 words or more per minute, the paper-and-pencil format underestimated achievement level. For slower keyboarders, the computer response format underestimated achievement level. Similarly, Russell and Plati (2001) found that paper-and-pencil results severely underestimated the achievement of students in grades 8-10 who were accustomed to writing using a computer. This study used open-ended questions from the language arts section of the Massachusetts Comprehensive Assessment System (MCAS).

Hollenbeck, Tindal, Harniss, and Almond (1999) found no differences between the scores for seventh and eighth grade students composing stories for a statewide writing test in computer mode versus handwritten mode. In fact, students with disabilities performed significantly poorer when composing with computer than when handwriting their stories. Significant effects were found, however, for students who were allowed to use spell-checkers. It was not clear whether handwritten work was converted into word-processed work prior to evaluation in this study.

Several studies have looked at process related differences between handwritten and computer formats for writing. In a study by Vacc (1987), four eighth grade students (all certified as having mild mental handicaps) wrote letters in both computer and handwritten formats. Vacc found that writing letters on a microcomputer took more time, produced noticeably longer letters, and involved more revision than handwritten letters. The mean number of words written per unit of time was substantially higher for participants’ handwritten letters. Judges evaluations did not differ significantly between the two production modes.

MacArthur and Graham (1987) had fifth and sixth graders with learning disabilities compose and revise stories under three modes: handwritten, word processed, and dictation. Although the dictation condition resulted in significant differences, the handwritten and word-processed stories did not differ in terms of length, quality, story structure, mechanical or grammatical errors, or vocabulary. Word-processing was less than half as fast as handwriting. The overall amount of revision was similar across the handwritten and word-processed modes.

Another study looked at differences in how judges rated handwritten and word-processed essays for a statewide assessment (Hollenbeck, Tindal, Stieber, & Harniss, 1999). Middle school students completed the assessment in handwritten mode. Next, essays were word-processed, without changes in spelling or grammar. Original handwritten compositions were rated significantly higher than the typed composition on three of the six traits for the total group. Five of the six mean trait scores favored the handwritten essays. This study suggested that the two modes should not be used in the same evaluation system. Similarly, factor analysis studies of writing assignments corresponding to the Oregon statewide writing assessment found that when handwritten and word-processed essays were analyzed separately, all traits loaded on a single factor; however, when data from the two modes were analyzed together, two factors emerged: one for the handwritten traits, and one for the word-processed traits (Helwig, Stieber, Tindal, Hollenbeck, Heath, & Almond, 2000). Furthermore, low correlations were found for individual students’ ratings in computer and handwritten essays in this study.

Other computerized response assessments. Nine studies examining the use of computerized assessments for K-12 students in non-writing assessments were identified. Three of these studies involved assessment of writing-related skills (capitalization and spelling), and the others examined other skill areas (math, reading, vocabulary, etc.). Several of these studies demonstrated that students performed better under the paper-and-pencil format.

Watkins and Kush (1988) conducted a study in which elementary students with learning disabilities received a capitalization test under both a paper-and-pencil method and via computer (with conditions counterbalanced). During the computer administration, the students were presented items that measured particular capitalization objectives until the student reached a “mastery,” “non-mastery,” or “review” criteria for each objective. Analysis of the paper-and-pencil tests also involved coding student progress into one of the three categories. Results indicated that students scored higher on the paper-and-pencil test; however, the computer and paper-and-pencil tests did not significantly differ in their assignment of students to instructional interventions.

Varnhagan (1984) studied one group of students in special education and one group of third-grade regular education students who were both administered a spelling test in standard written format, and via computer. Results indicated that the written mode elicited higher scores than the computer format for both groups. Students generally took more time in completing the computer test. Contrary to this, Hasselbring and Crossland (1982) found the computer response format to be favored over the paper-and-pencil format on a spelling test for 9- to 14-year-old students with learning disabilities. The computer administration required less teacher administration time, and difficulties associated with deciphering student handwriting were avoided in the computer response format.

In a study by Swain (1997), third grade students (some with disabilities in math, some without) completed two math tests (Keymath and a computer-administered math test). Participants scored higher on all subtests of the paper-and-pencil test than on the computer-administered test. No interaction effects (mode of administration by disability status) were found.

Russell (1999) also found limited support for computer response accommodations on a math test. He found that regardless of keyboarding speed, middle school students’ performance on math assessments administered on a computer underestimated their level of achievement. He suggested that students may require scratch paper for many math items, which is difficult to provide in a computer administration.

Other studies have shown no differences according to assessment mode. Miller (1990) administered a receptive vocabulary screening test to students with cerebral palsy and students without disabilities. Students completed different forms of the test in both standard response mode and via computer (counterbalanced design). Results indicated that the two modes were equivalent for the groups of students. In the study by Russell and Haney (1997), in addition to completing a writing performance assessment, the two groups of students completed a multiple choice test in differing formats (computer and paper-and-pencil). While students who were administered the writing test via computer performed better on a writing performance task, the two groups did not significantly differ in performance on the multiple choice test.

Horton and Lovitt (1994) found mixed results in the area of reading. Their study involved having middle and high school students (some with and some without learning disabilities) complete reading inventories under two conditions: computer presentation and response, and paper-and-pencil administration. Results favored the computer mode on factual questions for the students overall; however, it appeared that middle school students with learning disabilities scored markedly better on social studies items in the paper-and-pencil format than in the computer format. On interpretive test items, the results of the group analysis showed no significant difference between the two assessment methods.

In addition to the Hasselbring and Crossland (1982) and Horton and Lovitt (1994) studies which provided some support for the computer response accommodation, one other study was identified that supported the use of this accommodation for students with disabilities. Burk (1999) studied the effects of a computer format versus a paper-and-pencil format for a variety of populations on multiple choice tests in multiple subject areas. Results indicated that students with learning disabilities performed significantly better on tests delivered via computer.

Student preferences. A number of the studies on the effects of computer accommodations also surveyed students’ perceptions of the computer administration. Hollenbeck et al. (1998) found that both students with and without disabilities tended to prefer the handwritten format for the writing assessment. Varnhagan (1984), Watkins and Kush (1988), and Horton and Lovitt (1984) all found students to prefer the computer format, despite their general findings that students performed better under the handwritten condition.

Overall, it appears that the computer/word-processor response accommodation has limited empirical support, especially for students with disabilities. The research review identified studies that both supported and discouraged its use, as well as studies that demonstrated no mode effect.

 

Controversy

Research findings indicate limited agreement about whether computer/machine response is a valid accommodation for students with disabilities. Teachers find it somewhat difficult to implement as an accommodation, and states are not unanimous in terms of allowing it. Also, it is clear that without necessary computer knowledge and typing skills, this accommodation may hinder student performance. However, with increasing use and access to computers, this accommodation may become more helpful and necessary for students. Furthermore, computer administrations can additionally offer easy access to other accommodations (e.g. computer read aloud, large print, etc.). Although this synthesis did not focus on the multiple accommodation possibilities available via computer, research studies have begun to analyze how several accommodations can be incorporated by using computers. Finally, if students cannot physically use a pencil to complete a test, it may make more sense for them to use a computer response format.

The fact that judges often rate word-processed essays lower than handwritten essays suggests that this accommodation may put students at a disadvantage unless there is appropriate training of scorers. Greater attention to these scoring implications will be needed if computer use in assessments increases as expected.

 

Recommendations

    Students must be provided the necessary typing and computer skills so that they are accustomed to using the computer/machine response accommodation. Without this support, caution should be taken in allowing a student to use the computer/machine response accommodation.

    This accommodation may also be relevant for students who physically cannot manipulate a pencil, and can more easily respond using a computer or machine. Of course, instruction in its use, and previous experience with computers are essential.

    Because studies have demonstrated the tendency to use different evaluation criteria for handwritten and computer versions, it is essential that evaluators first demonstrate the ability to use the same criteria before scoring assessments.

    When students use this accommodation, it is important to know that equipment is working properly. Also, students should frequently save their responses during testing to ensure that work is not inadvertently erased (CEC, 2000).


Dictate Response to a Scribe

Students are sometimes offered the opportunity to orally respond to test items and have a scribe record their answers instead of providing a written response. This accommodation is offered to students with a variety of disabilities including learning disabilities, behavioral disorders, mild mental retardation, physical impairments, and communication disorders.

 

Explanation

In order to accurately answer test items, students need writing skills. It is possible for test results to inaccurately represent a student’s ability in a particular subject area simply because the student’s writing skills hinder his or her ability to demonstrate achievement. For instance, students who can accurately solve a math problem may have difficulty demonstrating this ability if they cannot write.

 

State Use

Of the 48 states with statewide assessment programs, 32 allow the dictated response to a scribe accommodation. Eleven additional states allow it in some situations, and prohibit it in others. Some states prohibit the use of this accommodation on writing tests, but allow it for other subject areas. Another state specifies that students must indicate punctuation and spell all key words when this accommodation is used. This suggests that there is some disagreement about whether dictated response is a valid accommodation (Thurlow et al, 2000).

 

Survey Research

Several surveys have addressed the scribe accommodation. There is some evidence of change over time in the responses to this accommodation. The earliest survey was conducted by Gajria, Salend, and Hemrick (1994). They surveyed 100 teachers on their awareness, use, and perceived integrity of a variety of accommodations and received responses from 64 of them. Gajria et al. found that 82.8% of these respondents were aware of the dictated response accommodation, 50% reported using it in their classrooms, and 67% thought that it maintained the integrity of the test. It was ranked 31st of 32 modifications in terms of “ease of use,” and 28th in terms of effectiveness.

Results of the Jayanthi et al. (1996) survey (401 respondents of 708 surveyed) indicated that of the 24 accommodations studied, allowing dictated responses for tests was rated as easy and somewhat helpful to provide. Overall, 21% of the teachers who responded indicated that they currently use this accommodation in their classrooms.

Lambert, Dodd, Christensen, and Fishbaugh (1996) surveyed rural secondary teachers, asking about their willingness to provide and their current use of the dictated response accommodation. Of the 171 teachers surveyed, 121 surveys were returned. “Allow the student to dictate answers to a proctor” ranked tenth among several accommodations in terms of the percentage of teachers who had provided the accommodation in the past. Also, the average rating of willingness to provide this accommodation was 4 on a 1-5 rating scale, 5 being “very willing to provide.”

Most recently, Hollenbeck, Tindal, and Almond (1998) surveyed 633 regular and special educators, of which 166 responded. They found that 85% of the teachers who returned surveys had correct knowledge about this accommodation, and 71% reported using this accommodation.

 

Empirical Research

A limited amount of research has been conducted on the dictated response accommodation. Three empirical studies and four descriptive studies on the use of this accommodation for K-12 students were identified. Two of the empirical studies focused only on story writing, while the other investigated the effects of dictation on a math assessment. The four descriptive studies investigated the effects of accommodations on the scores of students with disabilities in the statewide assessments in Maryland and Kentucky, and were included due to the limited amount of empirical research available.

Fuchs, Fuchs, Eaton, Hamlett, and Karns (2000) studied the effects of the encoding accommodation (“teacher wrote student responses”) on math problem solving curriculum based measures (CBM) scores of students both with and without learning disabilities. Results indicated that this accommodation had a significantly greater impact on the scores of students with learning disabilities than those without learning disabilities.

MacArthur and Graham (1987) studied the differences between stories written by 5th and 6th grade students with learning disabilities under handwritten, word-processed, and dictated response modes. Each student composed a story in each of the three modes. Results indicated that the dictated stories were longer, of higher quality, and had fewer grammatical errors than handwritten or word-processed stories. This study suggests that dictating a story helps to free the student from spelling, penmanship, punctuation, and capitalization concerns, and allows students to express their ideas more quickly. Hidi and Hildyard (1983, as cited in MacArthur and Graham) conducted a similar study on students of average achievement in grades 3 and 5. Results showed an increase in the length of stories and essays, but no differences in terms of quality. MacArthur and Graham (1987) suggest that when the “mechanical and conventional demands of producing text” are removed, “learning disabled students compose more fluently and with better results.”

Tippets and Michaels (1997) studied the factor structure of the reading and language arts sections of the Maryland statewide assessment for 3rd, 6th, and 8th grade students. Students were assigned to accommodation categories according to their Individualized Education Programs (IEPs). Analysis of test scores indicated that students receiving the dictation accommodation in combination with other accommodations (read aloud, extended time, etc.) performed better than students not receiving any accommodations. The average scores for students receiving the dictated response accommodation were better than students who wrote their own responses. A concern that emerged was whether scribes were encouraging students to elaborate on responses, helping students organize their thoughts, or facilitating writing mechanics such as capitalization and punctuation skills. However, overall, the factor structures that emerged from accommodated and non-accommodated tests in this study were similar.

Koretz (1997) performed an in-depth study of the effects of the dictated response accommodation on the scores of students in the Kentucky statewide assessment system. Results indicated that the dictated response accommodation had a large impact on scores for students with learning disabilities, behavioral disabilities, and mild mental retardation, especially in social studies, reading, and science. In a second study of this assessment system, Koretz and Hamilton (1999) found this accommodation to have somewhat less of an impact on scores of students with disabilities.

Trimble (1998) studied the effects of the dictation accommodation in combination with other accommodations on the scores of 4th, 8th, and 11-12th grade students taking the statewide assessment in Kentucky. For 4th graders, the dictated response accommodation produced average scores above that of the total population, including students without disabilities. For 8th and 11th graders, students receiving the dictated response accommodation scored on average higher than those of students with disabilities not receiving the accommodation, but not above the average score of all students.

 

Controversy

Research on the dictated response accommodation for students with disabilities in the K-12 range has been limited and rather inconclusive. Although students with disabilities appear to receive higher scores on a variety of tests when using this accommodation, it is unclear whether the test scores obtained using this accommodation as implemented are valid. For students who could physically not respond without a scribe, dictated response seems like an appropriate accommodation to allow. Also, when a test is not meant to measure certain writing skills, it also seems like a legitimate accommodation. Contrary to this, if a test is designed to measure grammar and other specific writing skills, this accommodation may be less valid. Furthermore, teachers appear to be concerned about the difficulty of implementation, and consider it to be one of the least effective accommodations.

 

Recommendations

    The dictated response accommodation should be provided to students for whom it is determined will benefit from this accommodation on tests not specifically designed to measure writing ability that includes specific skills such as spelling.

    If students are unable to handwrite, but can efficiently type on a computer, a computer response accommodation should be considered prior to a dictated response accommodation.

    When the dictated response accommodation is allowed, scribes should be carefully trained in how to record responses (e.g., whether students must indicate spelling, punctuation, etc.). Scribes should also be monitored to be certain that students’ verbatim responses are recorded.

    Scribes should be familiar with the test so they can easily record student answers (CEC, 2000).

    More research should examine the effects of this accommodation on the test scores of students with and without disabilities.


Extended Time

Students with disabilities are often allowed extra time to complete tests that are normally administered under timed conditions. This accommodation is frequently given in addition to other common accommodations (read aloud, Braille, large print, etc.), and is offered to students with a variety of disabilities. Extended time is often understood as “unlimited time” although in some cases it simply means that a specified amount of time is added to the normal time allotted for students to complete the test. Research has used both.

 

Explanation

Deficits in information processing rates are common among students with learning disabilities (Huesman, 1999). When a test is timed, students with learning disabilities may not be given enough time to show what they know and can do. Students with other disabilities, similarly, may require extra time if their disability hinders their ability to respond in a timely manner. Extending the amount of time that a student has to complete a test, therefore, can help to alleviate these problems, and consequently can make the test a more accurate measure of the student’s level of achievement.

 

State Use

Of the 48 states with statewide assessment programs, 32 allow the extended time accommodation. Five additional states allow it in some situations, and prohibit it in others. Two other states prohibit the use of extended time. This suggests that there is some disagreement about whether extended time is a valid accommodation (Thurlow et al., 2000).

 

Survey Research

Several surveys have explored the extended time accommodation. The earliest, by Gajria et al. (1994) surveyed 100 teachers on their awareness, use, and perceived integrity of the accommodation. They found that 93.8% of the 64 teachers who responded were aware of the extended time accommodation, 90.6% reported using it in their classrooms, and 79.7% thought that it maintained the integrity of the test. It was ranked 14th of 32 modifications in terms of “ease of use,” and 15th in terms of effectiveness.

The Jayanthi et al. (1996) survey responded to by 401 general education teachers (of 708 sent) indicated that allowing extended time for tests was rated as “easy” and helpful to provide. Overall, 74% of the teachers who responded indicated that they currently use the extended time accommodation in their classrooms.

The Lambert et al. (1996) survey of 171 rural secondary teachers found that for 121 respondents, extended time ranked first among several accommodations in terms of the percentage of teachers who had provided the accommodation in the past. The average rating of willingness to provide this accommodation was 4.4 on a 1-5 rating scale, 5 being “very willing to provide.”

Finally, the Hollenbeck et al. (1998) survey of 633 regular and special educators found that 59% of the 166 teachers who returned surveys had correct knowledge about this accommodation. They also found that only 13% reported using this accommodation.

 

Empirical Research

Extended time is one of the most widely researched testing accommodations. Chiu and Pearson (1999) conducted a meta-analysis of 18 studies on this accommodation (including primary, secondary, and post-secondary studies) and found that the majority of studies looked at the effects of extended time on the test scores of students with learning disabilities. Extended time accommodation studies often examine whether there is a significantly greater effect of the accommodation on the scores of students with disabilities versus the scores of students without disabilities. Overall, Chiu and Pearson (1999) found a .07 effect size supporting the use of the extended time accommodation for students with disabilities over students without disabilities. Some studies have found a differential impact of the accommodation according to student disability status; others have found no differential impact.

Several studies examined the effects of extended time on test scores of students with disabilities in elementary, middle, and high school. Some of these studies examined the effects of the extended time accommodation on tests in separate content areas, including language arts and math. Others have looked at the effects of this accommodation on scores from standardized tests covering multiple subject areas.

Math. Eight studies were identified that examined the effects of extended time on math test scores. Gallina (1989) found that elementary students with Tourette’s syndrome scored in the average range on untimed math tests, but performed poorly on timed math tests. Centra (1986) found that students with learning disabilities derived significantly greater gains from extended time on the math section of the Scholastic Aptitude Test (SAT) than students without learning disabilities. Furthermore, Lewis and Green (1999) found there to be relatively few items displaying differential item functioning (DIF) for students with disabilities receiving the extra time accommodation on a math test. Finding few DIF items suggests that the accommodation did not change the underlying construct of the test.

Contrary to these findings that provide support for the use of the extended time accommodation for students with disabilities, Marquart (2000) found there to be no significant increase in math test scores for eighth grade students when receiving the test under an extra time condition (40 min) as opposed to a standard condition (20 min). This non-effect was found for all three groups of students studied (students with disabilities, students without disabilities, and at-risk students). Students, however, were found to prefer the extended time condition.

Munger and Loyd (1991) also found that 5th grade students with disabilities (both learning and physical) were not differentially affected by the untimed condition on a math test when compared to students without disabilities. Similarly, Montani (1995) found that both low achieving and normally achieving 3rd grade students benefited from extended time, and there was no significant difference in effect among the two groups of students. Murray (1987) looked at the effects of the untimed condition on spatial test scores of boys ages 12-14 years. Groups included: (a) boys without learning disabilities + boys with learning disabilities and average achievement, and (b) boys with learning disabilities and low achievement. Results indicated that the first group performed better in the untimed, but not in the timed condition.

Taken altogether, the results of these studies are inconclusive. However, results of a study by Fuchs, Fuchs, Eaton, Hamlett, and Karns (2000) may provide a better understanding of the effects of extended time on math tests. This study found that 4th grade students with learning disabilities did not differentially benefit from extended time on concept and application problems, but did differentially benefit from extended time on more innovative problem-solving type items. This suggests that on certain types of math problems, extended time may be more influential for students with disabilities.

Language arts. Five studies were identified that examined the effects of the extended time accommodation on language arts tests. Munger and Loyd (1991) found that 5th grade students with physical and learning disabilities were not differentially affected by the amount of “speededness” on a language use and expression test when compared to students without disabilities. Fuchs, Fuchs, Eaton, Hamlett, Binkley, and Crouch (2000) found that the extended time accommodation did not benefit fourth grade students with learning disabilities more than students without disabilities on reading tests. Huesman and Frisbie (2000) found that the test scores of students with learning disabilities on the reading section of the Iowa Tests of Basic Skills did significantly increase in the extended time condition; however, the scores of students without disabilities who were given extended time and told to “take time and work carefully” also significantly increased. Another important finding in their study was that the amount of extended time that students with learning disabilities required varied among individuals.

Perlman, Borger, Collins, Elenbogen, and Wood (1996) found that fourth grade students with learning disabilities and IEPs requiring the extended time accommodation finished the reading test within the recommended time limit, but that eighth graders with learning disabilities took substantially more time, and scored higher when they did use extra time. Overall, the results of the Perlman et al. (1996) study suggest that the tests may be more reliable when administered without time limits, and that merely knowing unlimited time is available may yield higher scores even if no additional time is used. Finally, Lewis and Green (1999) found that there were relatively few items that displayed DIF for students with disabilities receiving extra time on a language arts tests, which suggests that the same construct was being measured under the accommodated condition. Altogether, these studies seem to suggest that although there appears to be no definitive answer to the question of whether extended time is significantly better for students with disabilities, this accommodation may make the test results more valid for all students.

Ziomek and Andrews (1998) studied the effects of the extended time accommodation for students who had taken the American College Test (ACT) twice, and at least once under extended time conditions. Results indicated that they performed better under the extended time condition. The authors concluded that scores administered under non-standard conditions should continue to be “flagged” until score comparability can be determined; however, this recommendation recently has been negated. The Educational Testing Service (ETS) decided to stop flagging accommodated tests.

 

Controversy

Research on the extended time accommodation for students with disabilities in the K-12 grade range has been inconclusive. Although students with disabilities appear to gain from the use of this accommodation on a variety of tests, it is also true that students without disabilities display similar gains. For math tests, it appears that differential gains for students with disabilities may be evident on certain types of items. On reading tests, the research has provided less direction. Limited differential effects for students with disabilities, as opposed to students without disabilities, were found for this accommodation. One study suggested both groups of students benefit from extended time. It may be that the underlying question is whether the particular test is intended to measure how fast students can complete test items. If tests are meant to measure rate of completion, then extended time is not a valid accommodation. However, if rate of test completion is not meant to be evaluated, extended time may be considered more valid.

In this synthesis, studies involving extended time in addition to other accommodations were not examined in depth. It is understandable that extended time might be a valid accommodation for students who are also having the test read aloud, reading large print, etc. The question of whether extended time alone is a valid accommodation for students with disabilities remains somewhat unanswered.

 

Recommendations

    The extended time accommodation should be allowed to students for whom it is determined will benefit from this accommodation on tests not specifically designed to measure rate of test-completion.

    The amount of extra time necessary may vary among students; it may be important to determine for each student individually the amount of extra time that is most desirable. Some students may experience fatigue from a longer test, and others may benefit.

    Students who require this accommodation may need to take the test in another room (CEC, 2000).

    The extended time accommodation should be allowed when other accommodations provided necessitate extended time (e.g., read aloud, tape recorder, etc.).

    When only some students are allowed access to this accommodation, it may be necessary to indicate that they received this accommodation in reporting results.

    Test-makers should determine whether test-completion rate is of essence to what they are testing. If it is determined that testing rate is not intended to be measured, perhaps all students should be given the amount of time they need to complete the test.


Interpreter for Instructions

Students with hearing impairments are often allowed to have an interpreter communicate test instructions in sign language. Although entire tests (directions and items) may be communicated to students through interpreters, for the purpose of this summary, we will refer to using interpreters to communicate only test directions. According to Bourquin (1996), interpreting is “the process of receiving a message in one language and transmitting an equal meaning into a second language.” The form of interpretation used will depend on the language preferences of the student (e.g., American Sign Language, finger spelling, signing + speech, etc.).

 

Explanation

For students to demonstrate their knowledge and perform well on tests, they must understand what the test requires them to do. Students with hearing impairments cannot understand orally communicated test instructions unless they are able to lip-read or are provided an interpreter who can translate the spoken directions into a language that they can understand. Through the use of an interpreter for test directions, tests can measure achievement rather than sensory deficits of students with hearing impairments.

 

State Use

Of the 48 states that have statewide assessment programs, 34 allow “interpreter for instructions” as a testing accommodation. Two additional states allow this accommodation on certain tests, but not on others. One other state prohibits the use of this accommodation (Thurlow et al., 2000).

 

Survey Research

Petronio (1988) interviewed 10 students with deaf-blindness to find out what they needed and wanted most from interpreters. Results indicated that students wanted information conveyed in the language that they most easily understood. A number of students also needed special modifications to the normal manner of interpreting due to tunnel vision or problems viewing interpreters at a distance. Students also emphasized the need for interpreters to convey visual information about relevant things going on in the classroom. Finally, Petronio’s results indicated that students who had used interpreters for three or more years were the most satisfied with them; those who were new to using interpreters were more easily frustrated.

 

Empirical Research

Very limited empirical research exists on the use of an interpreter specifically for communicating achievement test instructions to students with hearing impairments. Because of this, other studies that examined adapting tests for students with hearing impairments were examined. Ray (1982) conducted a study of adapting the Weschler Intelligence Scales for Children-Revised (WISC-R) for students with hearing impairments. The adaptation involved using additional sample items prior to conducting each subtest, as well as using alternate instructions in which the directions were signed to the student. The directions were also modified in order to avoid using words that were not easily translated into sign language. Results indicated that when these adaptations were made, deaf children scored on average the same as other students on the performance scale of the WISC-R.

Sullivan (1982) similarly studied the effects of accommodations on the WISC-R for students who were deaf. Each student in the study had half of the subtests communicated via Total Communication (simultaneous verbal and sign language), and half of the subtest directions communicated verbally with gestures. This study found that WISC-R Performance Scale scores were higher when directions were communicated via Total Communication than when communicated only verbally and gesturally. An additional study (Sullivan, 1982) showed Total Communication of directions resulted in significantly higher scale scores than pantomiming directions. However, results also suggested that the population of students with hearing impairments is not homogeneous; adaptations therefore must be individualized.

Benderson (1988, as cited in Gorden, Stump, and Glaser, 1996) found that the scores of students with hearing impairments who were given special administrations of the Stanford Achievement Test (SAT) did not over- or under-predict their college achievement. For other students given special administrations, there was a great deal of fluctuation in terms of how well the test scores predicted future achievement.

 

Controversy

Providing an interpreter for instructions to students with hearing impairments appears to be a very reasonable testing accommodation. Students need to be able to comprehend task demands in order for them to demonstrate what they know and can do on a test. Perhaps the only controversy identified has to do with the quality of the interpretation. If interpreters do not communicate directions accurately to students with hearing impairments, the accommodation does not meet its intended purpose.

 

Recommendations

    An interpreter for instructions should be provided to students with hearing impairments who will benefit from such communication.

    Interpreters should be trained professionals (Bourquin, 1996), and should sign the instructions exactly as given (Ray, 1982).

    Interpreters should be aware of the unique needs of the students they are serving; different students may prefer differ language systems.

    Interpreters should check to be sure that the correct message is received by the student.

    Some words in the standardized directions may not be easily translated into sign language. Test developers should avoid using such words and phrases, and a standard procedure should be developed for how interpreters should communicate directions if such words and phrases are necessary.

    Students should have experience using an interpreter prior to the testing situation.

    More empirical research is necessary to demonstrate effective ways of adapting standardized tests for students with hearing impairments.


Large Print Edition of a Test

Large-print editions of tests are frequently used to accommodate students with visual impairments. Burns (1998) states that large-print applications are also used by students who are distracted by cluttered test formats and by very young children. Research studies have defined large-print in several similar ways: “14-point Helvetica font” (Mick, 1989), “double the size of regular print” (Burk, 1999), and “16-point type” (Grise, Beattie, & Algozzine, 1982).

Explanation

Few academic tests are developed with the intent to measure either visual abilities or a student’s degree of distractedness. However, the way students are currently tested requires that they have adequate visual abilities and that they attend well to regular print tests. Consequently, the score of a non-accommodated student with a visual impairment on a reading comprehension test may reflect his or her visual difficulties rather than his or her comprehension skills. A large-print edition of a test can ensure that the student’s academic abilities rather than his or her visual abilities are measured.

 

State Use

Of the 48 states that have statewide assessment programs, 38 allow large print as a testing accommodation (Thurlow et al., 2000). Two additional states allow it in some situations but not others. One of these states allows large-print only if it is not offered in addition to the extended time accommodation. None of the states that allow large print indicate that it can be used only by students with visual disabilities.

 

Survey Research