Tuesday, September 06, 2005

(Article written for the academic journals in 1990 but not accepted for publication)


J.J. Ray

University of N.S.W., Australia


There appears to be a strong consensus among those who research acquiescent response tendency to the effect that one-way-worded scales are undesirable. Many psychologists, however, continue to use such scales. This suggests that the occasional dissenters who have defended such scales are surprisingly influential and should be taken seriously. Some faults in the reasoning of such commentators are therefore outlined. Some further data showing how the presence of acquiescent bias can be detected are presented using a balanced form of the Rokeach Dogmatism scale. It is shown that only the presence of an acquiescent response tendency serves to explain the findings presented. The balanced Dogmatism scale is shown to work best with highly educated respondents. This suggests that the one-way-worded version of the Dogmatism scale also has serious problems.


I am indebted to a massive bibliography supplied to me some time ago by Dr Lewis R. Goldberg of the Oregon Research Institute for the information that the earliest academic paper on acquiescent response tendency goes all the way back to the time of the First World War (Cogan, Conklin & Hollingworth, 1915). I have been unable to check the reference personally, as the library facilities available to me are less than encyclopedic, but I have no reason to doubt the accuracy of such a distinguished scholar as Dr Goldberg.

As the references that follow will show, acquiescence has continued to be a phenomenon of interest to researchers right up to the present day. All of the references given show concern with acquiescent tendency as a problem in attitude and personality measurement (Altemeyer, 1981; Bachman & O'Malley, 1984; Bass, 1955; Bentler, Jackson & Messick, 1971 & 1972; Berkowitz & Wolkon, 1964; Blau & Katerberg, 1982; Block, 1965; Byrne & Bounds, 1964; Campbell, Siegman & Rees, 1967; Cherry & Byrne, 1977; Cloud & Vaughan, 1970; Cogan, Conklin & Hollingworth, 1915; Couch & Keniston, 1960; Cronbach, 1946; Davison & Srichantra, 1988; Duhan & Keown, 1988; Eisenman & Townsend, 1970; Gage & Chatterjee, 1960; Gibbins, 1968; Goldsmith, 1986 & 1987; Goldsmith, White & Stith, 1987; Heaven, 1983; Hui & Triandis, 1985; Jackson, 1967; Krenz & Sax, 1987; Lambley & Gilbert, 1970; Lee & Warr, 1969; Lentz, 1938; Martin, 1964; Milbrath, 1962; Neel, Tzeng & Baysal, 1983; Oskamp, 1970; Peabody, 1966; Ray, 1970, 72, 74, 79a, b & c, 80a & b, 81, 82a & b, 83, 84a, b & c, 85a & b; Ray & Pratt, 1979; Roberts, Forthofer & Fabrega, 1976; Schmitt & Stults, 1985; Trott & Jackson, 1967; Vagt & Wendt, 1978; Van Heerden & Hoogstraten, 1979; Wilson & Patterson, 1968; Winkler, Kanouse & Ware, 1982). The references listed are only a small sub-set of those that might have been listed with the major omissions being less recent studies.

With such a strong consensus that acquiescence is a problem requiring corrective measures (e.g. use of "balanced" scales containing equal proportions of "True" and "False" items) in attitude and personality measurement, one would think that there was little left to be said on the topic and that all social scientists would now use balanced scales. Surprisingly, however, this is not so. There have been isolated apologists for one-way-worded scales (e.g. Rokeach, 1967; Rorer, 1965; Samelson, 1972) and it appears that such writings have been seized on by many researchers as saving them from the need to use balanced scales. The reasoning seems to be something like: "Some say balanced scales are needed and some say they are not so both options are equally legitimate". While such thinking may be understandable at some level it is remarkably poor science. A variety of authors have shown that acquiescent response tendency can have important correlates of its own (e.g. Gage & Chatterjee, 1960; Milbrath, 1962; Eisenman & Townsend, 1970; Goldsmith, 1987; Goldsmith, White & Stith, 1987; Heaven, 1983; Blau & Katerberg, 1982) so the correlates of any one-way-worded scale will always be susceptible of at least two interpretations. Science, however, is precaution-oriented and failing to take measures that will preclude alternative interpretations of one's findings is quite simply careless and asks faith of the reader. Faith and science are hardly of a piece.

The major point that critics of balanced scales (e.g. Rorer, 1965) seem to seize on is that different measures of acquiescence often show little correlation between themselves (e.g. McGee, 1962). They then seem to reason: "Well, if it does not generalize it cannot be a problem". There is some truth in that reasoning, of course. The trouble is that sometimes acquiescence scores do intercorrelate (e.g. Vagt & Wendt, 1978). So the same reasoning consistently applied must say that on such occasions acquiescent tendency could be a problem. But how can we know in advance which circumstance will prevail? How can we know whether we will have a problem or not? I will not be so foolhardy as to say that we will never be able to know but certainly the seemingly obvious predictors that I have tried did not work (Ray, 1983). That being so, it would seem the path of prudence always to use balanced scales so that if acquiescence problems do arise they can be both detected and controlled for.

Another approach used by critics of balanced scales is to point out that double agreement with oppositely-worded items is not necessarily a sign of acquiescent bias (e.g. Rokeach, 1967). This is, of course, perfectly true but it is, for all that, to fail to see the wood for the trees. Surely such double agreement is a problem whatever its source. It shows that the scale author has got it wrong in one way or another and that items he intends to be of opposite import are not so seen by those he surveys. It is a clear indication that the scale lacks construct validity. Only by using balanced scales, however, can we detect such validity deficiencies.

Other commentaries on the arguments used by the critics of balanced scales have been widespread but perhaps Peabody (1966), Campbell, Siegman & Rees (1967), Jackson (1967), Bentler, Jackson & Messick (1971 & 1972), Bentler (1973) and Ray (1983 & 1985b) might be specifically mentioned.

In any event, it seems clear that many psychologists remain unpersuaded of the importance of using only balanced scales, so yet more efforts to demonstrate the usefulness and informativeness of such scales seem needed. The tenacity with which some researchers cling to their one-way-worded scales can, in fact, be remarkable. One recent author (Van Ijzendoorn, 1989) used a one-way-worded scale even though he knew of the arguments against such scales and even though he knew of an alternative measure of the same construct in balanced form!

As we have seen, one of the persistent defenders of one-way-worded scales was Rokeach (1967). This may be connected with the fact that his widely-used Dogmatism ('D') scale is one-way-worded. It seems appropriate, therefore, to see how badly (if at all) his Dogmatism scale is acquiescence-affected. This has now been possible for some time since the production of two different balanced revisions of the 'D' scale (Ray, 1970 & 1974). Such an examination will be attempted below.


The results to be reported below were in fact obtained in 1972. Some results of the study concerned were reported fairly promptly (Ray, 1974; Ray & Martin, 1974) but a full write-up of the findings was not carried to completion and became overlooked under the pressure of other work. As the results do not appear to be in any way time-dependent, however, it still seems appropriate to report them here.

The Statistics
For a start, it is accepted that double-agreements with original and reversed items cannot be seen as proof that acquiescent bias is present (Rokeach, 1967). Other methods will be needed if such a demonstration is to be accomplished. For similar reasons, it is also not sufficient to show that agreement is by far the commonest response to the items. The presence of such a phenomenon may simply show that both sides of the argument are persuasively put. The degree of agreement could, in other words, be quite meaningful and not at all vacant.

I have long used two statistics to demonstrate the presence of meaningless acquiescence: coefficient alpha and r(P-N). Acquiescent tendency should inflate alpha and deflate r(P-N). As Davison & Srichantra (1988) have recently reported findings that generally support my approach I will confine myself here to pointing out that the reasoning behind both indices is fairly simple. Anything that causes items to be responded to similarly will cause the items to correlate positively. Those who score highly on one item will also tend to score highly on other items. Acquiescent bias will therefore tend to increase the correlation between one-way-worded items. Coefficient alpha, however, can be represented as average inter-item correlation weighted by test length (Cronbach, 1951; Lord & Novick, 1968) so it should rise as acquiescent bias affects the one-way-worded scale (test length or number of items being constant).
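The dependence of alpha on mean inter-item correlation can be sketched numerically. The sketch below uses the standardized form of alpha, k * r / (1 + (k - 1) * r), which assumes equal item variances and so is only an approximation to alpha computed from raw scores. The function name, the 18-item length and the two correlation values are hypothetical and chosen only for illustration.

```python
def standardized_alpha(n_items: int, mean_r: float) -> float:
    """Standardized coefficient alpha from test length and the mean
    inter-item correlation (assumes equal item variances)."""
    return n_items * mean_r / (1.0 + (n_items - 1) * mean_r)


# Hypothetical one-way-worded scale of 18 items. The first mean correlation
# stands for meaningful covariation only; the second adds a small amount of
# correlation produced by a shared tendency to agree.
alpha_without_bias = standardized_alpha(18, 0.10)
alpha_with_bias = standardized_alpha(18, 0.15)

print(f"alpha without bias: {alpha_without_bias:.3f}")
print(f"alpha with bias:    {alpha_with_bias:.3f}")
```

With test length held constant, any source of extra positive correlation among the items, meaningful or not, pushes alpha upward.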

The second index is useful as representing the outcome of two opposing pressures. The statistic r(P-N) represents the correlation between the two subscales made up respectively of the positively-worded and negatively-worded items. The opposition in meaning between these two groups of items should cause the items to be responded to oppositely and thus bring about an r(P-N) that is high and negative.

Insofar as meaning-independent acquiescence is present, however, it will cause all items to be responded to similarly and this could lead to an r(P-N) that is high and positive. If both things are true (i.e. the items are of opposed meaning and meaningless acquiescence is present) the two tendencies should cancel one another out and leave an r(P-N) that approximates zero. The latter circumstance quite commonly prevailed with early attempts to balance the F scale (Christie, Havel & Seidenberg, 1956).
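The cancelling of the two tendencies can be illustrated with a deliberately simple model, which is an assumption of this sketch rather than anything fitted to data. Each item response is taken to be the sum of a trait component (entering positively for positive items and negatively for negative items), an acquiescence component common to all items, and independent error. Items are scored as worded, not reverse-scored. The function name and all variance values are hypothetical.

```python
def expected_r_pn(k: int, var_trait: float, var_acq: float, var_error: float) -> float:
    """Expected correlation between the positive and negative subscale totals
    (k items each) under the model: response = direction * trait + acquiescence
    + error, with the three components mutually independent."""
    covariance = k * k * (var_acq - var_trait)
    subscale_variance = k * k * (var_trait + var_acq) + k * var_error
    return covariance / subscale_variance


# Hypothetical variances for three situations.
meaning_only = expected_r_pn(k=18, var_trait=1.0, var_acq=0.0, var_error=4.0)
acquiescence_only = expected_r_pn(k=18, var_trait=0.0, var_acq=1.0, var_error=4.0)
both_equal = expected_r_pn(k=18, var_trait=1.0, var_acq=1.0, var_error=4.0)

print(f"meaning only:      {meaning_only:+.3f}")       # high and negative
print(f"acquiescence only: {acquiescence_only:+.3f}")  # high and positive
print(f"both, equal size:  {both_equal:+.3f}")         # zero
```

Under this model the sign of r(P-N) depends only on whether trait variance or acquiescence variance is the larger, which is why a near-zero r(P-N) is compatible with items that are genuinely opposed in meaning.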

Subjects and materials

The Ss for the study were students in the School of Behavioural Sciences at Macquarie University in Sydney, Australia, who completed a questionnaire containing the Ray (1970) Balanced Dogmatism (BD) scale in class time. There were 74 first-year students, 52 second-year students and 51 third-year students, to a total of 177.


The results of interest here are given in Table 1.


Table 1. Statistics from the Ray (1970) BD scale when applied to three groups of students

Statistic     Year 1    Year 2    Year 3

Alpha           0.83      0.80      0.72
r(P-N)         -0.34     -0.37     -0.51
BD mean        86.95     86.08     79.55
BD S.D.        14.42     13.02     10.71
n                 74        52        51

It will be noted that there was a trend (non-significant) for coefficient alpha to decline as exposure to the University increased. At the same time r(P-N) tended to rise in magnitude, becoming more strongly negative. Thus the more sophisticated subjects (third-year students) showed less organization of measured attitudes (as indexed by alpha) even though the intended opposition between the positively and negatively worded items was most evident in their responses.


If the students with greater exposure to the University had in fact (as we might have expected) had attitudes which were more organized, thought-out and consistent, one would surely expect that both indices (alpha and the magnitude of r(P-N)) would have risen (both being measures of internal consistency). Ray (1970) certainly found students to have more organized attitudes than the general public and Sniderman, Brody & Kuklinski (1984) found that education generally increased attitude organization. That did not seem to happen on the present occasion, however. Why? Acquiescent bias provides the answer.

It must be reiterated that alpha is expressible as the weighting of mean inter-item correlation against the number of items. Since the number of items (36) is constant for all groups in the present study, it follows that variations in alpha are wholly traceable to variations in mean inter-item correlations. The implication of reduced overall correlations combined with increased (or even stable) pos-neg correlations can only be therefore that the intercorrelations between the positive items only and the negative items only (those being the only other correlations) must have dropped. And this is precisely what reduced acquiescent set would have led us to expect! Why? Because the positive items alone or the negative items alone form one-way-worded scales. As mentioned above, the effect of acquiescent bias on such scales is to inflate the intercorrelation between their items. If such bias is reduced, however, the intercorrelations between such items will drop (i.e. the correlation due to common direction of wording will be removed) and the contribution of such correlations to the average intercorrelation also therefore will drop -- leading in turn to the effect observed: An alpha that is slow to rise and which may even fall.
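The step from alpha to mean inter-item correlation can be made concrete with the alphas reported for the three year groups. The sketch below inverts the standardized form of alpha, r = alpha / (k - (k - 1) * alpha), so the implied correlations are approximations that assume equal item variances. The count of item pairs further assumes that the 36 items divide into 18 positive and 18 negative items; that split is an assumption of this sketch. The function name is hypothetical.

```python
from math import comb


def implied_mean_r(alpha: float, n_items: int) -> float:
    """Mean inter-item correlation implied by a standardized alpha."""
    return alpha / (n_items - (n_items - 1) * alpha)


N_ITEMS = 36
reported_alpha = {"Year 1": 0.83, "Year 2": 0.80, "Year 3": 0.72}

implied = {group: implied_mean_r(a, N_ITEMS) for group, a in reported_alpha.items()}
for group, r in implied.items():
    print(f"{group}: implied mean inter-item r = {r:.3f}")

# Assumed 18/18 split: item pairs lying within the positive subscale or within
# the negative subscale, versus pairs that cross the two subscales.
within_pairs = 2 * comb(18, 2)
cross_pairs = 18 * 18
total_pairs = comb(N_ITEMS, 2)
print(f"within-subscale pairs: {within_pairs} of {total_pairs}")
print(f"cross-subscale pairs:  {cross_pairs} of {total_pairs}")
```

On these assumptions the implied mean correlation falls from about 0.12 in the first-year group to about 0.07 in the third-year group. Since the within-subscale pairs make up roughly half of all item pairs, a fall in the overall mean alongside a stable or stronger positive-negative opposition has to come from those within-subscale correlations.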

Thus reduced acquiescent set due to the mentally organizing effect of increased education does provide a complete explanation for the effects observed on the present occasion where the effects of education alone would not do so.

Clearly, then, the Dogmatism items are acquiescence-affected and need to be used in conjunction with reversed items in order to control for any effects this may have. The fact that increased higher education causes the Dogmatism items to be responded to more and more as they should be does, of course, have a corollary: The Dogmatism items are less and less valid the less educated are the respondents to whom they are applied. If increased education reduces acquiescent bias, lesser education should lead to more of it. It would appear likely, then, that the Dogmatism scale in a balanced form is suitable for use only with students. This is also what emerged from Ray's (1979c) study of balanced Dogmatism scales applied to a general population sample, so it is revealed that even the balanced Dogmatism scale is a very limited measure. How much less valid must be the one-way-worded form of the scale.


