|
 
- 帖子
- 7022
- 精华
- 0
- 开心果
- 40009

|
4#
发表于 2008-7-13 16:00
| 只看该作者
Conjunction fallacy test phase. After training, the computer presented the following instructions (borrowing from Fantino et al., 1997) for the CF test phase:
During this stage of the experiment you must make judgments based on what you've previously learned about the relationships between nonsense words to respond to novel arrangements. As before, you will start by clicking on the item at the top of the screen. However, during this phase, after clicking on the top item, you will now be shown only one choice at the bottom. Below the choice, you will see a drag box which you can adjust to enter a percentage from 0-100%. You will be asked to rate how likely you think it is that each choice displayed is correct given the combination of nonsense words on the top. For example, entering "0" would suggest that given your experience you think it is virtually impossible that the arrangement of nonsense words shown on the bottom would be a correct answer given the combination of nonsense words on the top. Entering "100" would suggest that given your experience you think it is virtually certain that the arrangement of nonsense words shown on the bottom would be a correct answer given the combination of nonsense words on the top. Since each new display asks you to enter a new percentage from 0-100, it is NOT EXPECTED that the numbers you enter will sum to 100. You will not be told if specific responses are correct or incorrect, but you may be awarded bonus points. You will be told if you have been awarded bonus points after all of your ratings have been entered.
Click the "yes" box below when you understand these instructions and are ready to begin.
When the participants indicated readiness to proceed, the computer presented 18 test trials designed to be analogous to those used in traditional CF studies. Specifically, the sample now consisted of three members of one of the two stimulus classes (e.g., FOW, XID, BAX or REK, VAG, PID). For each three-member sample (analogous to the descriptive framing information), the participant was then asked to enter a likelihood rating that each of nine comparison stimuli (that differed in terms of their "representativeness"; that is, class membership), was a correct answer. The comparison stimuli were presented one-by-one and, during this phase, were centered from left-to-right and middle-to-bottom on the screen. For example, in the presence of the compound stimulus TIV, XID, BAX, the participant rated, in the following order, the likelihood that FOW (representative), FUH (nonrepresentative), FOW,FUH (conjunction), REK (non-representative), LOQ (representative), REK,LOQ (conjunction), YOF (never before seen nonsense word), BOK (never before seen nonsense word), and YOF, BOK were correct answers. For the preceding example, the conjunction fallacy would be identified by the following pattern of likelihood ratings: FOW,FUH > FUH and REK,LOQ > REK. Of note, YOF, BOK, and YOF,BOK were never before seen nonsense words used to establish the "base rate" of conjunction responding (i.e., instances where YOF,BOK > YOF or BOK). The same procedure was then conducted for the compound stimulus MEP, VAG, PID. Thus, across the 18 test trials there were four conjunctions assessed involving stimuli used in the training and two involving never-before-seen nonsense words. During this phase, responses (the percentages entered by the participant) did not result in any feedback; however, to maintain motivation, the participants were told that their responses might produce "bonus points," and that they would be informed if they received any bonus points after all of their responses were entered. To their cumulative point total, all participants received 18 additional points, 1 for each response, independent of the percentages entered.
Equivalence test phase. The purpose of the equivalence test phase was to determine if the participants derived relationships between the nonsense words through their common node. That is, were equivalence classes indeed formed such that participants treated FOW, XID, BAX, TIV, and LOQ as related based on the common node SOJ and REK, VAG, PID, FUH, and MEP based on the common node ZAB?
The screen set-up during equivalence testing was identical to that in the training phase. Across trials the sample and comparison stimuli changed such that the relationships between the various members of each class were assessed. In this phase trials were presented in blocks of 50. The blocks of 50 were designed to combine the 10 trials from the mixed training phase with the 40 equivalence test items (i.e., BC, BD, BE, BF, CB, CD, CE, CF, DB, DC, DE, DF, EB, EC, ED, EF, FB, FC, FD1 FE relations for both classes). However, because of a computer programming oversight, the 50 trial blocks actually included 11 mixed training trials and 39 untrained test trials. As a result, not all directions of all derived relations were assessed, but all possible derived relations were assessed (e.g., C1[arrow right]D1 was not assessed; however, D1[arrow right]C1 was). Because the training procedure was comparison-as-node, such that both C1[arrow right]D1 and D1[arrow right]C1 trials constitute tests of equivalence of the CD relation, and all derived relations were assessed, the omission of the few test items, while unfortunate, does not appear to be an oversight that would alter conclusions.
During the equivalence test, trials were presented randomly within each block and continued until 2 blocks of 50 trials (100 total trials) were presented. When the subject made his or her choice no feedback was delivered. However, to maintain motivation, participants were informed that despite the absence of feedback there were correct choices and that correct choices would be recorded by the computer and generate points that would be awarded at the conclusion of a 50-trial block (responses demonstrating derived relations were considered the correct response). After the 50-trial blocks the point counter was displayed for 5 s.
Conjunction fallacy replication test. In the final phase of the experiment the participants were presented with the Bill scenario. The computer presented the following information at the same height as the sample stimuli in the previous phases:
Read the passage in the box below closely and click on "yes" when you are finished.
Bill is 34 years old. He is intelligent but unimaginative, compulsive, and generally lifeless. In school, he was strong in mathematics but weak in social studies and humanities.
After the participant entered "yes" the following instructions appeared on the screen (in addition to the passage, which remained on the screen):
You will now be shown various statements about Bill. You will be asked to indicate the likelihood that each statement is true of Bill by entering a percentage using the drag bar. Since each statement asks you to enter a new percentage from 0-100, it is NOT EXPECTED that the numbers you enter will sum to 100. You will not be told if specific responses are correct or incorrect, but you may be awarded bonus points. You will be told if you have been awarded bonus points after all of your ratings have been entered. Please click on "yes" when you understand these instructions.
The participant was then asked to indicate the likelihood that each of the following statements presented one at a time, in blue font, and in this order, were true by entering a percentage on the drag box.
A. Bill is an accountant
B. Bill plays jazz for a hobby
C. Bill is an accountant who plays jazz for a hobby
During this phase, responses (the percentages entered by the participant) did not result in any feedback; however, to maintain motivation, the participants were again told that their responses might produce "bonus points," and that they would be informed if they received any bonus points after all of their responses were entered. In fact, all participants received 1 point per response, independent of the percentages entered.
After completing the CF replication phase the participant was informed by the computer that the experiment was done along with her or his final point total. The experimenter then paid the participants and thanked them for their participation.
Results
1. What proportion of participants showed the CF pattern on the analogue procedure and how did these rates compare to rates on the Bill scenario?
Incidence during the analogue conjunction fallacy test phase. Responses analogous to the CF were identified when the conjunction was rated higher than the low likelihood constituent. Using this method, across opportunities, responding analogous to the CF occurred 60% of the time. On each of the four opportunities, 67%, 56%, 59%, and 59% of the sample evinced the conjunction fallacy pattern. In addition, showing the CF pattern on one opportunity was a significant predictor of showing the CF pattern on other opportunities ([straight phi] = .37-.47, ps = .01-.05).
We also examined how often participants rated the conjunction of never-before-seen nonsense words as more likely than either one of the constituent never-before-seen nonsense words. Across opportunities responding analogous to the CF occurred 46% of the time, with rates of 44% and 48% observed on each of the two opportunities. However, showing the CF pattern on the first opportunity was not associated with CF responding on the second occasion ([straight phi] = .18, p = .34). Moreover, there was no relationship between number of CF responses to the never-before-seen stimuli and CF responses to the stimuli to which the participant was exposed during training (tau = .02, p = .91). Thus, it appears that something very different was happening when participants responded to the stimuli to which they were exposed during training compared to those that were never encountered until the CF test phase.
Incidence during the conjunction fallacy replication test. When responding to the Bill scenario, 69% of the sample evinced the CF. A binomial analysis suggested that there was no significant difference between the observed frequency of .69 on the Bill scenario and a test proportion of .60 (the frequency observed during the analogue CF test), p = .22. Thus, the rates of CF responding were similar during the analogue CF test and the CF replication test. Moreover, commission of the CF on the Bill scenario was not predictive of greater commission of the CF during the anaologue test, t(24) = -.21, p = .83. Thus, it did not appear that participants had a response set or general proclivity towards CF responding that was stable across stimulus arrangments.
2. Did the formation of equivalence relations alter the propensity to demonstrate the analogue CF?
In considering these data it is important to remember the experimental timeline. At the time of testing for responding analogous to the CF, the participants had received only the comparison-as-node training (i.e., B[arrow right]A, C[arrow right]A, D[arrow right]A, E[arrow right]A, F[arrow right]A). Thus, none of the members of the compound sample stimuli used in the analogue CF test were part of an explicitly trained class nor did any explicit training establish relations between any members of the compound sample stimuli and the comparison stimuli used in the analogue CF test. Testing for the emergence of the equivalence classes occurred in the subsequent phase and without feedback.
Equivalence classes were considered to have formed if participants responded correctly on 90% or more of the emergent relations items during the equivalence test phase, a criterion that has been used by others for identifying class formation (see Lazar, Davis-Lang, & Sanchez, 1984). According to this criterion, equivalence classes formed for 13/27 participants (48%). The average percentage correct for those 13 participants on the items assessing emergent relations was 97% (SD = 3%), compared to the average of 70% (SD = 15%, range 42-87%) for the remaining 14 participants, t(25) = 6.6, p < .001.
Based on these equivalence test data, two groups were formed: (a) those who were positive for equivalence class formation (n = 13), and (b) those who were negative for equivalence class formation (n = 14). We then explored whether there were group differences in the number of analogue CF responses, finding a statistically significant difference, F(1, 25) = 17.31, p < .001. Specifically, the mean number of analogue CF responses (with four being the maximum) for participants showing equivalence class formation was 3.38 (SD = 1.12, median & mode = 4), compared to a mean for the remaining participants of 1.50 (SD = 1.23, median & mode = 1). The individual participant data (presented in Figure 2) corroborated the group level analysis and suggested that this finding was fairly robust at the level of the individual participant.
Also conducted was an analysis of how often participants in each group rated the conjunction of never-before-seen nonsense words as more likely than either one of the constituent never-before-seen nonsense words. This analysis revealed no group difference, F(1, 25) = 0.25, p = .62. Furthermore, there was no correlation between group membership (> 90% vs. < 90% equivalence test performance) and commission of the CF on the Bill scenario, [straight phi] = .05, p = .79. In sum, the data suggest that commission of the analogue CF was governed by the nature of the stimulus relations established during the experiment with those who showed derived relations being much more likely to evince the CF pattern.
3. What was the magnitude of the conjunction fallacy, did it appear to represent an average?
The magnitude of the CF was determined by calculating the difference between the likelihood rating given to the conjunction and the nonrepresentative item. If this was positive, we assessed whether the difference appeared to represent an averaging of the representative and nonrepresentative items.
Magnitude and averaging on the analogue conjunction fallacy test. Across the sample (N = 27), the mean (SD) likelihood ratings on the analogue CF test for the representative, nonrepresentative, and conjunction items were 62.95 (30.69), 23.37 (22.76), and 34.00 (20.47), respectively, for a magnitude of 10.63, which was lower than would be expected by the averaging hypothesis (19.79). However, this analysis includes the entire sample, which is known to systematically differ in terms of commission of the CF based on equivalence class formation. When only the group of participants who demonstrated equivalence class formation were considered, the mean (SD) likelihood ratings on the analogue CF test for the representative, nonrepresentative, and conjunction items were 75.92 (24.87), 13.96 (15.75), and 47.23 (17.30), respectively, for a magnitude of 33.27, which was quite consistent with what would be expected by an averaging hypothesis (30.98). The upper panel of Figure 3 presents the observed conjunction likelihood ratings with what would be expected based on averaging for each participant. If these are identical they will fall on the diagonal. For 7/13 participants (54%), observed minus expected difference scores were ± 10 points or less, while 3/13 (23%) had difference scores that were lower than expected and 3/13 (23%) had difference scores that were higher than expected based on averaging. Across the group, the observed minus expected difference scores were not statistically significantly different from zero, (M = 2.29, SD = 13.94, t(12) = .59, p =.57.
The group who failed to demonstrate equivalence class formation showed no evidence of averaging and, in fact, the mean likelihood ratings for the conjunctions of 21.71 (14.96) were lower than the ratings for the nonrepresentative items of 32.11 (25.22). The ratings for the representative items averaged 50.91 (31.50). The middle panel of Figure 3 presents the observed conjunction likelihood ratings with what would be expected based on averaging for each participant. Across this group, the observed minus expected data were statistically significantly different from zero, M = -19.79, SD = 18.28, t (13) = -4.05, p =.001.
Magnitude and averaging on the conjunction fallacy replication test. Across the sample (N = 26, a computer error produced missing data for 1 participant), the mean (SD) likelihood ratings on the Bill scenario for the representative, nonrepresentative, and conjunction items were 80.96 (23.39), 7.0 (10.97), and 23.77 (20.11), respectively, for a magnitude of 16.77, which was lower than what would be expected by averaging. Similarly, when only the data from the 69% of the sample that showed CF responding was considered, the respective likelihood ratings were 88.89, 6.22, and 29.72; again, lower than what would be expected by averaging. The bottom panel of Figure 3 presents the observed conjunction likelihood ratings with what would be expected based on averaging for the participants demonstrating CF responding. For 7/18 participants (39%), observed minus expected difference scores were ±10 points or less, and the remaining 11/18 (61%) made likelihood ratings that were systematically lower than expected by averaging. As a result, the observed minus expected difference scores were statistically significantly different from zero, M = -17.83, SD = 14.78, t(17) = -5.12, p < .001. Thus, although the conjunction produced an intermediate response on the Bill scenario and a subset of participants' responses was consistent with averaging, the group average was significantly lower than what would be expected based on the averaging hypothesis. |
|