Discrepancies in autobiographical memories — implications for the assessment of asylum seekers: repeated interviews studyBMJ 2002; 324 doi: http://dx.doi.org/10.1136/bmj.324.7333.324 (Published 09 February 2002) Cite this as: BMJ 2002;324:324
- Jane Herlihy, psychologist in clinical traininga (, )
- Peter Scragg, lecturer in psychologyb,
- Stuart Turner, consultant psychiatrista
- a Traumatic Stress Clinic, London W1T 4PL
- b University College, University of London, London WC1E 6BT
- Correspondence to: J Herlihy
- Accepted 13 September 2001
Objective: To investigate the consistency of autobiographical memory of people seeking asylum, in light of the assumption that discrepancies in asylum seekers' accounts of persecution mean that they are fabricating their stories.
Design: Repeated interviews.
Setting: England, 1999 and 2000.
Participants: Community sample of 27 Kosovan and 12 Bosnian refugees.
Main outcome measures: Discrepancies in repeated descriptions of one traumatic and one non-traumatic event, including specific details, rated as central or peripheral to the event. Self report measures of post-traumatic stress disorder and depression.
Results: Discrepancies between an individual's accounts were common. For participants with high levels of post-traumatic stress, the number of discrepancies increased with length of time between interviews. More discrepancies occurred in details peripheral to the account than in details that were central to the account.
Conclusion: The assumption that inconsistency of recall means that accounts have poor credibility is questionable. Discrepancies are likely to occur in repeated interviews. For refugees showing symptoms of high levels of post-traumatic stress, the length of the application process may also affect the number of discrepancies. Recall of details rated by the interviewee as peripheral to the account is more likely to be inconsistent than recall of details that are central to the account. Thus, such inconsistencies should not be relied on as indicating a lack of credibility.
What is already known on this topic
Discrepancies between accounts of an event are often used to judge the credibility of asylum seekers
What this study adds
Discrepancies arise between two accounts of the same event even when there is no reason for fabrication
Refugees with high levels of post-traumatic stress are more likely to give inconsistent accounts if they have a long time to wait between interviews
Interviewees are more likely to be inconsistent in details that they rate as peripheral to their experiences than details they consider to be central
Inconsistent recall does not necessarily imply that asylum seekers are fabricating their accounts
Under the terms of the 1951 United Nations convention on the status of refugees, a refugee is someone with a well founded fear of persecution on arbitrary grounds — such as ethnicity or political opinion — who cannot achieve protection in their home country.1 When they escape to a new country, their application for asylum is considered in the light of the information they can supply and any facts known about their country. There will often be little documentary evidence about the asylum seeker, and a legal decision on status by the authorities in the country of reception may rest on their credibility as a witness.
Asylum seekers sometimes give accounts of persecution that differ with each telling (that is, they contain discrepancies). A common, understandable assumption is that the case is not a credible one — that is, the account is false or exaggerated and that is why it cannot be recalled clearly. In the United Kingdom, national guidelines state that “discrepancies, exaggerated accounts, and the addition of new claims of mistreatment may affect credibility” and that such discrepancies may be used as a reason for refugees being refused asylum.2
Generally, the more detail a memory has, the more believable and convincing the account is. The gist of an autobiographical memory (central details) can be reconstructed from general (historical or schematic) knowledge, whereas details of a specific event (peripheral details) cannot. Recall of peripheral details is thus seen as a good way of distinguishing between “accurate recollection and plausible reconstruction.”3 This is presumably the principle that, in part, guides state authorities' reliance on consistent details as an indication of credibility.
However, this view has been challenged in research on witnesses' evidence. Laboratory and field studies have shown that people recall more details that are central when an event has a high level of emotional impact, such as armed robbery, than when an event is emotionally neutral. Their recall of central details is, however, at the expense of their recall of peripheral details.4 Also, peripheral detail is more susceptible to disruption after the event. For example, discussions about the event5 and the exact wording of questions6 can change the detail of the responses given by the asylum seeker.
Emotional disorders can affect the quality of material that people remember. Clinical anxiety can lead to selective attention to threat, and depressed patients are biased towards recalling negative personal memories in favour of positive ones.7 Patients with depression and with post-traumatic stress disorder can have difficulties in retrieving specific autobiographical memories. 8 9 Refugees often present with clinical depression or with post-traumatic stress disorder.10–12
Where the experience is highly traumatic — for example, serious injury to the person — the situation may be even more complex. There may be important differences between traumatic and non-traumatic memories. For example, initial recall of traumatic events by people with post-traumatic stress disorder typically does not involve normal narrative memory.13 In a military population, inconsistent (discrepant) recall of traumatic experiences over time has been reported,14 with a notable correlation between symptoms of post-traumatic stress disorder and an increase in traumatic memories.
Our study explores empirically the consistency of recall of traumatic and non-traumatic events in a sample of refugees with no obvious motivation to fabricate accounts. Any inconsistencies found would require an alternative explanation. We hypothesised that there would be discrepancies between an individual's accounts, that the discrepancies would be exacerbated by symptoms of post-traumatic stress disorder, and that more discrepancies would occur in peripheral details than in central details.
We invited 27 Kosovan Albanians and 16 Bosnians to take part in research about memory; 23 were men and 20 were women, all aged between 18 and 64 (mean 39.5 (SD 14.5)). The research was not part of clinical treatment or medicolegal assessment. All participants had been granted leave to remain in the United Kingdom under the United Nations High Commissioner for Refugees group programmes — that is, none of the participants had given accounts of their experiences to gain their asylum status. We obtained written (translated) informed consent from all participants and the study was granted ethical approval.
One of us (JH) interviewed all participants twice (with an interpreter). The time between interviews ranged from three to 32 weeks. At the first interview we used a translated form of the post-traumatic diagnostic scale to assess the level of post-traumatic stress.15 Participants were asked to recall a traumatic event from their experiences. Fifteen predefined questions were asked about the chosen event — for example, what was the date? Similar questions were asked about a non-traumatic event. After answering each question, participants were asked to rate that particular element as central or peripheral to their experience. At the second interview participants were asked to recall the same two events. The same 15 questions were repeated and we assessed whether the participants were depressed by using a translated form of the Beck depression inventory.16
Calculation of discrepancy rates
Discrepancy rates were calculated by dividing the number of discrepant details between answers at the two interviews (including new information) by the total number of units of information in the first interview. Four separate rates were calculated per participant: central details of traumatic memories, peripheral details of traumatic memories, central details of non-traumatic memories, and peripheral details of non-traumatic memories.
Another rater coded 70% of the transcripts according to written coding procedures. Intraclass correlation estimates for the four rates ranged from 0.65 to 0.81. General linear model univariate analyses and bivariate correlations were used to compare the discrepancy rates of participants who scored high on the post-traumatic diagnostic scale with those of participants who scored low on the scale. General linear model repeated measures tests were used to analyse the interactions between the types of detail (central versus peripheral) and the type of event recalled (traumatic versus non-traumatic). SPSS software was used for all analyses.
Four of the Bosnian participants left the study after the first interview. All participants reported traumatic experiences. Scores of symptom severity on the post-traumatic diagnostic scale (maximum 51) ranged from 5 to 50 (27.3 (10.9)). Participants were divided into high and low scorers: scores of ≥ 26 were categorised as high (n=19); scores of ≥25 were categorised as low (n=21). The depression scores (maximum 63), standardised to the Beck depression inventory II, ranged from 7 to 52 (mean 24.2 (11.6)). Of the 39 scores, 31 indicated probable clinical depression (score >14); 21 indicated moderate or severe depression.
The mean age of the Bosnian group was greater than that of the Kosovan group (46.2 v 35.5 years; P<0.05, t test) and the average time between interviews was significantly longer for Bosnian refugees than for Kosovan refugees (159 v 29 days; P<0.0001). No significant differences in psychopathology between the two groups were seen. To reduce the limitations of these differences on interpreting the results, and to take account of the loss of four subjects from the Bosnian group, each of the hypotheses was tested on the whole sample and then on the larger of the two subsets — the 27 Kosovan participants — alone. Significant findings only are reported.
Discrepancies between the two accounts were found for all participants. The mean (overall) discrepancy rate was 0.32 (0.14; range 0.01-0.65; 95% confidence interval 0.28 to 0.36).
The length of time between interviews had a significant effect on discrepancy rates. Testing for homogeneity of regression in the two groups (high or low levels of post-traumatic stress) showed an interaction between the level of post-traumatic stress and length of time between interviews. In the group with high post-traumatic stress there was a positive association between number of discrepancies and length of delay (r=0.70, P<0.01), but this difference was not seen in the group with low post-traumatic stress (r=-0.122) (also see table 1). A univariate analysis of variance showed that this interaction was significant (F1,32=6.48, P<0.05).
The distributions of the discrepancy rates for peripheral details were skewed because of three outliers; these values were excluded from the parametric analyses. We confirmed our findings including the outliers, using non-parametric tests.
The mean discrepancy rates for each category of discrepancy are shown in table 2. Significantly more discrepancies were observed in peripheral details than in central details (F1,32=5.06, P<0.05). A Wilcoxon signed ranks test confirmed a significant difference between peripheral and central discrepancy rates (z=2.31, P<0.05).
The interaction between type of detail and type of event recalled was not significant. Retrospective pairwise comparisons showed that type of detail (central or peripheral) had a significant effect on the discrepancy rate when memories were traumatic (F1,32=4.42, P<0.05), but not when they were non-traumatic (F1,32=1.25, P=0.27). This result was not replicated, however, with non-parametric testing (traumatic central versus traumatic peripheral discrepancies, z=1.89, P=0.06; non-traumatic central versus non-traumatic peripheral discrepancies, z=1.74, P=0.08).
In the Kosovan subsample, more discrepancies were found in peripheral details than in central details (table 2). The main effect of type of detail (central or peripheral) was in the same direction as the whole sample and marginally significant (F1,24=4.25, P=0.05). This finding was not replicated with non-parametric testing (z=1.34, P=0.18).
Discrepancies (including the provision of new information) exist between autobiographical accounts of refugees given by the same individual on two occasions up to seven months apart. These findings cannot be explained on the grounds of intent to deceive. For refugees with high post-traumatic stress disorder, more discrepancies were found with longer times between interviews. (In the asylum process, there may be months or years between the original interview and an appeal hearing.) In addition, more discrepancies are found in details rated by participants as peripheral, compared with recollection of the central gist of the event. Discrepancies therefore cannot be taken as automatically implying fabrication.
Our findings cause us to question why such discrepancies might exist. Our research cannot provide a causal explanation but it can point to some possibilities. A common difficulty reported was related to the experience of repeated events that are similar. This may have led to the recall of an event similar in type but different in detail at the second interview, or to the mixing up of two or more events. The emotional state of the refugee at the time of the interview may have affected his or her responses. For example, one participant changed his description of his treatment by military police from “we were slapped around” to “we were badly beaten.” He may simply have been in a different mood state in each interview, thus giving different evaluations of his experience. In states of depressed mood, recall is biased towards negative memories.7 Further studies should consider measuring the person's state of mood at each interview.
Reminiscence is the phenomenon of new information about an event becoming available over repeated recall. It has been shown clearly in the laboratory but has received little interest in applied areas. One explanation for reminiscence is that, once a person has initiated a search in memory, the search continues. The person may or may not be consciously aware of the process. This may lead to the checking of memories with others who were present at the time, or the gradual remembering of more detail. Both these factors would be associated with discrepancies and may increase in importance over time.
Length of time between interviews
Although it was not hypothesised initially, and we must consider the possibility of type I error, the effect of the interaction of post-traumatic stress and length of time between interviews on discrepancies is probably the most important finding in relation to asylum policy. If discrepancies continue to be used as a criterion for regarding a case as lacking credibility, then asylum seekers who have post-traumatic stress at the time of their interviews are systematically more likely to be rejected the longer their application takes. Cogent reasons exist as to why memories may be different when a person has post-traumatic stress disorder 13 17 and these certainly merit further exploration in the context of asylum applicants and refugees.
Our study shows the danger of concluding that asylum seekers are fabricating their histories, solely on the basis of discrepancies between interviews, even when the interviews are only weeks apart. Discrepancies are common, especially (although not exclusively) when the person has post-traumatic stress disorder and has to wait a long time between interviews. Discrepancies are more likely to arise when the details required are peripheral to the interviewee's experience and when the content is traumatic to the interviewee. All of these factors are present in many asylum applications, and they may be increasing the risk of incorrect judgments. Our study has important implications for national and international policy in the assessment of asylum seekers.
Boba Dobretic, Teuta Hoxha, and Dr Laidon Shapo gave cultural advice and interpreting and translation work. Dr Pasco Fearon was the statistical consultant.
Contributors: JH initiated the study, conducted the theoretical research, formulated the hypotheses, designed the protocol, and carried out the data collection, statistical analyses and interpretation of the results. ST proposed the original idea of investigating discrepancies, discussed design issues, gave access to participants, and was field supervisor for JH's doctoral thesis in clinical psychology at University College London. PS discussed design issues and core ideas and was UCL internal supervisor for the doctoral thesis. All three authors contributed to the writing of the paper.
Funding University College London. This work was undertaken with the support of Camden and Islington Mental Health NHS Trust, who received funding from the NHS Executive; the views expressed in this publication are those of the authors and not necessarily those of the NHS executive or the Department of Health.
Competing interests JH submitted the study in part fulfillment of JH's requirements for a doctorate in clinical psychology at University College London. ST undertakes medicolegal assessments of asylum seekers.