Intended for healthcare professionals

CCBYNC Open access
Research Christmas 2017: Natural Phenomena

Individual differences in normal body temperature: longitudinal big data analysis of patient records

BMJ 2017; 359 doi: (Published 13 December 2017) Cite this as: BMJ 2017;359:j5468
  1. Ziad Obermeyer, assistant professor1 2,
  2. Jasmeet K Samra, research analyst1,
  3. Sendhil Mullainathan, professor3
  1. 1Department of Emergency Medicine, Brigham and Women’s Hospital, Boston, MA, USA
  2. 2Department of Emergency Medicine and Health Care Policy, Harvard Medical School, Boston, MA, USA
  3. 3Department of Economics, Harvard University, Boston, MA, USA
  1. Correspondence to: Z Obermeyer zobermeyer{at}
  • Accepted 23 November 2017


Objective To estimate individual level body temperature and to correlate it with other measures of physiology and health.

Design Observational cohort study.

Setting Outpatient clinics of a large academic hospital, 2009-14.

Participants 35 488 patients who neither received a diagnosis for infections nor were prescribed antibiotics, in whom temperature was expected to be within normal limits.

Main outcome measures Baseline temperatures at individual level, estimated using random effects regression and controlling for ambient conditions at the time of measurement, body site, and time factors. Baseline temperatures were correlated with demographics, medical comorbidities, vital signs, and subsequent one year mortality.

Results In a diverse cohort of 35 488 patients (mean age 52.9 years, 64% women, 41% non-white race) with 243 506 temperature measurements, mean temperature was 36.6°C (95% range 35.7-37.3°C, 99% range 35.3-37.7°C). Several demographic factors were linked to individual level temperature, with older people the coolest (–0.021°C for every decade, P<0.001) and African-American women the hottest (versus white men: 0.052°C, P<0.001). Several comorbidities were linked to lower temperature (eg, hypothyroidism: –0.013°C, P=0.01) or higher temperature (eg, cancer: 0.020, P<0.001), as were physiological measurements (eg, body mass index: 0.002 per m/kg2, P<0.001). Overall, measured factors collectively explained only 8.2% of individual temperature variation. Despite this, unexplained temperature variation was a significant predictor of subsequent mortality: controlling for all measured factors, an increase of 0.149°C (1 SD of individual temperature in the data) was linked to 8.4% higher one year mortality (P=0.014).

Conclusions Individuals’ baseline temperatures showed meaningful variation that was not due solely to measurement error or environmental factors. Baseline temperatures correlated with demographics, comorbid conditions, and physiology, but these factors explained only a small part of individual temperature variation. Unexplained variation in baseline temperature, however, strongly predicted mortality.


Have you ever felt cold, or warm, in a room where everyone else felt comfortable? This common experience about room temperature has some interesting lessons for body temperature and how we measure it. To know how warm or cold someone feels, we would not look at room temperature alone. Married couples sitting next to each other in the same room routinely disagree about whether to turn the heat up or down. Individuals have different baseline propensities to feel hot or cold—at any given absolute room temperature.

Yet doctors have forgotten this lesson when measuring core body temperature. We would not use absolute room temperature to infer perceived warmth, but we do use absolute body temperature to infer fever.

In medical school, students are taught that humans have a core body temperature as a species, not as individuals. When clinicians take patients’ temperatures in the clinic or hospital, they compare the measurements with the population average. Deviations from this single number help in the diagnosis of acute pathological states, from infections to thyroid disorders.

Why should someone’s physiological state be compared with an absolute standard temperature? Body temperature deviations, after all, can have their roots in individual physiology, such as age12 and circadian,3 metabolic,4 and ovulatory cycles.5 These factors vary dramatically across individuals, raising the possibility that individuals have baseline temperatures that differ systematically from the population average. The same temperature that is normal for one person might be dangerously high for another.

Historically, the use of a population average was partly a data problem: estimating each individual’s baseline body temperature would have been challenging. Any credible effort would need to tease apart individual baseline temperatures from other sources of variation in measured temperatures: ambient conditions when temperature is taken,6 differences in technique,78 and random error due to intrinsic variability in the measurement process itself.89 Distinguishing signal from noise under these conditions would require large amounts of data: both large numbers of patients and multiple temperature measurements for each patient.

The modern electronic health record stores rich physiological data, including temperature, on large numbers of patients, and many have noted its potential to generate new clinical knowledge.1011 We used these data to deal with random errors in individual temperature measurements, by applying statistical techniques that find signal and minimize noise across multiple temperature measurements, taken in a variety of outpatient settings and for a large number of patients. To deal with the many factors known to affect temperature measurements, we used rich contextual data to control for conditions at the time of measurement: ambient temperature, humidity, time of day, date, and body site of measurement. This allowed us to estimate stable baseline temperatures for every patient in our large and diverse sample, as opposed to population averages. We then explored links between individual temperature and a range of other variables: demographics; physiological measures, including vital signs; and mortality.

Individual differences in body temperature might be meaningful in two ways. Firstly, their very existence could open new insights into human physiology and links between body temperature, metabolism, and longevity.121314 Secondly, individualized “precision temperatures” could allow doctors to tailor testing and treatment decisions to patients’ physiology. At a minimum, they might change one familiar part of the doctor-patient conversation: the eye-rolling that sometimes ensues when patients report that a given temperature, although normal, is “high for me.”


Until recently, large scale databases of temperature measurements were scarce: with some notable exceptions,1516 most studies of body temperature in humans date from 1950 or before17 and have important limitations. Temperatures were measured at varying times of day and in different seasons, using unspecified instruments.17 Details regarding enrollment procedures were often absent, but—particularly in older studies—there was little apparent concern about achieving a diverse sample of patients for race, sex, and comorbid conditions.17 Lack of longitudinal data meant that temperature measurements could not be correlated to subsequent outcomes.151617 Perhaps most importantly, albeit with some exceptions,1518 sample sizes were small—often in the 10s to 100s of patients.1617


We used a dataset of electronic health records from a large US based academic hospital. The dataset was assembled in two steps: firstly, we identified patients to be included in the cohort—those with one or more visits to the hospital’s emergency and outpatient departments during 2010-12; and, secondly, we obtained data on each of these patient’s outpatient visits from 2009-14, consisting of visits to clinics during which a temperature was measured (Welch Allyn SureTemp Plus digital thermometers were present in most examination rooms).

Since we were interested in individual estimates of normal body temperature in adults, we focused on routine visits during which temperature was expected to be within normal limits. We did not include emergency department visits, during which acute physiological disturbances may affect measured temperatures. We excluded patients aged less than 18 years and those seen on weekends or outside business hours (7 am-6 pm), to avoid selecting patients seen for non-routine problems; those with implausible recorded temperatures (<32°C and >45°C; 0.04% of the total sample); and those visiting clinics for infections, to remove the effect of infection related disturbances in body temperature. We thus excluded visits with ICD-9 (international classification of diseases, ninth revision) codes for infectious diseases, and visits with antibiotics prescribed in the week after the visit (see supplement eTable 1 for cohort construction). Our final sample accounted for roughly 3-5% of the hospital’s annual load of outpatient visits.

Statistical analysis

Estimating individual baseline temperatures

In the included sample of routine outpatient visits, we used ordinary least squares regression to model measured visit level temperature as a function of external conditions (ie, ambient temperature and dew point, drawn from National Oceanographic and Atmospheric Administration data19), body site (eg, oral, axillary), and time of measurement (hour, day, month, and year).

To estimate baseline temperatures, we modeled individual patients’ deviations from the population mean, controlling for the selected factors. We used fixed effects to ensure that individual temperature effects were approximately normally distributed (see supplement eFigure 1), then we re-estimated the model using random effects. We chose random effects because even in the presence of measurement error they are consistent and efficient estimators of variance.2021 These steps allowed us to estimate individual temperature effects as well as features of the population level distribution, such as the variance of individual effects. Using individual level effects necessarily restricted the sample to patients with at least two measured temperatures over the study period. We omitted time invariant attributes of patients (eg, sex, race) since these cannot be estimated alongside individual level effects. Standard errors were clustered at patient level.

We correlated the resulting individual temperature random effects with other variables of interest in the electronic health record, in three groups: demographics, including age, sex, and race; comorbidities, defined using ICD-9 codes over the year before visits, following usual practice22; and physiological measurements (pulse, systolic and diastolic blood pressure, body mass index), using average values over the period spanning from first to last included visit date for each patient—this mirrored the period over which individual level effects were estimated in our dataset. Since we were exploring a range of correlations, we adjusted P values for multiple hypothesis testing using the Holm sequential procedure.23 Alternative methods of adjustment2425 are provided in supplement eTable 3 (results were substantively unchanged).

Relation between individual temperature effects and mortality

Finally, we explored the relation between individual temperature and mortality, using linkage to state social security data. To accurately calculate one year mortality, we addressed a source of bias in these longitudinal cohort data: patients were sampled and included in the cohort because of encounters over 2010-12, up until which they were necessarily alive. Likewise, patients were followed until their last temperature measurement, as late as 2014, at which time they were also alive by construction. Since our primary interest was the correlation between mortality and routine temperature measurements, we calculated mortality in the year after each patient’s last temperature measurement, excluding those whose 2010-12 sampling event occurred after the last temperature measurement (16% of the sample). We then estimated the relation between one year mortality and individual temperature effects by logistic regression, controlling for demographics, comorbidities, and physiological measures. As the hospital is a referral center serving patients from the local community and further away—who require specialized care for complex, serious illnesses with high mortality—we also controlled for log distance between patients’ home zip code and hospital zip code.

Statistical packages

All analyses were performed in STATA (version 14.0) and R (version 3.2.3; Foundation for Statistical Computing).

Patient involvement

No patients were involved in setting the research question or the outcome measures, nor were they involved in developing plans for design or implementation of the study. No patients were asked to advise on interpretation or writing up of results. There are no plans to disseminate the results of the research to study participants or the relevant patient community.


Of 374 306 patients with temperature measurements at outpatient visits, we excluded 130 800 (largely because of infection diagnoses or antibiotic prescriptions; see supplement eTable 1), leaving 243 506 visits (18% of all outpatient visits meeting the other inclusion criteria). Table 1 shows demographics, physiological measurements, comorbidities, and one year mortality. The mean age at time of the visit was 52.9 years, 64% of patients were women, and 41% were of non-white race (including 16% black or African-American patients, 17% Hispanic patients). The most common primary diagnoses at included outpatient visits were osteoarthritis (5.9%), back pain (4.9%), and routine evaluation and examination (4.5%). The clinics most commonly visited were orthopedics (10%) and internal medicine (hospital based clinic: 7.8%, community clinic: 5.7%). One year mortality in this sample of patients receiving care at a tertiary referral hospital was 6.2%, considerably higher than in the general population (<1% in a similar age range).

Table 1

Demographics, physiological measurements, comorbidities, and one year mortality for full sample. Values are numbers (percentages) unless stated otherwise

View this table:

The mean measured temperature was 36.6°C (95% confidence interval 36.6°C to 36.6°C). Each patient had a median 5 (interquartile range 3-9) temperature measurements over a median of 2.1 (0.8-3.8) years, and 19% had more than 10 measurements.

Summary statistics on temperature

  • Mean 36.6°C (95% range 35.7-37.3°C; 99% range 35.3-37.7°C)

  • Measurements at different sites (versus oral): temporal: –0.03°C; tympanic: –0.06°C; axillary: –0.26°C

Figure 1 shows the correlation of temporal and environmental factors to measured body temperature, estimated by using random effects regression. Temperature measurements were largely oral (88.2% oral, 3.5% temporal, 3.0% tympanic, 0.1% axillary, 5.2% not recorded), and temporal, tympanic, and axillary temperatures were significantly lower than oral temperatures (by –0.03°C, –0.06°C, and –0.27°C, respectively; all P<0.001). We observed diurnal variation in temperature by hour, with a peak at 4 pm (0.03°C v 12 pm, P<0.001). Higher ambient temperature and dew point were both linked to higher body temperature. Month effects worked to offset these effects—that is, at the same ambient temperature and dew point, summer months were linked to lower body temperature and winter months to higher body temperature. On a median temperature day in our dataset (12.2°C), body temperature was on average 0.08°C lower in July than in February, presumably reflecting the effect of compensatory physiological mechanisms (eg, evaporative cooling, vasoconstriction) by season. Supplement eTable 2 presents the full coefficients from the model.

Fig 1
Fig 1

Relation of temporal and environmental factors to measured body temperature. Coefficients estimated by random effects regression are shown for ambient temperature, dew point, hour, and month, compared with reference categories: median temperature 10th (12.2°C), median dew point 10th (4.7˚C), 12 pm, and April, respectively

The standard deviation for random effects from this model, denoted as individual baseline temperatures, was 0.15, with 95% range (ie, 2.5–97.5th centile range) 0.60°C (–0.33-0.27°C). In comparison, the standard deviation for raw measured body temperature was 0.42, with 95% range 1.67°C (after subtracting the mean, for comparability: –0.93-0.74°C).

Table 2 shows demographics, vital signs, and comorbidities by fifth of individual baseline temperature, along with regression coefficients of baseline temperatures on the variable. Baseline temperatures declined with age (–0.02°C every decade, P<0.001). African-American women had the highest temperature (0.052°C higher than white men, P<0.001). Baseline temperature also varied significantly as a function of comorbid conditions. Cancer was linked to higher temperature (0.02°C, P<0.001), whereas hypothyroidism was linked to lower temperature (–0.01°C, P=0.01; the relation was linear for mean thyroid stimulating hormone level over the span of the data (see supplement eFigure 2A). The total number of comorbidities was not statistically significantly linked to baseline temperature over and above all individual included comorbidities.

Table 2

Coefficients from regression of individual temperature effect on demographics, comorbidities, and physiological measurements in 20 718 participants

View this table:

Table 2 shows the relation between individual baseline temperatures and average physiological measurements over the study period. Controlling for demographic factors and comorbidities, higher temperatures were linked to increased body mass index (0.002°C per m/kg2, P<0.001). Higher temperature was also linked to higher pulse (4.0×10−5°C per beats per minute, P=0.17) and increased diastolic blood pressure (1.2×10−4°C per mm Hg, P=0.01). Figure 2 shows mean vital signs over the study period and their relation to baseline temperatures by sex.

Fig 2
Fig 2

Physiological measurements and relation to temperature effects, by sex. Dots represent centiles of individual temperature effect and vertical bar 95% confidence intervals

Despite the many statistically significant relations identified between individual baseline temperatures and demographics, comorbidities, and physiological measures, these factors collectively accounted for only 8.2% of variation (adjusted R2) in temperature. Residual variation was greater for women than for men (residual sum of squares: 283 v 170), raising the possibility that at least part of the residual difference was driven by hormonal cycles (not measured). However, most variation in baseline temperatures remained unexplained by commonly measured health variables in both sexes.

Table 3 shows the relation between individual baseline temperature and mortality. Controlling for age, sex, race, vital signs, and comorbidities, a 1°C increase in temperature translated into 3.5% higher mortality (P=0.014). For example, a 1 SD increase in temperature (0.15°C) would translate into a 0.52% absolute increase in one year mortality. Compared with a mean mortality of 6.2% in our sample, this represented an 8.4% relative increase in mortality risk.

Table 3

Regression of one year mortality on individual temperature, controlling for demographics, comorbidities, and physiological measures in 15 821 participants

View this table:


To our knowledge, this is the first attempt to demonstrate meaningful variation in individual body temperature, separately from random measurement error and the influence of external factors, and to correlate them to a range of patient factors and outcomes in a diverse patient population.

These results illustrate a way in which “big data” can serve to generate new medical knowledge. Here we used these data not to answer a causal question (eg, to discover side effects of drugs) or to predict an outcome (eg, to create an early warning system for a given condition), but to discover previously unsuspected and potentially important patterns in human physiology. It is unlikely that any one practitioner could have noticed these patterns, despite their fairly large magnitude connections with mortality. Rather, large datasets and statistical methods are needed to bring out these patterns from the empirical record.

Our most noteworthy result was the connection between temperature and mortality. This fits with a larger body of research showing that reduction in body temperature increases longevity and delays aging in a range of (ectothermic) experimental models, including Drosophila and Caenorhabditis elegans,26 as well as transgenic (homeothermic) mice engineered to have lower temperatures.12 This observation raises a set of questions that may be worth answering through further research. What is the biological basis for an individual’s baseline temperature? And how might these factors teach us more about subtle but important physiological patterns that might lead to good or poor outcomes? Two sets of results from our analysis were suggestive along these lines.

Firstly, individual temperatures were highly correlated with measured patient characteristics, particularly those related to metabolism and obesity. These differences may have their roots in obvious thermodynamic factors: bodies with larger mass dissipate heat less rapidly, leading to higher temperatures. Fat (which is correlated with mass) could also act as an insulator, independent of mass, leading to higher heat retention in people with more fat. Other explanations, however, are also plausible. It is well known that caloric restriction through fasting leads to down-regulation of temperature, presumably to conserve energy27; indeed, reduced temperature is now considered a biomarker for caloric restriction.1314 Although there are few trials of caloric supplementation, the existing literature suggests that individuals vary widely in their ability to dissipate excess energy from overfeeding.28 Given the strong links between resting metabolic rate and body temperature,4 it is possible that higher resting body temperature could be a response serving to dissipate excess energy from caloric intake. We found that a raised temperature correlated with both body mass index and activation of the sympathetic nervous system (ie, increased pulse rate and diastolic blood pressure). Thus temperature could be another variable in the cluster of traits linking obesity and activation of the sympathetic nervous system.293031 Interestingly, while our temperature effects were estimated over longer periods, some small studies have found that hypothermia is linked to increased mortality for acute events (eg, in hip fractures),32 potentially implying a different physiology in short term versus long term temperature regulation.

Secondly, we found a large correlation between individual baseline temperature and mortality that was not explained by measured patient characteristics. What factors might this temperature variation be picking up on that confer an 8.4% mortality disadvantage? It is tempting to speculate. We did identify a correlation between diagnosed cancers and temperature. This has been noted in the literature previously, either because of the direct metabolic demands of the cancer itself,33 or because of the body’s immune response.34 Subclinical infections or rheumatological diseases could exert a similar effect. If higher temperature reflected undiagnosed cancers or other illnesses, this would generate the correlation we observed between the unexplained component of temperature variation and subsequent mortality from these same illnesses. Another potential explanation is that higher temperature reflected a pro-inflammatory milieu; however, we found no clear connection between individual temperature and the inflammatory marker reactive protein (see supplement eFigure 2b). Ultimately, further study is required. We could imagine studies that estimated individual temperature effects using similar methods, then subjected those with higher temperatures to additional diagnostic studies to identify undiagnosed illnesses.

The finding that measured temperature was, other things being equal, lower in hot months and higher in cold months may reflect engagement of well known compensatory adaptations (eg, plasma volume, evaporative cooling, vasoconstriction, shivering) to temperatures experienced over longer periods, as opposed to the short term direct effects of higher or lower temperature.35 Related recent work has shown, for example, that drinking warm beverages on warm days results in heat loss from increased sweat output.36

Finally, our estimate of population mean temperature differed from other studies—for example, it was lower than in a sample of primarily young, healthy participants,16 and higher than in a population of older adults seen in healthcare settings.15 One potential advantage of our approach is that it adjusted for environmental and temporal factors at the time of measurement, which was not possible in other studies. Although this might enhance the generalizability of our estimate, it would be difficult for any one study of core temperature to ensure that estimates are valid for an entire species. However, it may help with clinical decision making for populations seeking healthcare in similar settings.

Limitations of this study

Our study had several limitations. We considered patients at one academic center and measured temperature using similar equipment, which can have correlated errors in measurement. Our methods were designed to estimate robust deviations in individual temperature separately from measurement error but might not generalize to other equipment, although we would guess that stringent Food and Drug Administration standards for clinical electronic thermometers offer some guarantees that the relative magnitude of effects should be similar. Our data came from one climate zone. Although we controlled for the substantial variation in environmental conditions within this zone, temperature and compensatory mechanisms may vary across climactic zones. We excluded patients with infection and those prescribed antibiotics, but it is possible that some patients had infections that were undiagnosed and untreated by their physicians. However, since this would have to be a consistent finding over multiple visits for the same patients, and since most infections are by contrast transient, this is unlikely to have affected our estimated correlation between individual baseline temperature and mortality. We sampled patients based on visits to a hospital emergency department and clinics. This resulted in an ethnically and medically diverse sample over multiple years of data, but it also selected a sicker set of patients, with a higher comorbidity burden and mortality than the general population. An advantage is that our sample was representative of the population of patients using healthcare today.


We found that individuals have body temperature baselines that correlate with a range of demographic factors, comorbid conditions, and physiological measurements. Since the unexplained variation in temperature is large and correlates with mortality, it may be an interesting and important area for further study.

What is already known on this topic

  • A long tradition of research on human core body temperature, starting in the 19th century, has focused on establishing average temperature in a population

  • Temperature is known to be influenced by many factors that differ widely across patients (eg, age and circadian, metabolic, and ovulatory cycles) raising the possibility that individual baseline body temperatures might vary systematically

What this study adds

  • Individual baseline temperatures are correlated with specific demographic factors, with older people the coldest and African-American women the hottest

  • Particular medical conditions were also statistically significantly linked to lower or higher temperature, as were physiological measurements, but these factors explained only 8.2% of variation in individual baseline temperatures

  • The remaining unexplained variation was a large and significant predictor of subsequent mortality, nearly 8.4% higher mortality for a 1 SD increase in temperature


We thank Philip Mackowiak for comments on an early draft of this manuscript.


  • Contributors: ZO and SM designed the study and wrote the manuscript. ZO obtained funding. JKS and ZO analyzed the data. All authors had full access to all of the data (including statistical reports and tables) in the study and can take responsibility for the integrity of the data and the accuracy of the data analysis. ZO acts as guarantor.

  • Funding: This study was supported by a grant from the office of the director of the National Institutes of Health (DP5 OD012161) to ZO. This research was independent from funders. The funder had no role in the study design; in the collection, analysis, and interpretation of data; in the writing of the report; or in the decision to submit the article for publication.

  • Competing interests: All authors have completed the ICMJE uniform disclosure form at and declare: no financial relationships with any organizations that might have an interest in the submitted work in the previous three years; no other relationships or activities that could appear to have influenced the submitted work.

  • Ethical approval: This study was approved by the institutional review boards of Partners HealthCare, the parent organization of the hospital where the research was conducted.

  • Data sharing: No additional data available.

  • Transparency: The lead author (ZO) affirms that the manuscript is an honest, accurate, and transparent account of the study being reported; that no important aspects of the study have been omitted; and that any discrepancies from the study as planned have been explained.

This is an Open Access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See:


View Abstract