Skip to main content

The effect of post-exercise heat exposure (passive heat acclimation) on endurance exercise performance: a systematic review and meta-analysis

Abstract

Background

“Active” heat acclimation (exercise-in-the-heat) can improve exercise performance but the efficacy of “passive” heat acclimation using post-exercise heat exposure is unclear. Therefore, we synthesised a systematic review and meta-analysis to answer whether post-exercise heat exposure improves exercise performance.

Methods

Five databases were searched to identify studies including: (i) healthy adults; (ii) an exercise training intervention with post-exercise heat exposure via sauna or hot water immersion (treatment group); (iii) a non-heat exposure control group completing the same training; and (iv) outcomes measuring exercise performance in the heat (primary outcome), or performance in thermoneutral conditions, V̇O2max, lactate threshold, economy, heart rate, RPE, core temperature, sweat rate, and thermal sensations. Study quality was assessed using the Cochrane Risk of Bias 2 tool. To determine the effect of post-exercise heat exposure, between-group ratio of means or standardized mean differences (SMD) were calculated for each outcome and weighted by the inverse of their variance to calculate an overall effect estimate (ratio of mean or Hedges’g) in a random effects meta-analysis, with 95% confidence intervals (CI) and prediction intervals (PI). Quality of evidence was assessed using the Grading of Recommendations Assessment, Development and Evaluation (GRADE) tool.

Results

Ten studies (199 participants: 156 male, 43 female, age 20–32 years) were included. The effect of post-exercise heat exposure on performance in hot conditions (33–40 °C) was trivial (ratio of means = 1.04) with poor precision (95%CI 0.94–1.15, P = 0.46) and low predictive certainty (95%PI 0.81–1.33). There were also trivial effects on performance in thermoneutral conditions (18–24 °C) and speed at lactate threshold, small effects on V̇O2max, heart rate, core temperature, and sweat rate, and a moderate effect on thermal sensations. However, the certainty in the effect estimates was graded as low to very low across all outcomes due to small sample sizes, high risk of bias, risk of publication bias, imprecision in the effect estimates, and low statistical power.

Conclusions

The use of post-exercise heat exposure for improving exercise performance is uncertain. Further high-quality trials are needed to make firm conclusions.

Protocol registration

Open Science Foundation (https://doiorg.publicaciones.saludcastillayleon.es/10.17605/OSF.IO/256XZ).

Peer Review reports

Background

The current evidence shows that “active” heat acclimation (exercise-in-the-heat) achieved with daily exercise sessions performed in a 30–40 °C heat chamber can improve endurance exercise performance in hot conditions [1,2,3,4,5,6,7]. This occurs via several cardiovascular/metabolic/thermoregulatory adaptations, including enhanced sweat rate and skin blood flow, plasma volume expansion, improved fluid balance and cardiovascular stability, increased thermal tolerance, and lower perceived exertion (RPE) [1, 3, 4, 6,7,8]. However, training in the heat can be unpleasant, increases the risk of heat illness, and requires a reduction in external load (e.g. power output or speed) during sessions. From a practical perspective, “passive” heat acclimation with daily post-exercise heat exposure is favourable because it does not impede an athlete’s typical training: desired during-session power output or speed can be maintained. Furthermore, due to logistical or financial constraints, athletes may lack access to the necessary resources for active heat acclimation (a bicycle or treadmill in a heat chamber) but will likely have access to a bathtub or sauna with which to undertake passive heat acclimation. Consequently, post-exercise heat exposure may be a more practical method of heat acclimation [9, 10].

Several existing systematic reviews on heat acclimation and exercise performance have pooled the outcomes from active and passive heat acclimation interventions [1,2,3,4, 6, 7, 11], and no systematic review of passive heat acclimation currently exists. Therefore, the specific effect of post-exercise heat exposure on exercise performance is unclear. Consequently, this systematic review aims to determine whether passive heat acclimation using post-exercise heat exposure via sauna bathing or hot water immersion improves exercise performance in healthy adults. To obtain relevant information, endurance exercise performance outcomes from time-to-completion tests (time trials or races) and time-to-exhaustion tests in hot conditions were chosen as the primary outcome. Secondary outcomes included endurance exercise performance tests in thermoneutral conditions, as well as V̇O2max, economy/efficiency, and lactate threshold; and physiological measurements (heart rate, rating of perceived exertion [RPE], core temperature, sweat rate, and thermal ratings) during submaximal exercise in thermoneutral or hot conditions. Furthermore, due to existing evidence showing how factors like training status and biological sex might influence performance outcomes following exercise-in-the-heat interventions [3, 6], subgroup analyses were also planned.

Methods

Research question

The purpose of this paper is to answer the question: Does post-exercise heat exposure improve endurance exercise performance? To answer this question, a systematic review was synthesised in line with the Cochrane Handbook for Systematic Reviews of Interventions [12]. Table 1 shows the structure of the research question — the Population, Interventions, Comparisons, and Outcomes (PICO).

Table 1 PICO — Population, Interventions, Comparisons, Outcomes

Search strategy

The search strategy was planned in January 2022 and independently peer-reviewed on February 11 2022 by Professor Janice Thompson (University of Birmingham, UK) using the 2015 PRESS (Peer Review of Electronic Search Strategies) Guideline Statement [14] (Additional File 1). The following databases were searched: MEDLINE, CENTRAL, and clinical trials databases for unpublished data (ClinicalTrials.gov, WHO International Clinical Trials Registry Platform [WHO ICTRP], and EU Clinical Trials Register [EUCTR]). The following MEDLINE search string was used and adapted for other databases (Additional File 2):

  • (heat exposure[Title/Abstract] OR post exercise sauna[Title/Abstract] OR post exercise hot water bath[Title/Abstract] OR heat acclimation OR heat acclimatization OR heat acclimat* OR post exercise bath OR heat adaptation OR heat adapt* OR "hot water immersion"[Title/Abstract] OR Sauna OR "heat chamber") AND (time trial[Title/Abstract] OR performance[Title/Abstract] OR race[Title/Abstract] OR time to fatigue[Title/Abstract] OR athletic performance[MeSH Terms] OR VO2max OR "aerobic capacity" OR exercise[title/abstract]) AND humans

Note that “heat acclimatisation” describes the adaptations to heat exposure gained naturally through exposure to living in hot conditions whereas “heat acclimation” describes the adaptations gained from purposeful exposure to artificial conditions. These definitions are used throughout this paper, but the two phrases are used interchangeably in the literature. Therefore, both phrases were included in the search strategy.

The initial search was conducted on April 7, 2022. The study protocol was then registered on Open Science Foundation (OSF) before data extraction began [15]. Due to delays in responses to data requests, subsequent searches were completed on Feb 8, 2023 and Feb 7 2024. Other amendments to the original protocol were documented on the OSF registry [15]. The review was reported according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) 2020 statement [16] and the PRISMA checklist for systematic review protocols (PRISMA-P) was completed (Additional File 3).

Study selection

The search hits were downloaded into Endnote and the review authors TS and ML independently screened titles and abstracts, coding each study as “include” (eligible or potentially eligible/unclear) or “exclude”. Duplicates were identified and excluded. Studies were only selected if they included: (i) healthy male or female adults (≥ 18 years of age) of any race/ethnicity, (ii) an exercise training intervention (≥ 30 min/day) with post-exercise heat exposure for at least 2 consecutive days, (iii) a non-heat-exposure control group completing the same daily exercise, and (iv) outcomes measuring endurance exercise performance during time-to-completion tests (time trials or races) or time-to-exhaustion tests, or exercise capacity (V̇O2max, lactate threshold, economy, etc.), or heart rate, RPE, core temperature, sweat rate, thermal sensations and thermal comfort during exercise. Studies were excluded if they were written in a non-English language, published in non-peer-reviewed journals, or if the full text was unavailable.

Reference lists of included articles were also screened using the same inclusion criteria. Full-text versions of the included articles were retrieved and independently screened by TS and ML to identify the final list of studies meeting the inclusion criteria. Reasons for exclusion were recorded and any disagreements were resolved through discussion or, when required, via the independent search strategy peer-reviewer, JT.

Data extraction

TS and ML independently extracted article info (author, year, journal, title), intervention details (duration; heat type, temperature, and duration; exercise type, duration, and intensity), sample sizes (N), mean and standard deviation (SD) values for subject characteristics (age, sex, BMI), and outcome variables of interest. If studies reported standard error (SE), it was converted to SD by multiplying it by √N. When full data were not reported, they were requested from the authors. If authors were unreachable (no response after several attempts to contact) or refused to share data, data were estimated from figures where possible using WebPlotDigitizer software [17]. When data were unobtainable, the study was still described in the qualitative summary but excluded from the meta-analysis. Independent data extraction from TS and ML was cross-checked and disagreements were resolved through discussion. To ensure data accuracy, TS & ML compared the magnitude and direction of effects reported in the included studies with how they appear in the study database.

Risk of bias analysis

TS and ML independently assessed the quality of the included studies using the Cochrane Risk of Bias 2 (RoB2) Excel tool [18], which assesses bias in five domains: Bias arising from the randomization process (including allocation concealment and blinding of participants and investigators); deviations from intended interventions; missing outcome data; the measurement of the outcome; and the selection of the reported result. The risk of bias for each domain was rated as “high”, “some concerns”, or “low”. A separate risk of bias analysis was conducted for each outcome of interest: exercise performance (time-to-completion and time-to-exhaustion tests, V̇O2max, economy, lactate threshold), heart rate, RPE, core temperature, sweat rate, thermal ratings (thermal sensations and thermal comfort). TS and ML’s independent RoB2 results were cross-checked, and disagreements were resolved through discussion. The overall risk of bias for each study was determined according to Cochrane guidelines [18]: “Low” if the trial was judged to be at low risk of bias for all domains; “Some concerns” if the trial was judged to raise some concerns in at least one domain but not to be at high risk of bias for any domain; or “High” if the trial was judged to be at high risk of bias in at least one domain or if the trial was judged to have “some concerns” for multiple domains. Risk of bias figures were built using the robvis R Shiny app [19].

Data synthesis and statistical analysis

A qualitative synthesis of the included studies was written, and an overall qualitative assessment was made based on the between-study heterogeneity in settings, participants, sample sizes, study designs, interventions, outcomes, risk of bias, funding sources, and conflicts of interest. Next, a quantitative synthesis was completed to determine the effect of the interventions. Any changes to the planned comparisons or outcomes or any new comparisons that became necessary were documented on the OSF protocol registration record [15].

Measures of treatment effects — meta-analysis

Different performance tests use different units of measurement — watts, kcals, seconds, metres, etc. — and an improvement in exercise performance can be represented by an increase in work done (kilojoules) or mean power output (watts), an increase in time-to-exhaustion, or a decrease in time to complete a specific distance. Therefore, intervention-induced changes for performance outcomes were calculated as a ratio of means (post divided by pre) [20, 21]. With most types of performance test, a “post ÷ pre” ratio of means greater than 1 indicates an improvement in performance. However, with some performance tests (e.g. time to completion tests), a “post ÷ pre” ratio of means that is greater than 1 inadvertently reflects a decrease in performance. To prevent this issue, an inverse transformation was applied to “post/pre” ratio of means values for time-to-completion tests (i.e. they were raised to the power of -1). Then, the between-group difference was calculated as a between-group ratio of means: (PostTreatment ÷ PreTreatment) ÷ (PostControl ÷ PreControl) [20, 21] and the standard deviation of the ratio of means was calculated as standard error (SE) × √n, where SE = √ ( 1 / nTreatment × (SDTreatment / meanTreatment)2 + 1 / nControl × (SDControl / meanControl)2) [20,21,22]. Because the sampling distribution of a ratio of means is not symmetrical, individual trial ratio of mean estimates must be pooled on a log scale [20,21,22]. Therefore, the ratio of means values were natural-log-transformed before meta-analytical calculations were made and summary effect estimates were then inverse-log-transformed (ex) back to ratio of mean (ROM) values, which were interpreted as: trivial (ROM < 1.08), small (1.08 ≤ ROM < 1.22), moderate (1.22 ≤ ROM < 1.37), or large effects (ROM ≥ 1.37) of treatment [20, 21]. Ratio of means meta-analyses were completed with RevMan v7.5.0 [23].

For the remaining outcomes, the between-group standardised mean difference (SMD) in each study was calculated as: ((PostTreatment—PreTreatment)—(PostControl—PreControl)) ÷ SDpooledbaseline [24]. The pooled baseline SD of the treatment and control groups was used rather than the SD of the change score to prevent the error of measurement from introducing bias into the estimate of the SMD. Between-group SMDs were weighted according to the relative sample sizes (N) in the treatment and control groups and reported as Hedges g effect size estimates, which were interpreted as: trivial (g < 0.2), small (0.2 ≤ g < 0.5), moderate (0.5 ≤ g < 0.8), or large effects (g ≥ 0.8) of treatment. SMD meta-analyses were completed with Meta-Essentials v1.5 [25] Because some studies used different scales to rate RPE (Borg 6 to 20 or 0 to 10), thermal sensation (-4 to + 4 or 0 to 13), and thermal comfort (-4 to + 4 or 0 to 10), these variables were re-scaled to a range of 0 to 100 (i.e. a per cent of the maximum possible score) before meta-analysis.

For all outcomes, a combined summary effect estimate was calculated using a DerSimonian‐Laird random effects model to account for between-study heterogeneity by weighting each study’s effect estimate by its inverse variance (i.e., 1/SE2) [25]. Z-tests were used to test for the overall effect and statistical significance was reported when P ≤ 0.05. The precision of the effect size estimates — SE and 95% confidence intervals (95%CI) — were also calculated. Forest plots were built using Microsoft Excel.

Statistical heterogeneity

To examine statistical heterogeneity (inconsistency of results), the Q statistic and its corresponding χ2 test P-value were calculated to test whether effect sizes departed from homogeneity [26]. I2 was also calculated to report the proportion of dispersion due to heterogeneity, and was interpreted as follows: 0% to 40% might not be important; 30% to 60% represents moderate heterogeneity; 50% to 90% represents substantial heterogeneity; 75% to 100% represents considerable heterogeneity [26]. τ2 and τ were calculated to quantify the amount of between-study heterogeneity in the true effects [26]. Prediction intervals, which are derived from τ, were also calculated to visualise the range in which, in 95% of the cases, the outcome of a future study will fall — i.e. the expected range of effects of future studies [27,28,29]. If the effect sizes of the included and not yet included studies are normally distributed, a large prediction interval indicates low certainty in the effect size estimate.

Statistical power in the meta-analysis

The metameta R package was used to calculate statistical power — the probability of detecting a true effect — for each outcome variable using the effect sizes and SEs measured in individual studies [30].

Assessment of reporting bias

To help identify the risk of publication bias and small study effects, the funnel plot relationship between the effect estimate (either ratio of means or standardised mean difference) and the SE of the effect estimate for each outcome variable was visually inspected [31]. Contour-enhanced funnel plots were constructed (using Microsoft Excel) to illustrate conventional levels of statistical significance (e.g., P < 0.01, P < 0.05, P < 0.10) [32] and Egger’s test was used to identify funnel plot asymmetry [33]. If statistically significant asymmetry (P ≤ 0.05) was detected, Duval and Tweedie’s trim and fill correction was used to simulate a model without publication bias and, therefore, estimate the number of missing studies and the intervention effect adjusted for publication bias [34].

Subgroup analysis

Subgroup analyses were planned for training status (trained athletes vs. non-athletes/untrained people) and temperature dependency (performance in hot vs. thermoneutral conditions), and χ2 tests were planned to test for subgroup interactions [35].

Sensitivity analysis

For each outcome variable, sensitivity analyses were planned if studies with a high risk of bias were identified and to explore between-study heterogeneity. A sensitivity analysis consisted of repeating the meta-analysis with the high-risk of bias studies removed and, if substantial statistical heterogeneity was identified (I2 > 50%), comparing the outcome of fixed-effect and random-effects analyses [36].

Quality of evidence

Certainty in the effect estimates was assessed according to the Grading of Recommendations Assessment, Development and Evaluation (GRADE) guidelines [37], which assesses the quality of evidence across five domains: risk of bias, inconsistency, indirectness, imprecision and publication bias. Note that imprecision was indicated if the 95%CI of the effect size overlapped zero or if the sample size in the meta-analysis was less than the optimal information size (OIS) criterion. The OIS was calculated for each variable using an online sample size calculator [38] recommended by GRADE, using alpha = 0.05 and beta = 0.20.

Results

Search results

The study selection process is presented in Fig. 1. The database searches yielded 2516 records. After removal of duplicates, 2458 records were screened and 2444 were excluded. Fourteen records were obtained for full‐text review, of which 10 met the inclusion criteria [39,40,41,42,43,44,45,46,47,48]. Reasons for exclusion included one or more of the following criteria: unrelated topic, no control group, non-English-language, non-healthy participants, less than 30-min of exercise with less than 2 consecutive days of post-exercise heat exposure, and no outcomes related to the primary and/or secondary outcomes stated in the PICO. The characteristics of excluded studies are available in Additional Files 4a to 4d.

Fig. 1
figure 1

PRISMA 2020 flow diagram

Qualitative synthesis of the included studies

The characteristics of the included studies are described in Table 2. The following is a qualitative summary of all evidence:

Table 2 Characteristics of the included studies

Settings

Of the 10 included studies, 8 were conducted in university research labs [39, 41, 43,44,45,46,47,48] and 2 were conducted in a Sports Institute research lab [40, 42]. Five trials were conducted in the UK [41, 45,46,47,48], 2 in Australia [40, 42], and 1 each in New Zealand [39], Norway [43], and Spain [44]. All studies conducted the heat exposure interventions indoors, while 5 studies conducted the exercise training intervention outdoors [39, 42, 44, 47, 48] and 5 were indoors [40, 41, 43, 45, 46].

Participants and sample sizes

The 10 studies included a total of 199 participants enrolled in interventions relevant to this review. Participants included 156 males and 43 females aged 20±2 to 32±4 years. Study sample sizes ranged from N=6 to N=41. Three of the studies included recreationally active participants [41, 45, 46] while the remaining 7 studies included trained athletes: runners [47, 48], runners and triathletes [39], race walkers [42], cyclists [40, 43], and soccer players [44]. Only 3 studies included female participants [42, 47, 48]; these same 3 studies also included male participants, while the other 7 studies included only males [39,40,41, 43,44,45,46].

Study designs

All 10 studies included a post-exercise heat exposure treatment group and a non-post-exercise heat exposure control group. Note that 2 studies [40, 43] included additional intervention groups (see Table 2) not relevant to this review and are, therefore, not discussed. Of the included studies, two [39, 40] used a crossover design and all but one [39] included a repeated-measures design. Six of the studies randomised participants to the intervention groups while two [47, 48] allowed participants to self-select into a treatment group, 1 study [42] divided participants into two groups to match V̇O2max and 10,000-m performance time between groups, and 2 studies [39, 43] did not describe whether participants were randomised.

Interventions

The intervention durations ranged from 5 to 21 days. The exercise training interventions included running [39, 45,46,47,48], race walking [42], cycling [40, 43], and soccer [44] with exercise performed daily in 6 studies [40,41,42,43, 45, 46]. It is possible that 3 further studies (7.7±2.3 times per week [39] and 7±2 times per week [47, 48]) also used daily exercise but it is not explicitly stated. The remaining study [44] did not explicitly state the exercise frequency, simply that participants engaged in at least 10 hours per week. Exercise sessions lasted between 40 and 105 minutes. A steady-state moderate-intensity (65% of V̇O2peak [41, 45, 46] or 45% to 51% of power at 4 mmol lactate [43]) was used in 4 studies while daily high-intensity intervals were completed in 1 study [40] and the remaining studies used variable intensities [39, 42, 44, 47, 48]. The during-exercise conditions were thermoneutral (18–19°C, 35–45% relative humidity [RH]) in 4 studies [41, 43, 45, 46], hot (≈33°C, ≈34% RH) in 1 study [42], and not reported in 5 studies [39, 40, 44, 47, 48]. The type of post-exercise heat exposure was a sauna in 4 studies [39, 44, 47, 48] and hot water immersion in 6 studies [40,41,42,43, 45, 46]. Heat exposure was delivered for 14–40 mins per session. The hot water immersion studies used a water temperature of ≈40°C, while the sauna studies used air temperatures between 90 and 108°C with relative humidities between 5 and 55%. Five studies described fluid intake during post-exercise heat exposure: 3 [39, 47, 48] reported that ab libitum water intake was permitted, while 1 described that there was no fluid intake [41], and 1 further study [44] described that fluid intake was limited to 250 ml, but none of the studies reported hydration status. Post-exercise heat exposure sessions were daily and followed every exercise training session in 5 studies [40, 41, 43, 45, 46]. Three studies [44, 47, 48] used post-exercise heat exposure on 3 days per week for a total of 9 sessions, while 2 studies did not explicitly state the frequency of heat exposure (12.7±2.1 sessions in 21 days [39], and 8 sessions in 14 days [42]).

Outcomes

Primary outcomes

Four studies [41, 43, 45, 46] reported endurance exercise performance (time trial, time-to-exhaustion, or a race) in hot conditions (air temperature 33–40°C; RH 40–61%).

Secondary outcomes

Five studies [41, 42, 44, 47, 48] reported performance (time trial, time-to-exhaustion, or a race) in thermoneutral conditions (air temperature 18–24°C; RH 42%). Two further studies [39, 40] did not report the temperature during the performance tests but, given the experimental design, they were likely conducted in thermoneutral conditions. Three studies reported V̇O2max in thermoneutral conditions [44, 47, 48], while one study [42] did not report the conditions for V̇O2max. Two studies reported running economy in thermoneutral conditions [47, 48]. Two studies reported speed at lactate threshold in thermoneutral conditions [47, 48] while one study [42] did not report the ambient temperature in which lactate threshold was assessed. Heart rate during submaximal exercise was reported by 7 studies [41,42,43, 45,46,47,48] in hot conditions and 1 study [41] in thermoneutral conditions, while 1 study [40] did not report the conditions. RPE during submaximal exercise was reported by 7 studies [41,42,43, 45,46,47,48] in hot conditions and 1 study [41] in thermoneutral conditions, while 1 study [40] did not report the conditions. All studies used the Borg 6–20 RPE scale except for Vaile et al. [40], which used RPE 0–10 but did not report the actual data. Core temperature during submaximal exercise was reported by 7 studies [41,42,43, 45,46,47,48] in hot conditions and 1 study [41] in thermoneutral conditions, while 1 study [40] did not report the conditions. Seven studies [41,42,43, 45,46,47,48] reported sweat rate during submaximal exercise in hot conditions and 1 study [41] in thermoneutral conditions. Thermal sensation during submaximal exercise was reported by 6 studies [41, 42, 45,46,47,48] in hot conditions and 1 study [41] in thermoneutral conditions. All studies used a 0–13 point thermal sensation scale except Stevens et al. [42], which used a -4 to +4 scale. Thermal comfort during submaximal exercise in hot conditions was reported by 3 studies [42, 47, 48]. Stevens et al. [42] used a -4 to +4 scale while the studies by Kirby et al. used a 0–10 point scale [47, 48]. Due to the use of different scales, thermal ratings were scaled to 0–100 before meta-analysis. No studies compared any of the outcome variables in trained athletes vs. non-athletes.

Risk of bias

The risk of bias analyses for the exercise performance tests (time trials, races, time-to-exhaustion, V̇O2max, and speed lactate threshold) and physiological measures (heart rate, RPE, core temperature, sweat rate, and thermal sensation) are presented in Fig. 2. Bias in the different domains is summarised as follows:

  • Bias arising from the randomisation process: For all outcomes, no study had a low risk of bias, while 2 studies [47, 48] had a high risk of bias and the remaining studies [39,40,41,42,43,44,45,46] had some concerns. The most common reasons for a high risk of bias or some concerns were that: the randomisation, allocation, and blinding processes were not described [39,40,41,42,43,44,45,46], participants self-selected their intervention group [47, 48], or participants were assigned to groups to match fitness between groups [42]. It is important to note that while it’s impossible to blind a participant from exercise, sauna, or hot water immersion, it would have been possible for investigators and/or data analysts to blind themselves to the group allocation.

  • Bias due to deviations from the intended intervention: For the exercise performance outcomes and most of the physiological outcomes, 4 studies [41, 43, 45, 46] had a low risk of bias, 4 studies [39, 40, 42, 44] had some concerns, and 2 studies [47, 48] had a high risk. For RPE during exercise, one additional study [46] had a high risk of bias. The most common reasons for a high risk of bias or some concerns were: not having a pre-registered trial protocol to define the intended intervention.

  • Bias due to missing outcome data: For the exercise performance outcomes, 6 studies [39, 40, 42,43,44,45] had a low risk of bias, 2 studies [41, 47] had some concerns, and 2 studies [46, 48] had a high risk. For the physiological outcomes, most studies were either low risk of bias or had some concerns, except for 1 study [48] that had a high risk of bias due to missing data for heart rate. The most common reasons for a high risk of bias or some concerns were: not describing how missing data were handled and excluding data from participants who dropped out during the intervention.

  • Bias in the measurement of the outcomes: For the exercise performance outcomes, 3 studies [41, 42, 45] had a low risk of bias, 3 studies [40, 43, 44] had some concerns, and 4 studies [39, 46,47,48] had a high risk. For the secondary outcomes, 2 studies [41, 43] had a high risk of bias for the measurement of heart rate, 1 study [44] had a high risk of bias for the measurement of core temperature, 2 studies [41, 46] had a high risk of bias for the measurement of thermal sensation, and 1 study [47] had a high risk of bias for the measurement of sweat rate. All other ratings for the secondary outcomes were either low risk of bias or some concerns. The most common reasons for a high risk of bias or some concerns were: providing insufficient details of how the outcome was measured, and not describing the method and/or stating the manufacturer and model of methodological equipment.

  • Bias in the selection of the reported results: For the exercise performance outcomes, no study had a low risk of bias, while 7 studies [39,40,41,42,43,44,45] had some concerns and 3 studies [46,47,48] had a high risk of bias. For the secondary outcomes, most studies had a high risk of bias, none had a low risk, and the remaining studies [42, 45] had some concerns. The most common reasons for a high risk of bias or some concerns were: not having a pre-registered trial protocol to define the planned reported results, and not defining how the time point of a continuous variable was selected (e.g. whether a heart rate, RPE, core temperature, etc., was an average or peak value achieved during exercise).

Fig. 2
figure 2

Risk of bias analysis. Risk of bias for each outcome variable was rated as high ( ×), low ( +), or some concerns (-) across five domains — randomisation, protocol deviation, missing data, measurement, and reporting — using the Cochrane Risk of Bias 2 (RoB2) tool. The overall risk of bias for each study was determined as: “Low” if the trial was judged to be at low risk of bias for all domains; “Some concerns” if the trial was judged to raise some concerns in at least one domain but not to be at high risk of bias for any domain; or “High” if the trial was judged to be at high risk of bias in at least one domain or if the trial was judged to have “some concerns” for multiple domains. The figure was built using the robvis R package [19]

To summarise the risk of bias, 4 studies [39, 46,47,48] had a high risk of bias for the exercise performance outcomes while the remaining 6 studies [40,41,42,43,44,45] had some concerns. Meanwhile, 5 studies [41, 43, 44, 47, 48] had high risk and 4 studies [40, 42, 45, 46] had some concerns for heart rate; 3 studies [46,47,48] had high risk and 5 studies [40,41,42,43, 45] had some concerns for RPE; 6 studies [40, 43, 44, 46,47,48] had high risk and 3 studies [41, 42, 45] had some concerns for core temperature; 5 studies [41, 43, 46,47,48] had high risk and 2 studies [42, 45] had some concerns for sweat rate; and, 4 studies [41, 46,47,48] had high risk and 2 studies [42, 45] had some concerns for thermal sensations. No study was rated with an overall low risk of bias.

Funding sources

Of the 10 included studies, one [44] was funded by the Regional Government of Extremadura, Spain, with a contribution from the European Union; one [43] received funding from the Norwegian Olympic and Paralympic Committee and Confederation of Sports; two [47, 48] were funded by internal university funds with additional support from the Canadian Centennial Scholarship Fund; and, two [45, 46] received funding from the Ministry of Defence. The remaining four studies [39,40,41,42] did not report funding sources.

Conflicts of interest

Six studies [41, 43, 45,46,47,48] reported that no conflicts of interest existed, while four [39, 40, 42, 44] did not provide this information.

Studies without usable data

Of the 10 included studies, none of the studies presented all the necessary data to complete a meta-analysis. The authors of 6 studies shared their data upon request [41, 42, 45,46,47,48] but the authors of 2 studies were unreachable (no response after several contact attempts) [40, 44] and the authors of 2 studies refused to share the necessary data [39, 43]. Fortunately, data from 4 of these studies [39, 40, 43, 44] could be calculated (e.g. post-minus-pre changes, and SD from CI [49]) or extracted from figures [43, 44] using WebPlotDigitizer software [17]. Additionally, the two studies from Kirby et al. [47, 48] re-used several participants’ data, but the authors agreed to share their raw data so it could be analysed as a single dataset and thus reduce meta-bias due to data duplication.

Quantitative synthesis — the effect of interventions

Before analysis, the following data manipulations were necessary: (i) Stevens et al. [42] reported the 10 km time trial finish time in mm:ss format, and Scoon et al. [39] and Bartolomé et al. [44] reported time-to-exhaustion in minutes; these were all converted to seconds. (ii) Because the studies from Kirby et al. [47, 48] reported total sweat loss (litres) during a 30-min heat tolerance test, the values were multiplied by 2 to derive a litre per hour sweat rate to enable a fair comparison with other studies. (iii) Similarly, Lundby et al. [43] reported total sweat loss in litres during a 50-min exercise test consisting of 15 min of submaximal riding in the heat, 5 min of riding in ambient conditions, and a 30-min time trial in the heat. Therefore, the reported sweat loss was divided by 50/60 to derive a sweat rate in litres per hour.

Effect of post-exercise heat exposure on performance in hot conditions

A random effects model showed a trivial effect with poor precision for improving endurance performance in hot conditions (Fig. 3: ratio of means=1.04, 95%CI 0.94 to 1.15, P=0.46; k=4, n=60). Heterogeneity of results across studies was negligible and might not be important (Q=1.22, P=0.75; I2=0%), but there was low predictive certainty (prediction interval [PI] 0.81 to 1.33) regarding the magnitude of future effects. However, it is important to highlight that the performance outcomes are from highly diverse performance test types (Table 2). Egger’s regression did not indicate a high risk of publication bias (t=2.27, P=0.15; Fig. 13A) but the very small number of studies (k=4) makes publication bias likely and decreases the certainty in the validity of the funnel plot. A sensitivity analysis removing the study by McIntyre et al. 2022 did not change the effect estimate or its precision (ratio of means=1.03, P=0.53; 95%CI 0.93 to 1.15).

Fig. 3
figure 3

Meta-analysis of endurance exercise performance in hot conditions (the primary outcome). A random effects model was used to determine the effect of post-exercise heat exposure (heat acclimation) on endurance exercise performance (time trial, time-to-exhaustion, or a race) in hot conditions. The ln(Ratio of means) values in the control and treatment groups represent intervention-induced changes for performance outcomes calculated as the natural log-transformed ratio of means (post divided by pre). The between-group ln(Ratio of means) values represent the relative difference in the change scores between the treatment and control groups, calculated as the natural log-transformed ratio of means, i.e. loge( (PostTreatment ÷ PreTreatment) ÷ (PostControl ÷ PreControl)). Meta-analytical calculations were performed on the ln(Ratio of means) and its corresponding standard error (SE). The effect estimates are presented as the reverse-transformed ratio of means values and the forest plot bubbles represent the ratio of means for each study. The size of the bubbles represents the weights of each study in the inverse variance model. Error bars represent the 95% confidence interval (CI). The 95% prediction interval is shown in orange. The overall risk of bias for this outcome in each study is rated as high ( ×), low (), or some concerns (!). N = sample size; SE = standard error; Z = z-test statistic and the corresponding p-value; Q = Cochrane’s Q; I2 = percentage variability in effect sizes across studies; Tau2 and Tau are measures of the dispersion of true effect sizes between studies

Effect of post-exercise heat exposure on performance in thermoneutral conditions

A random effects model showed a trivial effect with satisfactory precision for improving endurance performance in thermoneutral conditions (Fig. 4: ratio of means=1.06, 95%CI 0.99 to 1.12, P=0.08; k=6, n=144). However, there was substantial heterogeneity of results across studies (Q=13.75, P=0.02; I2=64%), and the predictive certainty about future effects was low (PI 0.85 to 1.33). Furthermore, the performance outcomes are from highly diverse performance test types (Table 2). Egger’s regression did not indicate a high risk of publication bias (t=1.30, P=0.26; Fig. 13B) but the small number of studies does not rule it out and decreases the certainty in the validity of the funnel plot. A sensitivity analysis removing the high risk of bias study by Kirby et al. had little influence on the effect estimate (ratio of means=1.05, P=0.18; 95%CI 0.98 to 1.13) and removing the high risk of bias study by Scoon et al. further reduced the effect estimate (ratio of means=1.02, P=0.17; 95%CI 0.99 to 1.06). The complex study design and analysis in Vaile et al. (see Table 2) forced the selection of a single performance outcome (total work done in 9 minutes) among several reported; removing this study slightly increased the effect estimate but worsened its precision (ratio of means=1.07, P=0.09; 95%CI 0.99 to 1.15). Removing all three studies further reduced the trivial effect size (ratio of means=1.01, P=0.50; 95%CI 0.98 to 1.05). Because substantial statistical heterogeneity was identified (I2>50%), a fixed-effects analysis was also completed but a trivial effect size persisted (ratio of means=1.04, 95%CI 1.00 to 1.07).

Fig. 4
figure 4

Meta-analysis of endurance exercise performance in thermoneutral conditions. A random effects model was used to determine the effect of post-exercise heat exposure (heat acclimation) on endurance exercise performance (time trial, time-to-exhaustion, or a race) in thermoneutral conditions. Please refer to the legend in Fig. 3 for details about the analysis and definitions of acronyms, etc.

Effect of post-exercise heat exposure on V̇O2max

A random effects model showed a small effect for improving (increasing) V̇O2max (Fig. 5: Hedges’ g=0.33, 95%CI 0.19 to 0.47, P<0.0001, PI 0.24 to 0.42; k=3, n=91) with negligible heterogeneity across studies (Q=0.05, P=0.98; I2=0.0%). Egger’s regression did not indicate a high risk of publication bias (t=-0.26, P=0.84; Fig. 13C) but the very small number of studies makes publication bias highly likely and negates the validity of the funnel plot. A sensitivity analysis to remove the high risk of bias study by Kirby et al had little impact on the effect size estimate but reduced its precision (g=0.36, P<0.0001; 95%CI -0.16 to 0.88).

Fig. 5
figure 5

Meta-analysis of V̇O2max. A random effects model was used to determine the effect of post-exercise heat exposure (heat acclimation) on V̇O2max in millilitres per kilogram body weight per minute (mL/kg/min). The meanΔ values in the control and treatment groups represent the post-intervention minus pre-intervention change scores. The mean difference values represent the difference in the change scores between the treatment and control groups, i.e. (PostTreatment—PreTreatment)—(PostControl—PreControl). The overall risk of bias for this outcome in each study is rated as high ( ×), low (), or some concerns (!). The forest plot bubbles represent the standardised mean difference (SMD; Hedges’ g) for each study. The size of the bubbles represents the weights of each study in the inverse variance model. Error bars represent the 95% confidence interval (CI). The 95% prediction interval is shown in orange. N = sample size; SE = standard error; Z = z-test statistic and the corresponding p-value; Q = Cochrane’s Q; I2 = percentage variability in effect sizes across studies; Tau2 and Tau are measures of the dispersion of true effect sizes between studies

Effect of post-exercise heat exposure on running economy

Two papers [47, 48] reported that the between-trial (heat acclimation vs. control) difference in the intervention-induced change in running economy was not statistically significant. A meta-analysis was not possible because these 2 papers are different analyses from the same experimental study and duplicate several participants’ data.

Effect of post-exercise heat exposure on speed at lactate threshold

A random effects model showed a trivial effect for improving (increasing) speed at lactate threshold (Fig. 6: g=0.19, 95%CI -0.53 to 0.92, P=0.0007, PI 0.05 to 0.33; k=2, n=55) with negligible heterogeneity across studies (Q=0.04, P=0.83; I2=0%). There was an insufficient number of studies to objectively test for publication bias (Fig. 13D); the very small number of studies negates the validity of the funnel plot and makes publication bias highly likely. Sensitivity analyses were not possible due to an insufficient number of studies.

Fig. 6
figure 6

Meta-analysis of speed at lactate threshold. A random effects model was used to determine the effect of post-exercise heat exposure (heat acclimation) on speed at lactate threshold in kilometres per hour (kph). Please refer to the legend in Fig. 5 for details about the analysis and definitions of acronyms, etc.

Effect of post-exercise heat exposure on heart rate during submaximal exercise

A random effects model showed a small effect with high precision for improving (reducing) heart rate (Fig. 7: g=-0.32, 95%CI -0.45 to -0.20, P<0.0001; k=6, n=163), with high predictive certainty (PI -0.64 to -0.01) and negligible heterogeneity across studies (Q=0.83, P=1.00; I2=0%). Egger's regression did not indicate a high risk of publication bias (t=0.64, P=0.55; Fig. 13E) but the small number of included studies does not rule it out and decreases the certainty in the validity of the funnel plot. Sensitivity analyses removing individual high risk of bias studies had little impact on the effect size estimate or its precision and predictive certainty (excluding Kirby et al: g=-0.32, P<0.0001; 95%CI -0.47 to -0.16; excluding Zurawlew et al: g=-0.29, P<0.0001; 95%CI -0.44 to -0.13; excluding Lundby et al: g=-0.32, P<0.0001; 95%CI -0.50 to -0.14). However, removing all high risk of bias studies reduced the effect size and worsened its precision (g=-0.20, P=0.13; 95%CI -0.75 to 0.36).

Fig. 7
figure 7

Meta-analysis of heart rate during submaximal exercise. A random effects model was used to determine the effect of post-exercise heat exposure (heat acclimation) on heart rate in beats per minute (bpm) during submaximal exercise. Please refer to the legend in Fig. 5 for details about the analysis and definitions of acronyms, etc.

Effect of post-exercise heat exposure on RPE during submaximal exercise

A random effects model showed a trivial effect with poor precision for worsening (increasing) RPE (Fig. 8: g=0.11, 95%CI -0.38 to 0.61, P=0.59; k=6, n=163), with moderate heterogeneity across studies (Q=12.8, P=0.08; I2=46%) and low predictive certainty (PI -0.95 to 1.18) concerning where future effects will be found. Egger’s regression did not indicate a high risk of publication bias (t=0.14, P=0.89; Fig. 13F) but the small number of included studies does not rule it out and decreases the certainty in the validity of the funnel plot. Sensitivity analyses removing individual or all high risk of bias studies had little impact on the original effect estimate or its precision (excluding Kirby et al: g=0.22, P=0.35; 95%CI -0.34 to 0.78; excluding McIntyre et al. 2022: g=0.06, P=0.80; 95%CI -0.50 to 0.62; excluding both: g=0.15, P=0.55; 95%CI -0.51 to 0.82).

Fig. 8
figure 8

Meta-analysis of RPE during submaximal exercise. A random effects model was used to determine the effect of post-exercise heat exposure (heat acclimation) on ratings of perceived exertion (RPE; arbitrary units, a.u.) during submaximal exercise. RPE data (Borg 6–20 and 0–10) were scaled to 0–100 before meta-analysis. Therefore, the post-intervention minus pre-intervention change scores (meanΔ) and the mean difference values are derived from pre- and post-intervention values that were scaled to 0–100. Please refer to the legend in Fig. 5 for details about the analysis and definitions of acronyms, etc.

Effect of post-exercise heat exposure on core temperature during submaximal exercise

A random effects model showed a small effect with high precision for improving (preventing the rise in) core temperature (Fig. 9: g=-0.44, 95%CI -0.79 to -0.09, P=0.003; k=6, n=163). There was negligible heterogeneity in results across studies (Q=6.41, P=0.49; I2=0%), but low predictive certainty (PI -1.32 to 0.44). Egger’s regression did not indicate a high risk of publication bias (t=0.81, P=0.45; Fig. 13G) but the small number of included studies does not rule it out and decreases the certainty in the validity of the funnel plot. Sensitivity analyses removing individual high risk of bias studies had little impact on the original effect estimate or its precision (excluding Lundby et al: g=-0.53, P=0.008; 95%CI -1.04 to -0.01; excluding Kirby et al: g=-0.35, P=0.04; 95%CI -0.76 to 0.06; excluding McIntyre et al. 2022: g=-0.41, P=0.01; 95%CI -0.82 to -0.01). The small effect size persisted when all high risk of bias studies were removed, but its precision was worsened (g=-0.36, P=0.26; 95%CI -1.40 to 0.67). Please note that core temperature data in Vaile et al. was not included in the meta-analysis because exact baseline data could not be retrieved for the control and treatment groups. Although their paper states that “average pre-exercise rectal temperature regardless of intervention group or day of exercise was 37.3 ± 0.28”, the authors could not be reached to resolve this matter.

Fig. 9
figure 9

Meta-analysis of core temperature during submaximal exercise. A random effects model was used to determine the effect of post-exercise heat exposure (heat acclimation) on core temperature in degrees Celsius (°C) during submaximal exercise. Please refer to the legend in Fig. 5 for details about the analysis and definitions of acronyms, etc.

Effect of post-exercise heat exposure on sweat rate during submaximal exercise

A random effects model showed a small effect with high precision for improving (increasing) sweat rate (Fig. 10: g=0.27, 95%CI 0.06 to 0.47, P=0.002; k=6, n=139). There was negligible heterogeneity across studies (Q=1.55, P=0.96; I2=0%), but low predictive certainty in future effects (PI -0.21 to 0.74). Egger’s regression did not indicate a high risk of publication bias (t=0.43, P=0.69; Fig. 13H) but the small number of included studies does not rule it out and decreases the certainty in the validity of the funnel plot. Sensitivity analyses removing individual high risk of bias studies had little impact on the original effect estimate or its precision (excluding Lundby et al: g=0.28, P=0.006; 95%CI 0.02 to 0.54; excluding Kirby et al: g=0.25, P=0.02; 95%CI -0.03 to 0.52; excluding McIntyre et al. 2022: g=0.28, P=0.004; 95%CI 0.03 to 0.53; excluding Zurawlew et al: g=0.35, P<0.0001; 95%CI 0.12 to 0.59). Removing all high risk of bias studies left only 2 studies in the analysis: the strength of the effect estimate increased but its precision worsened (g=0.62, P<0.0001; 95%CI -0.23 to 1.47).

Fig. 10
figure 10

Meta-analysis of sweat rate during submaximal exercise. A random effects model was used to determine the effect of post-exercise heat exposure (heat acclimation) on sweat rate in litres per hour (L/h) during submaximal exercise. Please refer to the legend in Fig. 5 for details about the analysis and definitions of acronyms, etc.

Effect of post-exercise heat exposure on thermal sensations during submaximal exercise

A random effects model showed a moderate-sized effect with high precision for improving (reducing) thermal sensations (Fig. 11: g=-0.61, 95%CI -0.98 to -0.24, P<0.0001; k=5, n=112). Heterogeneity across studies was negligible (Q=2.92, P=0.71; I2 = 0%), but there is low predictive certainty about where future effects will lie (PI -1.29 to 0.07). Egger’s regression did not indicate a high risk of publication bias (t=1.58, P=0.19; Fig. 13I) but the small number of included studies does not rule it out and decreases the certainty in the validity of the funnel plot. Sensitivity analyses removing individual high risk of bias studies had little impact on the original effect estimate or its precision (excluding Kirby et al: g=-0.48, P=0.003; 95%CI -0.93 to -0.03; excluding McIntyre et al. 2022: g=-0.56, P=0.0004; 95%CI -1.00 to -0.12; excluding Zurawlew et al: g=-0.61, P=0.005; 95%CI -1.30 to 0.08). Removing all high risk of bias studies left just 2 studies in the analysis; this reduced the strength of the estimate to a trivial effect with poor precision (g=-0.12, P=0.30; 95%CI -1.60 to 1.36).

Fig. 11
figure 11

Meta-analysis of thermal sensation during submaximal exercise. A random effects model was used to determine the effect of post-exercise heat exposure (heat acclimation) on thermal sensation (arbitrary units, a.u.) during submaximal exercise. Thermal sensation data were scaled to 0–100 before meta-analysis. Therefore, the post-intervention minus pre-intervention change scores (meanΔ) and the mean difference values are derived from pre- and post-intervention values that were scaled to 0–100. Please refer to the legend in Fig. 5 for details about the analysis and definitions of acronyms, etc.

Effect of post-exercise heat exposure on thermal comfort during submaximal exercise

A random effects model showed a moderate effect with poor precision for worsening (decreasing) thermal comfort (Fig. 12: g=-0.56, 95%CI -9.78 to 8.65, P=0.44; k=2, n=55). However, there was considerable heterogeneity across studies (Q=5.45, P=0.02; I2 = 81.7%) and very low predictive certainty about where future effects will lie (PI -15.5 to 14.4). There were insufficient studies to objectively test for publication bias (Fig. 13J); however, the validity of the funnel plot is poor due to the very small number of studies, making publication bias highly likely. Sensitivity analyses were not possible due to an insufficient number of studies.

Fig. 12
figure 12

Meta-analysis of thermal comfort during submaximal exercise. A random effects model was used to determine the effect of post-exercise heat exposure (heat acclimation) on thermal comfort (arbitrary units, a.u.) during submaximal exercise. Thermal comfort data were scaled to 0–100 before meta-analysis. Therefore, the post-intervention minus pre-intervention change scores (meanΔ) and the mean difference values are derived from pre- and post-intervention values that were scaled to 0–100. Please refer to the legend in Fig. 5 for details about the analysis and definitions of acronyms, etc.

Fig. 13
figure 13

Funnel plots to represent potential publication bias. Contour-enhanced funnel plots between the natural log of the ratio of means (the effect size estimate) and the standard error of the ratio of means were constructed for: A performance in hot conditions, and B performance in thermoneutral conditions. Contour-enhanced funnel plots between the standardised mean difference (Hedges’ g; the effect size estimate) and the standard error of the standardised mean difference were constructed for: C V̇O2max and D speed at lactate threshold, as well as E heart rate, F RPE, G core temperature, H sweat rate, I thermal sensations, and J thermal comfort during submaximal exercise. Grey-filled circles represent the individual studies. The vertical light grey line represents the combined effect estimate: ln(ratio of mean) in panels A and B, and Hedges’ g in panels C to J. The t-statistics and P-values from Egger’s regression are shown: the regions inside the solid orange funnels represent P > 0.10; the regions between the solid orange lines and the dashed orange lines represent 0.10 > P > 0.05; the region between the dashed orange lines and the dotted orange lines represent 0.05 > P > 0.01; and the regions outside the dotted orange funnels represent P < 0.01. IMPORTANT: Because the included studies have a small sample size and are few in number, there is low certainty in the validity of the funnel plots, and they should be interpreted with caution

Sub-group analysis

Due to insufficient data, it was not possible to complete the planned sub-group analyses for training status (trained athletes vs. non-athletes/untrained people) or temperature dependency (performance in hot vs. thermoneutral conditions). No other sub-group analyses were planned (e.g. sex [male vs. female] or heat exposure type [sauna vs. hot water immersion], etc.) and, given the small number of studies, these comparisons are not currently appropriate to add.

Statistical power in the meta-analysis

The statistical power was very low for each variable (Figs. 3, 4, 5, 6, 7, 8, 9, 10 and 11). For example, there was only 5% power to detect a meaningful effect of post-exercise heat exposure on the primary outcome (Fig. 3) and between 5 and 15% power to detect meaningful effects across the secondary outcomes (Figs. 4, 5, 6, 7, 8, 9, 10 and 11). This suggests that the included body of studies cannot reliably detect true effect sizes of interest and, therefore, may not be very informative.

Quality of evidence (GRADE)

The quality of evidence for the primary outcome — exercise performance in hot conditions — was graded as very low. For the secondary outcomes, the quality of evidence for exercise performance in thermoneutral conditions, V̇O2max, and speed at lactate threshold were also graded as very low, and the physiological measurements during submaximal exercise were graded as either low (core temperature and thermal sensation) or very low (heart rate, RPE, sweat rate, and thermal comfort). The GRADE summary of findings and reasons for downgrading the certainty in the effect estimate are presented in Table 3.

Table 3 GRADE summary of findings

Data availability

Full outcome variable data extracted from the individual studies (including data extracted using WebPlotDigitizer) are available from the Open Science Foundation data registry [50].

Discussion

Summary of findings

This systematic review included 10 studies (199 participants; 156 males and 43 females, aged 20 ± 2 to 32 ± 4 years) investigating the effects of heat acclimation via post-exercise heat exposure. For the primary outcome — exercise performance in hot conditions (air temperature 33–40 °C; RH 40–61%) — there was a trivial effect favouring treatment vs. control, with poor precision, low statistical power, and low predictive certainty (k = 4, n = 60: ratio of means = 1.04, P = 0.46; 95%CI 0.94 to 1.15; PI 0.81 to 1.33; power = 0.05). Similarly, there were trivial effects favouring treatment vs. control, with poor precision, low power, and low predictive certainty on exercise performance in thermoneutral conditions (air temperature 18–24 °C; RH 42%; k = 6, n = 144: ratio of means = 1.06, P = 0.08; 95%CI 0.99 to 1.12; PI 0.85 to 1.33; power = 0.06) and speed at lactate threshold (k = 2, n = 55: g = 0.19, P = 0.0007; 95%CI -0.53 to 0.92; PI -0.53 to 0.92; power = 0.08). For the remaining secondary outcomes, post-exercise heat exposure had small to moderate beneficial effects with better precision, but statistical power and predictive certainty remained low: V̇O2max (k = 3, n = 91: g = 0.33, P < 0.0001; 95%CI 0.19, to 0.47; PI 0.19, to 0.47; power = 0.15), heart rate (k = 6, n = 163: g = -0.32, P < 0.0001; 95%CI -0.45 to -0.20; PI -0.45 to -0.20; power = 0.10), core temperature (k = 6, n = 163: g = -0.44, P = 0.003; 95%CI -0.79 to -0.09; PI -0.79 to -0.09; power = 0.15), sweat rate (k = 6, n = 139: g = 0.27, P = 0.002; 95%CI 0.06 to 0.47; PI 0.06 to 0.47; power = 0.09), and thermal sensations (k = 5, n = 112: g = -0.61, P < 0.0001; 95%CI -0.98 to -0.24; PI -1.29 to 0.07; power = 0.07). Furthermore, there were effects favouring control with poor precision, power, and predictive certainty for RPE (k = 6, n = 163: g = 0.11, P = 0.59; 95%CI -0.38 to 0.61; PI -0.95 to 1.18; power = 0.07) and thermal comfort (k = 2, n = 55: g = -0.56, P = 0.44; 95%CI -9.78 to 8.65; PI -15.5 to 14.4; power = 0.08). Except for performance in thermoneutral conditions and thermal comfort, which had substantial to considerable heterogeneity (I2 = 64% and 82%, respectively), effects were generally consistent across studies. However, prediction intervals revealed a wide range of possible effects in which future studies would fall.

Quality of the evidence

While there were generally no funding issues, 4/10 studies [39,40,41,42] didn’t describe the funding sources and 4/10 [39, 40, 42, 44] didn’t report whether conflicts of interest existed, or not. Furthermore, all 10 included studies were rated with a high risk of bias or some concerns for all outcome variables. Bias arose from the allocation and randomisation process, the (participant and investigator/data analyst) blinding process, deviations from the intended interventions, missing outcome data, the measurement of the outcomes, and selective reporting of the outcomes. Across all outcomes, there was also generally poor precision in the effect estimates, low statistical power, and low predictive certainty. Subsequently, the GRADE certainty in the effect estimates, which assesses quality across five domains (risk of bias, inconsistency, indirectness, imprecision and publication bias) and reflects the extent to which there is confidence in the effect estimate, was rated as very low for the primary outcome — exercise performance in hot conditions — and as low to very low for the secondary outcomes (Table 3). This means that the true effects may be substantially different from those measured in this analysis.

Sensitivity analyses to remove the high risk of bias studies revealed further information about the quality of the evidence. While sensitivity analyses didn’t affect the effect estimates or their precision for either performance in hot conditions or RPE, sensitivity analyses had varying impacts on other secondary outcomes. Removing the high risk of bias studies further reduced the effect on performance in thermoneutral conditions; worsened the precision of the effect estimate for V̇O2max, sweat rate, and core temperature; and reduced the effect size and its precision, and removed statistical significance for heart rate and thermal sensations. Therefore, effect estimates were sensitive to the high risk of bias studies across most outcomes.

Overall, the low quality of evidence prompts little confidence in the effect estimates. Consequently, firm conclusions cannot currently be made concerning the effect of post-exercise heat exposure interventions to improve endurance exercise performance. To remedy this, further high-quality randomised controlled trials are needed.

Generalisability of the findings

The findings might be generalisable to competitive endurance performance because 7/10 studies [39, 40, 42,43,44, 47, 48] included trained athletes — runners, triathletes, race walkers, cyclists, and soccer players. These 7 studies could also be considered to have a high level of ecological validity because the participants were endurance-trained athletes [39, 40, 42,43,44, 47, 48], athletes continued their normal training habits [39, 42,43,44, 47, 48], athletes trained in their typical training environment [39, 43, 44, 47, 48], the performance tests were designed to simulate the demands of a race [40], or performance was assessed using a race against other athletes [42]. One example from Stevens et al. [42] examined world-class race walkers at a training camp and used a 10 km race against other athletes as the performance outcome — this is as ecologically valid as it gets. Meanwhile, the remaining 3/10 studies [41, 45, 46] have a low-to-moderate level of ecological validity because the participants are recreationally active and not endurance trained, and because the daily exercise sessions were identical (40–60 min/day at the same fixed intensity), which does not represent endurance athletes’ habits.

The overall generalisability of the findings is doubtful because there were only 10 studies with small sample sizes (N = 6 to 41) and the studies were only conducted in Europe (Norway, Spain, UK), Australia, and New Zealand. There was also a predominance of young male participants (156 males and 43 females; age 20 ± 2 to 32 ± 4 years) — only 3/10 studies [42, 47, 48] included female participants — and no study described the race or ethnicity of the participants. Furthermore, only 5/10 studies [39, 42, 44, 47, 48] completed the exercise intervention outdoors in the participants’ typical training environment, with the remaining 5 studies [40, 41, 43, 45, 46] completing the training interventions indoors in a laboratory setting. Additionally, the interventions and performance outcomes were highly heterogeneous across studies, making it difficult to accurately summarise the generalisability.

Strengths and weaknesses in the review process

Limitations in the body of evidence

During study selection, several studies were excluded because they lacked a non-heat-exposure control group (e.g. [51,52,53,54,55]) or did not measure performance via time-to-completion (time trials or races) or time-to-exhaustion tests (e.g. [51, 54,55,56,57,58,59]). Of the included studies, the body of evidence is very small: there are few studies (k = 10) with small sample sizes (N = 6 to N = 41 across studies, with 199 participants in total). Larger sample sizes would enable more reliable detection of a wider range of effect sizes. Because of the small sample size, the statistical power was low for every outcome across all studies, indicating that this meta-analysis contains a body of studies that cannot reliably detect effect sizes of interest and, therefore, may not be very informative. All studies were also rated as having a high risk of bias or some concerns across all outcomes; no study was rated with an overall low risk of bias for any outcome. In general, most studies do not describe the randomisation, allocation concealment, or blinding approaches, have poor descriptions of the methods used to measure outcomes, and do not adequately describe the selection of the outcome (e.g. it is often unclear whether a physiological measure during exercise is a mean or peak value or whether the value represents the entire exercise duration, a portion of it, or the endpoint). Specifically, while it is impossible to blind participants to interventions like exercise or heat exposure, it is possible to blind investigators and data analysts to group allocation; no study provided this info.

There is also a sex imbalance in this body of evidence: only 3/10 studies [42, 47, 48] included female participants and only 43/199 participants in the included studies were female. While a female-specific systematic review published in 2023 [6] concluded that heat acclimation can improve performance in females, it pooled all types of heat acclimation approaches: exercise-in-the-heat and post-exercise heat exposure interventions.

Limitations in this systematic review and meta-analysis

The small number of included studies and the small sample size created several limitations. For example, it was not possible to conduct planned sub-group analyses to examine whether training status (trained athletes vs. non-athletes/untrained people) or temperature dependency (performance in hot vs. thermoneutral conditions) were influential. It was also not feasible to complete additional unplanned sub-group analyses (sex, male vs. female; heat exposure type, sauna vs. hot water immersion; intervention duration; etc.). This is unfortunate given the influence of training status/fitness, sex, and intervention duration seen with exercise-in-the-heat interventions [2, 3, 6].

The small number of studies and low sample size also create substantial uncertainty in the value of I2 and the validity of the funnel plots and their associated publication bias metrics, which should be interpreted with caution. This same reason combined with the large heterogeneity of study designs also creates uncertainty in the accuracy of the prediction intervals. There was also an issue with missing data: for example, some of the included papers lacked sufficient data to perform analyses and some data requests were problematic (some authors were unreachable while others refused to share data). This is unfortunate given the current need for open science and transparency. Missing data may influence the conclusions of the meta-analysis; however, this is unlikely given the low number of studies and small sample sizes.

A further limitation arises due to meta-bias. Firstly, bias was introduced by pooling time-to-completion and time-to-exhaustion performance tests using different units of measurement (watts, kcals, seconds, metres, etc.), which is important because time-to-exhaustion can have poorer reliability than time-to-completion tests [60]. A second limitation of the meta-analysis is the analysis of study-level rather than individual subject-level data. Thirdly, there was potential for meta-bias caused by participant data duplication in the studies by Kirby et al. [47, 48]; however, their data is open access [61] and the authors kindly agreed to share their raw data to help avoid this issue. Fourthly, due to unobtainable data, the within-group changes for 3/10 studies [40, 43, 44] were estimated as the change in the post minus pre mean value rather than the mean of the individual participant post minus pre changes. Fifthly, due to unobtainable data, between-group differences and baseline SD values were extracted from figures in 2/10 studies [43, 44] using WebPlotDigitizer [17]. Such data estimates reduce the accuracy of the meta-analytical calculations. And, lastly, in the study by Vaile et al. [40], participants completed identical training sessions on 5 consecutive days consisting of 105 min of cycling with 66 maximal sprints and 9 min of sustained time trial effort, with outcomes measured on each of the 5 days. The design and data analysis are complex with multiple time-points measured within multiple days and multiple “performance” outcomes reported; to answer the specific question being asked in the current review, the meta-analysis compared total work done during 9 min of time trial on the last (day 5) with measurements made on the first day (day 1). Selecting this specific outcome and these specific time points amongst all combinations introduces meta-bias; a sensitivity analysis showed that this study had a substantial impact on the precision of the effect estimate.

Solutions to the limitations

Following this systematic review, several concepts remain unclear. For example, the current body of evidence cannot conclude whether there is an optimal temperature, duration, or modality (e.g. sauna vs. hot water immersion) of post-exercise heat exposure or an optimal time course of delay between exercise and heat exposure. Furthermore, while most [62,63,64,65,66] but not all [67,68,69] studies show that hydration status doesn’t influence adaptations to active heat acclimation (exercise-in-the-heat), it remains unclear whether hydration status influences the effect of post-exercise heat exposure. Accordingly, none of the studies included in this review measured hydration status and only 5/10 studies described fluid intake: three [39, 47, 48] reported that ab libitum water intake was permitted during post-exercise heat exposure, one [44] described that fluid intake was limited to 250 ml, and one [41] described that there was no fluid intake. Additionally, it is possible that natural heat acclimatisation — caused when athletes’ habitual training is conducted outdoors in hot conditions — may mask the effect of post-exercise heat exposure interventions. For example, in the study by Stevens et al. [42], participants were likely already heat acclimatized because they trained in the summer heat for 4 weeks before the study and for 15 days during the study. This may explain the trivial effect of post-exercise heat exposure in that study (SMD = 0.15, 95%CI -0.97 to 1.31). The studies by Kirby et al. [47, 48] attempted to minimise this issue by conducting interventions in the UK between October and March. This type of limitation should be considered in future studies.

To resolve these limitations, future studies should: (i) Pre-register their protocols with methods that clearly describe how outcomes will be compared. (ii) Use a randomised controlled design, ideally with crossover, with large sample sizes that are sufficiently powerful to detect meaningful differences. (iii) Fully describe randomisation, allocation concealment, and (participant and investigator) blinding procedures. (iv) Include female participants. (v) Include endurance-trained athletes, especially elite athletes, if possible. (vi) Measure hydration status and describe fluid intake. (vii) Determine the dose response of post-exercise heat exposure. (viii) Determine the optimal modality of post-exercise heat exposure. And (ix) Publish raw data in line with open science practices, which would improve efficiency in the synthesis of future systematic reviews and enable subject-level meta-analyses.

Strengths of this systematic review

To minimise reporting bias and increase research transparency, the review protocol was registered on OSF before the literature search commenced and, in line with open science practices, all outcome data is freely available from the OSF registry [15]. To broaden the coverage of the literature search, an independent scientist followed PRESS guidelines [14] to peer-review the search strategy before the literature search commenced. To obtain relevant information with high ecological validity, performance outcomes from time-to-completion tests (time trials or races) and time-to-exhaustion tests were chosen as the primary outcome. To account for variation in these types of performance tests and their units of measurement, we used a ratio of means method with natural log transformation in the meta-analysis. The Cochrane Handbook [12] was used as a framework to use standardized and repeatable approaches to synthesise the data, assess the risk of bias [18], and GRADE the certainty in the effect estimates [37]. The authors (TS and ML) independently completed several aspects of the review (literature searching, screening, risk of bias analyses, data extraction) and met to discuss and reach agreements after each step of the process. To minimise the risk of garbage-in-garbage-out, an objective risk of bias analysis [18] was used to assess study quality, and sensitivity analyses were completed to determine the impact of high risk of bias studies on the effect estimates. Furthermore, in addition to traditional estimates of precision (SE and 95%CI), prediction intervals were also calculated to determine the expected range of effects of future studies [27,28,29].

Comparison to existing systematic reviews

Several systematic reviews have examined the effect of heat acclimation on exercise performance and physiological measures [1,2,3,4, 6, 7, 11]. However, such reviews have either studied the effect of “active” heat acclimation (exercise-in-the-heat) or pooled results from both “active” and “passive” heat acclimation (daily post-exercise heat exposure) protocols. Consequently, the current review is the first to examine “passive” heat acclimation in isolation. Nonetheless, it’s important to compare the current findings with those from previous reviews:

Chalmers et al. (2014) did not perform a meta-analysis but concluded that heat acclimation generally improves aerobic, not anaerobic, performance [1]. However, the authors warned about the accuracy of their conclusions due to a moderate level of bias in the induced studies. Tyler et al. (2016) compared the effect of short, medium, and long-term heat acclimation protocols, finding a moderate to large beneficial effect of all protocols on exercise performance (ES = 0.52, 0.75, and 0.93), along with beneficial effects on heart rate (ES = -0.87), RPE (ES = -0.63), core temperature (ES = -0.51), sweat rate (ES = 0.61), and thermal sensation (ES = -0.68) [2]. However, there was a high risk of bias across several domains and the certainty in the effect estimates was not assessed. Benjamin et al. (2019) also found a positive effect of heat acclimation on performance (time-to-exhaustion: effect size, ES = 0.86); time-to-completion time trials; ES = 0.49) and a small effect on V̇O2max (ES = 0.30); however, neither the risk of bias nor the certainty in the effect estimates was assessed [3]. Rahimi et al. (2019) found a moderate beneficial effect of heat acclimation on time trial performance (ES = 0.50), a large beneficial effect on heart rate (ES = 1.0), but no benefit for V̇O2max, RPE, core temperature, or thermal comfort [4]. However, the study quality assessment found a moderate level of bias in the included studies and the certainty in the effect estimates was not assessed. Waldron et al. (2021) found small to moderate benefits on V̇O2max in thermoneutral (ES = 0.42) and hot conditions (ES = 0.63), and although the authors concluded that the included studies had a generally low risk of bias [11], the risk of bias was high or unclear across several domains and the certainty in the effect estimates was not assessed. Kelly et al. (2023), which studied exclusively female participants, found beneficial effects of heat acclimation on performance (pooled time-to-completion and time-to-exhaustion tests: ES = 1.00), heart rate (ES = -0.60), sweat rate (ES = 0.53), and core temperature (ES = -0.81) [6]. Again, although the authors concluded that included studies mostly had a low risk of bias, their data show a high risk of bias or some concerns across several domains and the certainty in the effect estimates was not assessed. And, lastly, an updated 2024 meta-analysis from Tyler et al. confirmed their previous findings (moderate to large beneficial effects across outcomes); however, while the certainty of the evidence was not assessed, there was a high risk of bias across studies, considerable between-study heterogeneity, and wide prediction intervals [7].

In general, existing systematic reviews [2,3,4, 6, 7] find a beneficial effect of heat acclimation (active alone or pooled active and passive) on performance, but they also have a small sample size, a small number of included studies, and concern with the risk of bias. Plus, there is a lack of assessment for the certainty in the effect estimates. This systematic review concludes that there are trivial effects in favour of post-exercise heat exposure (passive heat acclimation) vs. control on exercise performance in hot conditions, performance in thermoneutral conditions, and speed at lactate threshold, small effects on V̇O2max, heart rate, core temperature, and sweat rate, and a moderate-sized effect on thermal sensations. This indicates that the cardiovascular, metabolic, and thermoregulatory adaptations that improve performance might be similar between active and passive heat acclimation strategies. However, there is a low to very low certainty in the effect estimates across all outcomes — this is the primary reason for the difference in the effect of heat acclimation on performance in hot conditions between this review and existing reviews. However, other reasons include: (i) the possibility that “active” heat acclimation is superior to “passive” heat acclimation, but this remains to be tested, and (ii) the dose of heat exposure in the current review is relatively short (mean of 287 min over 8.5 sessions) compared to most of the existing reviews (e.g., Chalmers et al. reported a mean of 419 min over 5.8 sessions [1]), while the reviews by Benjamin et al., Waldron et al., and Kelly et al. found that the number of heat exposure days or total heat dose significantly influenced the performance effects [3, 6, 11].

While this is the first systematic review to examine the specific effect of post-exercise heat exposure, the practicalities of such an approach have been articulated elsewhere. For example, Heathcote and colleagues recommended 6–7 heat sessions on consecutive days for at least 30 min as soon as possible after exercise to improve performance but, in agreement with the current review, cautioned the need for more studies to fully understand the effect [10]. Casadio and colleagues agree that heat acclimation using post-exercise heat exposure could improve performance, but pose additional considerations [9]: namely, they ponder whether heat stress alters total training stress and the quality of athletes’ subsequent sessions, and how much between-athlete variability in performance outcomes exists between active and passive heat acclimation approaches. Such questions must be answered by future studies.

Relevance of findings to coaching practice and athleteperformance

The current evidence suggests small to moderate-sized beneficial effects of post-exercise heat exposure on V̇O2max and some physiological measures (heart rate, core temperature, sweat rate, and thermal sensations). Such effects have the potential to extrapolate to performance benefits because, in elite sports, even small-to-moderate effect sizes can translate to meaningfully large, perhaps unrealistic, improvements in endurance performance (e.g., 2–3 min in a marathon). Nonetheless, the current evidence shows only a trivial effect on performance. Despite the percentage equivalents of these effect estimates potentially equating to meaningful performance gains — 4% (95%CI: -6% to 15%) in hot conditions and 6% (95%CI: -1% to 12%) in thermoneutral conditions, on average — the large uncertainty in these estimates combined with the low to very low quality of evidence prevents firm conclusions about the efficacy of this type of heat acclimation until further high-quality trials are published. That said, because post-exercise heat exposure doesn’t appear to harm endurance performance, coaches and athletes could consider its use. However, it is important to consider whether the additional 30 to 40 min a day required for this strategy could be better used in other areas of training and recovery — training load optimization, sleep, nutrition, rest, etc. Furthermore, coaches and athletes should consider the practicality of different heat acclimation approaches. For example, if an athlete travels to a race or lives in an Olympic village before a race, post-exercise heat exposure in a hot bath or sauna (passive acclimation) allows an athlete to train and taper as usual without having to find exercise equipment located in a heat chamber (active acclimation).

Conclusions

The current evidence shows that heat acclimation using post-exercise heat exposure might improve physiological responses during submaximal exercise (increased sweat rate and decreased heart rate, core temperature, and thermal sensations). However, given the predominance of low to very low quality evidence, the effect of this method of heat acclimation on endurance exercise performance is uncertain. Further high-quality trials are needed to bolster the evidence and to enable conclusions concerning the efficacy of post-exercise heat exposure for improving endurance exercise performance.

Data availability

The datasets supporting the conclusions of this article are available in the Open Science Foundation repository (https://doiorg.publicaciones.saludcastillayleon.es/https://doiorg.publicaciones.saludcastillayleon.es/10.17605/OSF.IO/6FGC2) [1].

Abbreviations

RPE:

Rating of perceived exertion

PICO:

Population, Interventions, Comparisons, and Outcomes

OSF:

Open Science Foundation

PRISMA:

Preferred Reporting Items for Systematic Reviews and Meta-Analyses

BMI:

Body mass index

SD:

Standard deviation

SE :

Standard error

RoB2:

Cochrane Risk of Bias 2

ROM:

Ratio of mean

SMD:

Standardised mean difference

GRADE:

Grading of Recommendations Assessment, Development and Evaluation

OIS:

Optimal information size

RH:

Relative humidity

CI:

Confidence interval

PI:

Prediction interval

References

  1. Chalmers S, Esterman A, Eston R, Bowering KJ, Norton K. Short-term heat acclimation training improves physical performance: a systematic review, and exploration of physiological adaptations and application for team sports. Sports Med. 2014;44:971–88.

    Article  PubMed  Google Scholar 

  2. Tyler CJ, Reeve T, Hodges GJ, Cheung SS. The effects of heat adaptation on physiology, perception and exercise performance in the heat: a meta-analysis. Sports Med. 2016;46:1699–724.

    Article  PubMed  Google Scholar 

  3. Benjamin CL, Sekiguchi Y, Fry LA, Casa DJ. Performance changes following heat acclimation and the factors that influence these changes: meta-analysis and meta-regression. Front Physiol. 2019;10:1448.

    Article  PubMed  PubMed Central  Google Scholar 

  4. Rahimi GRM, Albanaqi AL, Van der Touw T, Smart NA. Physiological responses to heat acclimation: a systematic review and meta-analysis of randomized controlled trials. J Sports Sci Med. 2019;18:316–26.

    PubMed  PubMed Central  Google Scholar 

  5. Daanen HAM, Racinais S, Périard JD. Heat Acclimation decay and re-induction: a systematic review and meta-analysis. Sports Med. 2018;48:409–30.

    Article  PubMed  Google Scholar 

  6. Kelly MK, Bowe SJ, Jardine WT, Condo D, Guy JH, Snow RJ, et al. Heat adaptation for females: a systematic review and meta-analysis of physiological adaptations and exercise performance in the heat. Sports Med. 2023;53:1395–421.

    Article  PubMed  PubMed Central  Google Scholar 

  7. Tyler CJ, Reeve T, Sieh N, Cheung SS. Effects of heat adaptation on physiology, perception, and exercise performance in the heat: an updated meta-analysis. J Sci Sport Exerc. 2024. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/s42978-023-00263-8.

    Article  Google Scholar 

  8. Périard JD, Racinais S, Sawka MN. Adaptations and mechanisms of human heat acclimation: Applications for competitive athletes and sports. Scand J Med Sci Sports. 2015;25(Suppl 1):20–38.

    Article  PubMed  Google Scholar 

  9. Casadio JR, Kilding AE, Cotter JD, Laursen PB. From lab to real world: heat acclimation considerations for elite athletes. Sports Med. 2017;47:1467–76.

    Article  PubMed  Google Scholar 

  10. Heathcote SL, Hassmén P, Zhou S, Stevens CJ. Passive heating: reviewing practical heat acclimation strategies for endurance athletes. Front Physiol. 2018;9:1851.

    Article  PubMed  PubMed Central  Google Scholar 

  11. Waldron M, Fowler R, Heffernan S, Tallent J, Kilduff L, Jeffries O. Effects of heat acclimation and acclimatisation on maximal aerobic capacity compared to exercise alone in both thermoneutral and hot environments: a meta-analysis and meta-regression. Sports Med. 2021;51:1509–25.

    Article  PubMed  PubMed Central  Google Scholar 

  12. Higgins J, Thomas J, Chandler J, Cumpston M, Li T, Page M, et al. Cochrane Handbook for Systematic Reviews of Interventions version 6.4 (updated August 2023). Cochrane, 2023.

  13. Garber CE, Blissmer B, Deschenes MR, Franklin BA, Lamonte MJ, Lee I-M, et al. American College of Sports Medicine position stand Quantity and quality of exercise for developing and maintaining cardiorespiratory, musculoskeletal, and neuromotor fitness in apparently healthy adults: guidance for prescribing exercise. Med Sci Sports Exerc. 2011;43:1334–59.

    Article  PubMed  Google Scholar 

  14. McGowan J, Sampson M, Salzwedel DM, Cogo E, Foerster V, Lefebvre C. PRESS peer review of electronic search strategies: 2015 guideline statement. J Clin Epidemiol. 2016;75:40–6.

    Article  PubMed  Google Scholar 

  15. Solomon TP. Open Science Framework - Registered protocol: A systematic review of the effect of passive heat acclimation on exercise performance. 2022. https://doiorg.publicaciones.saludcastillayleon.es/10.17605/OSF.IO/256XZ.

  16. Page MJ, McKenzie JE, Bossuyt PM, Boutron I, Hoffmann TC, Mulrow CD, The PRISMA, et al. statement: an updated guideline for reporting systematic reviews. BMJ. 2020;2021:n71.

    Google Scholar 

  17. Rohatgi A. Webplotdigitizer: Version 4.6. 2022.

  18. Sterne JAC, Savović J, Page MJ, Elbers RG, Blencowe NS, Boutron I, et al. RoB 2: a revised tool for assessing risk of bias in randomised trials. BMJ. 2019:l4898.

  19. McGuinness LA, Higgins JPT. Risk-of-bias VISualization (robvis): An R package and Shiny web app for visualizing risk-of-bias assessments. Res Synth Methods. 2021;12:55–61.

    Article  PubMed  Google Scholar 

  20. Friedrich JO, Adhikari NK, Beyene J. The ratio of means method as an alternative to mean differences for analyzing continuous outcome variables in meta-analysis: A simulation study. BMC Med Res Methodol. 2008;8:32.

    Article  PubMed  PubMed Central  Google Scholar 

  21. Friedrich JO, Adhikari NKJ, Beyene J. Ratio of means for analyzing continuous outcomes in meta-analysis performed as well as mean difference methods. J Clin Epidemiol. 2011;64:556–64.

    Article  PubMed  Google Scholar 

  22. Thorlund K, Walter SD, Johnston BC, Furukawa TA, Guyatt GH. Pooling health-related quality of life outcomes in meta-analysis—a tutorial and review of methods for enhancing interpretability. Res Synth Methods. 2011;2:188–203.

  23. Review Manager (RevMan) version: 7.5.0. The Cochrane Collaboration. 2024. Available at revman.cochrane.org.

  24. Morris SB. Estimating Effect Sizes From Pretest-Posttest-Control Group Designs. Organ Res Methods. 2008;11:364–86.

    Article  Google Scholar 

  25. Suurmond R, Van Rhee H, Hak T. Introduction, comparison, and validation of Meta-Essentials: A free and simple tool for meta-analysis. Res Synth Methods. 2017;8:537–53.

    Article  PubMed  PubMed Central  Google Scholar 

  26. Deeks J, Higgins J, Altman D. Chapter 10.10 Heterogeneity. In: Higgins JPT, Thomas J, Chandler J, Cumpston M, Li T, Page MJ, Welch VA (editors). Cochrane Handbook for Systematic Reviews of Interventions version 6.4 (updated August 2023). Cochrane, 2023.

  27. Higgins JPT, Thompson SG, Spiegelhalter DJ. A re-evaluation of random-effects meta-analysis. J R Stat Soc Ser A Stat Soc. 2009;172:137–59.

    Article  PubMed  PubMed Central  Google Scholar 

  28. Riley RD, Higgins JPT, Deeks JJ. Interpretation of random effects meta-analyses. BMJ. 2011;342:d549–d549.

  29. Deeks J, Higgins J, Altman D. Chapter 10: Analysing data and undertaking meta-analyses. In: Higgins JPT, Thomas J, Chandler J, Cumpston M, Li T, Page MJ, Welch VA (editors). Cochrane Handbook for Systematic Reviews of Interventions version 6.4 (updated August 2023). Cochrane, 2023.

  30. Quintana DS. A guide for calculating study-level statistical power for meta-analyses. Adv Methods Pract Psychol Sci. 2023;6:25152459221147260.

    Article  Google Scholar 

  31. Page M, Higgins J, Sterne J. Chapter 13: Assessing risk of bias due to missing results in a synthesis. In: Higgins JPT, Thomas J, Chandler J, Cumpston M, Li T, Page MJ, Welch VA (editors). Cochrane Handbook for Systematic Reviews of Interventions version 6.4 (updated August 2023). Cochrane, 2023.

  32. Peters JL, Sutton AJ, Jones DR, Abrams KR, Rushton L. Contour-enhanced meta-analysis funnel plots help distinguish publication bias from other causes of asymmetry. J Clin Epidemiol. 2008;61:991–6.

    Article  PubMed  Google Scholar 

  33. Egger M, Davey Smith G, Schneider M, Minder C. Bias in meta-analysis detected by a simple, graphical test. BMJ. 1997;315:629–34.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  34. Duval S, Tweedie R. A nonparametric, “Trim and Fill” method of accounting for publication bias in meta-analysis. J Am Stat Assoc. 2000;95:89–98.

    Google Scholar 

  35. Deeks J, Higgins J, Altman D. Chapter 10.11.3 Undertaking subgroup analyses. In: Higgins JPT, Thomas J, Chandler J, Cumpston M, Li T, Page MJ, Welch VA (editors). Cochrane Handbook for Systematic Reviews of Interventions version 6.4 (updated August 2023). Cochrane, 2023.

  36. Deeks J, Higgins J, Altman D. Chapter 10.14 Sensitivity analyses. In: Higgins JPT, Thomas J, Chandler J, Cumpston M, Li T, Page MJ, Welch VA (editors). Cochrane Handbook for Systematic Reviews of Interventions version 6.4 (updated August 2023). Cochrane, 2023.

  37. Schünemann H, Brożek J, Guyatt G, Oxman A. GRADE handbook for grading quality of evidence and strength of recommendations. Updated October 2013. The GRADE Working Group; 2013.

  38. Brant R. Sample size calculator. Inference for Means: Comparing Two Independent Samples.

  39. Scoon GSM, Hopkins WG, Mayhew S, Cotter JD. Effect of post-exercise sauna bathing on the endurance performance of competitive male runners. J Sci Med Sport. 2007;10:259–62.

    Article  PubMed  Google Scholar 

  40. Vaile J, Halson S, Gill N, Dawson B. Effect of hydrotherapy on recovery from fatigue. Int J Sports Med. 2008;29:539–44.

    Article  CAS  PubMed  Google Scholar 

  41. Zurawlew MJ, Walsh NP, Fortes MB, Potter C. Post-exercise hot water immersion induces heat acclimation and improves endurance exercise performance in the heat. Scand J Med Sci Sports. 2016;26:745–54.

    Article  CAS  PubMed  Google Scholar 

  42. Stevens CJ, Ross MLR, Carr AJ, Vallance B, Best R, Urwin C, et al. Postexercise hot-water immersion does not further enhance heat adaptation or performance in endurance athletes training in a hot environment. Int J Sports Physiol Perform. 2021;16:480–8.

    Article  PubMed  Google Scholar 

  43. Lundby C, Svendsen IS, Urianstad T, Hansen J, Rønnestad BR. Training wearing thermal clothing and training in hot ambient conditions are equally effective methods of heat acclimation. J Sci Med Sport. 2021;24:763–7.

    Article  PubMed  Google Scholar 

  44. Bartolomé I, Siquier-Coll J, Pérez-Quintero M, Robles-Gil MC, Grijota FJ, Muñoz D, et al. 3-Week passive acclimation to extreme environmental heat (100± 3 °C) in dry sauna increases physical and physiological performance among young semi-professional football players. J Therm Biol. 2021;100:103048.

    Article  PubMed  Google Scholar 

  45. McIntyre RD, Zurawlew MJ, Oliver SJ, Cox AT, Mee JA, Walsh NP. A comparison of heat acclimation by post-exercise hot water immersion and exercise in the heat. J Sci Med Sport. 2021;24:729–34.

    Article  PubMed  Google Scholar 

  46. McIntyre RD, Zurawlew MJ, Mee JA, Walsh NP, Oliver SJ. A comparison of medium-term heat acclimation by post-exercise hot water immersion or exercise in the heat: adaptations, overreaching, and thyroid hormones. Am J Physiol Regul Integr Comp Physiol. 2022;323:R601–15.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  47. Kirby NV, Lucas SJE, Cable TG, Armstrong OJ, Weaver SR, Lucas RAI. Sex differences in adaptation to intermittent post-exercise sauna bathing in trained middle-distance runners. Sports Med - Open. 2021;7:51.

    Article  PubMed  PubMed Central  Google Scholar 

  48. Kirby NV, Lucas SJE, Armstrong OJ, Weaver SR, Lucas RAI. Intermittent post-exercise sauna bathing improves markers of exercise capacity in hot and temperate conditions in trained middle-distance runners. Eur J Appl Physiol. 2021;121:621–35.

    Article  CAS  PubMed  Google Scholar 

  49. Higgins J, Li T, Deeks J. Chapter 6.5.2.2: Choosing effect measures and computing estimates of effect. In: Higgins JPT, Thomas J, Chandler J, Cumpston M, Li T, Page MJ, Welch VA (editors). Cochrane Handbook for Systematic Reviews of Interventions version 6.4 (updated August 2023). Cochrane, 2023.

  50. Solomon TP, Laye MJ. Open Science Framework - Data registry: A systematic review of the effect of passive heat acclimation on exercise performance. 2024. https://doiorg.publicaciones.saludcastillayleon.es/10.17605/OSF.IO/6FGC2.

  51. Kissling LS, Akerman AP, Campbell HA, Prout JR, Gibbons TD, Thomas KN, et al. A crossover control study of three methods of heat acclimation on the magnitude and kinetics of adaptation. Exp Physiol. 2022;107:337–49.

    Article  PubMed  Google Scholar 

  52. Ashworth E, Cotter J, Kilding A. Post-exercise, passive heat acclimation with sauna or hot-water immersion provide comparable adaptations to performance in the heat in a military context. Ergonomics. 2023;66:49–60.

    Article  PubMed  Google Scholar 

  53. Pokora I, Sadowska-Krępa E, Wolowski Ł, Wyderka P, Michnik A, Drzazga Z. The effect of medium-term sauna-based heat acclimation (MPHA) on thermophysiological and plasma volume responses to exercise performed under temperate conditions in elite cross-country skiers. Int J Environ Res Public Health. 2021;18:6906.

    Article  PubMed  PubMed Central  Google Scholar 

  54. Zurawlew MJ, Mee JA, Walsh NP. Post-exercise Hot Water Immersion Elicits Heat Acclimation Adaptations in Endurance Trained and Recreationally Active Individuals. Front Physiol. 2018;9:1824.

    Article  PubMed  PubMed Central  Google Scholar 

  55. Zurawlew MJ, Mee JA, Walsh NP. Post-exercise hot water immersion elicits heat acclimation adaptations that are retained for at least two weeks. Front Physiol. 2019;10:1080.

    Article  PubMed  PubMed Central  Google Scholar 

  56. Barry H, Gendron P, Gagnon C, Bherer L, Gagnon D. Passive heat acclimation does not modulate processing speed and executive functions during cognitive tasks performed at fixed levels of thermal strain. Appl Physiol Nutr Metab Physiol Appl Nutr Metab. 2022;47:261–8.

    Article  CAS  Google Scholar 

  57. Ko Y, Kang J, Seol S-H, Lee J-Y. Effectiveness of skin-heating using a water-perfused suit as passive and post-exercise heat acclimation strategies. J Therm Biol. 2020;93:102703.

    Article  PubMed  Google Scholar 

  58. Vesic Z, Jakovljevic V, Nikolic Turnic T, Vukasinovic-Vesic M, Bolevich S, Radakovic S. The influence of acclimatization on stress hormone concentration in serum during heat stress. Mol Cell Biochem. 2021;476:3229–39.

    Article  CAS  PubMed  Google Scholar 

  59. Karolkiewicz J, Nieman DC, Cisoń T, Szurkowska J, Gałęcka M, Sitkowski D, et al. No effects of a 4-week post-exercise sauna bathing on targeted gut microbiota and intestinal barrier function, and hsCRP in healthy men: a pilot randomized controlled trial. BMC Sports Sci Med Rehabil. 2022;14:107.

    Article  PubMed  PubMed Central  Google Scholar 

  60. Currell K, Jeukendrup AE. Validity, reliability and sensitivity of measures of sporting performance. Sports Med. 2008;38:297–316.

    Article  PubMed  Google Scholar 

  61. Kirby NV, Lucas SJE, Cable TG, Armstrong OJ, Weaver SR, Lucas RAI. Additional file 1 of Sex differences in adaptation to intermittent post-exercise sauna bathing in trained middle-distance runners. Available at https://doiorg.publicaciones.saludcastillayleon.es/10.6084/m9.figshare.15047670.v1. 2021. https://doiorg.publicaciones.saludcastillayleon.es/10.6084/M9.FIGSHARE.15047670.V1.

  62. Travers G, González-Alonso J, Riding N, Nichols D, Shaw A, Périard JD. Exercise heat acclimation with dehydration does not affect vascular and cardiac volumes or systemic hemodynamics during endurance exercise. Front Physiol. 2021;12:740121.

    Article  PubMed  PubMed Central  Google Scholar 

  63. Haroutounian A, Amorim FT, Astorino TA, Khodiguian N, Curtiss KM, Matthews ARD, et al. Change in exercise performance and markers of acute kidney injury following heat acclimation with permissive dehydration. Nutrients. 2021;13:841.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  64. Sekiguchi Y, Filep EM, Benjamin CL, Casa DJ, DiStefano LJ. Does dehydration affect the adaptations of plasma volume, heart rate, internal body temperature, and sweat rate during the induction phase of heat acclimation? J Sport Rehabil. 2020;29:847–50.

    Article  PubMed  Google Scholar 

  65. Schleh MW, Ruby BC, Dumke CL. Short term heat acclimation reduces heat stress, but is not augmented by dehydration. J Therm Biol. 2018;78:227–34.

    Article  PubMed  Google Scholar 

  66. Neal RA, Massey HC, Tipton MJ, Young JS, Corbett J. Effect of permissive dehydration on induction and decay of heat acclimation, and temperate exercise performance. Front Physiol. 2016;7:564.

    Article  PubMed  PubMed Central  Google Scholar 

  67. Travers G, Nichols D, Riding N, González-Alonso J, Périard JD. Heat acclimation with controlled heart rate: influence of hydration status. Med Sci Sports Exerc. 2020;52:1815–24.

    Article  CAS  PubMed  Google Scholar 

  68. Pethick WA, Murray HJ, McFadyen P, Brodie R, Gaul CA, Stellingwerff T. Effects of hydration status during heat acclimation on plasma volume and performance. Scand J Med Sci Sports. 2019;29:189–99.

    Article  PubMed  Google Scholar 

  69. Garrett AT, Goosens NG, Rehrer NJ, Patterson MJ, Harrison J, Sammut I, et al. Short-term heat acclimation is effective and may be enhanced rather than impaired by dehydration. Am J Hum Biol Off J Hum Biol Counc. 2014;26:311–20.

    Article  CAS  Google Scholar 

Download references

Acknowledgements

The authors thank Janice Thompson (Professor of Public Health Nutrition, University of Birmingham; J.Thompson.1@bham.ac.uk) for peer-reviewing the search strategy.

Funding

No funding was received for this work.

Author information

Authors and Affiliations

Authors

Contributions

TS and ML contributed equally to conceptualising the idea, the study design, methodology, data extraction, data analysis, data presentation, and the writing and editing of the manuscript. Both authors approved the final version before submission. TS curated the data and takes responsibility for its integrity.

Corresponding author

Correspondence to Matthew J. Laye.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

TS has given invited talks at societal conferences and university/pharmaceutical symposia for which the organisers paid for travel and accommodation; he has also received research money from publicly funded national research councils and medical charities, and private companies, including Novo Nordisk Foundation, AstraZeneca, Amylin, AP Møller Foundation, and Augustinus Foundation; and, he has consulted for Boost Treadmills, GU Energy, and Examine.com, and owns a consulting business, Blazon Scientific, and an endurance athlete education business, Veohtu. These companies have had no control over the research design, data analysis, or publication outcomes of this work. ML has given invited talks at societal conferences and university symposia and meetings for which the organisers paid for travel and accommodation; he has received research money from Augustinus Foundation, American College of Sports Medicine, and national research institutions; and, he has consulted for Zepp Health, Levels Health, GU Energy, and EAB labs, and has coached for Sharman Ultra Coaching. These companies have had no control over the research design, data analysis, or publication outcomes of this work.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Solomon, T.P.J., Laye, M.J. The effect of post-exercise heat exposure (passive heat acclimation) on endurance exercise performance: a systematic review and meta-analysis. BMC Sports Sci Med Rehabil 17, 4 (2025). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s13102-024-01038-6

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s13102-024-01038-6

Keywords