- Systematic review
How big is the effect of spinal manipulation on the pressure pain threshold and for how long does it last? – secondary analysis of data from a systematic review
Chiropractic & Manual Therapiesvolume 27, Article number: 22 (2019)
Spinal manipulation (SM) has been shown in a systematic review to have a statistically significant effect on the pressure pain threshold (PPT) in asymptomatic subjects, when SM is compared to a sham intervention. The magnitude and duration of this effect is unclear.
To determine the effect-size of SM in asymptomatic subjects and its duration.
This is a secondary analysis of data from a previous review. We sought to compare the effect-sizes in the various articles but had to calculate them ourselves, at different follow-up time measurements. Effect-sizes (Cohen’s d or Hedge’s g coefficient) were considered low, medium, and large, at the cut points of 0.2, 0.5, and 0.8, respectively.
Effect-sizes were reported in 6/8 studies, but all had calculated ‘within-group’ changes, not ‘between-group’ differences. Immediately after SM, only one study of four (with four measurements) had a statistically significant ‘medium’ effect size (d = 0.56; 95% CI: 00.4–1.08 to d = 0.70; 95% CI:0.18–1.22). Five minutes after SM, 4/5 studies found a statistically significant ‘medium to large’ effect-size (d = 0.51; 95% CI: 0.04–0.98 to d = 1.24; 95% CI: 0.28–2.20). Ten minutes after SM, two studies reported a ‘medium’ effect-size with statistical significance (d = 0.58; 95% CI: 0.11–1.05 to d = 0.80; 95% CI: 0.12–1.48). We drew no conclusions for the effect-sizes at one minute and thirty minutes after SM, as no between-group statistical difference was found.
Authors need to revise their approach to ‘effect size’. Our calculations showed that the effect-size of SM on PPT may go from ‘medium’ to ‘large’ within the first five minutes but appears to diminish again within ten minutes. Research of this type should collect information for longer periods and compare results to other interventions to put results into perspective.
Une revue systématique a montré que la manipulation vertébrale (MV) avait un effet statistiquement significatif sur le seuil de douleur à la pression (SDP) chez les sujets asymptomatiques, lorsque la MV était comparée à une manœuvre placébo. Cependant, l’ampleur et la durée de cet effet n’ont pas été clairement rapportées.
Déterminer la taille d’effet de la MV chez les sujets asymptomatiques et sa durée.
Il s’agit d’une deuxième analyse des données de notre précédente revue systématique de la littérature. Nous avons cherché à comparer les tailles d’effet dans les différents articles, mais nous avons dû nous résoudre à les calculer, à différents temps de mesures. La taille d’effet rapportée (comme les coefficients d de Cohen ou g de Hedge) était considérée comme faible, moyenne et grande, aux seuils respectifs de 0,2, 0,5 et 0,8.
Nous avons constaté que 6 études sur 8 montraient le calcul de la taille d’effet, mais que toutes étaient fondées sur des changements « intra-groupe », et non sur des différences « intergroupes ». Immédiatement après la MV, seule une étude sur quatre (avec quatre mesures) présentait une taille d’effet « moyenne » statistiquement significative (d = 0,56 [IC à 95%: 00,4 à 1,08] à d = 0,70 [IC à 95%: 0,18 à 1,22]). Cinq minutes après la MV, quatre études sur cinq ont révélé une taille d’effet « moyenne à grande » statistiquement significative (d = 0,51 [IC à 95%: 0,04–0,98] à d = 1,24 [IC à 95%: 0,28-2,20]). Dix minutes après la MV, deux études ont indiqué une taille d’effet « moyenne » statistiquement significative (d = 0,58 [IC à 95%: 0,11-1,05] à d = 0,80 [IC à 95%: 0,12-1,48]). Aucune conclusion n’a été tirée des résultats une minute et trente minutes après la MV car la différence entre les groupes n’était pas statistiquement significative.
Les auteurs doivent revoir leur approche de la « taille d’effet ». Nos calculs ont montré que l’effet de la manipulation vertébrale sur le seuil de la douleur à la pression peut passer de « moyen » à « important » dans les cinq premières minutes mais semble diminuer à nouveau dans les dix minutes. Les recherches de ce type devraient collecter des informations sur des périodes plus longues et comparer les résultats à ceux d’autres interventions afin de les mettre en perspective.
Taille d’effet; durée; manipulation vertébrale; seuil de douleur à la pression; sujets asymptomatiques.
The effect of treatment
In order to study the validity of a treatment, it is necessary to compare its outcome with the outcome of another treatment. However, in order to test the specific effect of this treatment, it would be necessary to compare it with a well masked placebo.
Measuring the effect
The effect of a treatment is, in fact, the difference between the differences in outcome in these two interventions, i.e. (i) the follow-up measurement of the treatment group minus the baseline measurement is estimated and also (ii) the follow-up measurement of the placebo group minus the baseline measurement. Thereafter, (iii) the difference between these two differences is subjected to a test for statistical significance. However, if the baseline measurements in the two groups is more or less identical, then there is no need to take the baseline measurements into account, and it would be sufficient to test only if the difference between the outcome measurements is statistically significant.
Statistical significance vs. clinical significance
However, the statistical significance (i.e p-value) is not an indication of the clinical significance. The clinical significance can be judged by comparing the estimates of the treatment and placebo groups and by calculating, for example, the numbers needed to treat, in clinical studies. In experimental studies, the clinical significance would often be assessed by calculating what is called an ‘effect size’.
To show how big an effect of a particular treatment might be, one could proceed to calculate the ‘difference of differences’, as explained above. However, it would be difficult to compare studies that used different methods and units. Therefore, an index is often used, such as the Cohen’s d or Hedge’s g coefficients.
To relate the effect size to the concept of clinical significance, Cohen created a scale that can be used for either the d or the g coefficients. According to this scale, an effect size of 0.2 represents an overlap between the compared populations in their distributions of means of about 85%, which Aron  suggests could be compared to a difference of height between 15 and 16 years old girls, thus considered ‘small’. An effect size of 0.5 represents an overlap of about 67%, which, in the same way, could be interpreted as a difference of height between 14 and 18 years old girls, thus considered ‘medium’. An effect size of 0.8 (or above) is an overlap of 53% (or less) and represents a difference of height between 13 and 18 years old girls, thus considered ‘large’. As another example, Cohen suggests that the difference in IQ between holders of a PhD degree and a ‘typical college freshman’ is comparable to an effect size of 0.8 .
Calculation of the effect size
The Cohen’s d coefficient is commonly calculated by the subtraction of the mean of the experimental group from the mean of the control group, and the division of this result by the standard deviation. However, this equation should be used under specific conditions, which are described extensively in the literature albeit without a clear consensus. Therefore, the effect size can be calculated in different ways and unless it is clearly described, there is room for errors and confusion. (Please, see Additional file 1 for more details).
The regional effect of spinal manipulation on experimentally induced pain in asymptomatic subjects
Spinal manipulation (SM) is used for its beneficial effect on musculoskeletal pain. It is reported to very quickly soothe musculoskeletal pain in some patients  but its mechanisms are not yet well understood. Nevertheless, according to a previous systematic review, SM has been reported in some studies to have a hypoalgesic effect in asymptomatic subjects exposed to experimentally induced pain, such as increasing the pressure pain threshold (PPT) . However, not all studies used proper sham-controlled studies to control for the placebo effect.
Pressure pain threshold
Pressure pain threshold (PPT) is a type of quantitative sensory testing that can be used to understand the somato-sensory profiles of people in pain , but also in asymptomatic subjects . It is defined as the minimal pressure which provokes a pain or a discomfort . A reported increase of PPT values on subjects after a treatment would suggest an hypoalgesic action of the SM, whereas a reported decrease of PPT values would suggest hyperalgesia.
Spinal manipulation and its regional effect on the pressure pain threshold
To be able to establish whether SM truly has a hypoalgesic effect, we performed a systematic review in which we separated out the sham-controlled studies, as reported elsewhere . A description of these studies is found in Additional file 2.
Thus, we found eight randomized controlled trials of good and medium quality. They investigated the regional effect of SM, compared to a sham procedure in asymptomatic subjects. As previously reported (Additional file 2), five out of these eight studies found that SM had a statistically significant effect on the PPT in these asymptomatic subjects.
However, the effect size and the duration of this effect need also to be investigated, to conclude whether this reaction is also clinically relevant.
The research objectives
To the best of our knowledge, the i) effect size and (ii) this effect size over time, for the PPT in asymptomatic subjects after a spinal manipulation, compared to a sham procedure, have never been reported in a systematic review. Therefore, we returned to the articles in our previous review  to report on these values.
This work consists of a secondary analysis of data from our previous systematic literature review, using data from eight randomized controlled trials that reported the regional effect of spinal manipulation on PPT in asymptomatic subjects compared to a sham procedure .
Search strategy and extraction of data
The search strategy and extraction of data for the original review have been extensively reported (Additional file 3). The flow chart for the screening process presented in the previous review has been included in this report for information. (Please, see Additional file 4).
For the present review, a descriptive and a quality checklist were created (Tables 1-2) to fit our new objectives. The quality score for the research method in general as reported in the previous article was included in the descriptive checklist for information (Table 1).
The present quality checklist (Table 2) is based on the various recommendations from the creator of the original effect size index  and its coeval authors , supported by more recent texts on the same subject (; ). This consists of information on:
- whether the between-group effect size was provided ,
- whether the formula for calculation was provided  or if, at least, an exact reference was provided (document and page, not just the name of a textbook), the reporting of the number of study participants, the exact mean values and standard deviations necessary to calculate the effect size , and the reporting of 95% confidence intervals (95% CI) . If all this information was available, it would be possible to calculate the effect size.
Where effect size had not been calculated, it was our intention to do so, with the formulae provided in the Additional file 1 and verifying all calculations by a blinded third person. The information on the effect size at each time of measurement (provided by the authors or calculated by us) was collected in a table (Table 3) and illustrated in Fig. 1. We chose to report, when needed, the data concerning what happened on the “dominant side” , as we cared only for the regional effect.
Effect sizes were calculated using Eqs. 1 to 7 (as described in Additional file 1). The effect sizes and their 95% CI were calculated with Microsoft® Excel, version 16.17 (180909). Statistical significance of the effect size was defined as when the 95% CI does not include ‘zero’ . The effect size was defined as small (d < 0.5), medium (0.5 < d < 0.8) or large (d > 0.8) .
General description of the studies and their reported effect size
These studies have been extensively described in our previous review , with a general description available in Additional file 2. Briefly, the eight studies included in our new analyses provided information on the PPT at different follow-up times; four studies immediately after, one study one minute after, five studies five minutes after, two studies ten minutes after, one study fifteen and thirty minutes after the interventions. The quality of the eight studies was established in the previous review to range from ‘medium’ to ‘good’ (Table 1).
In the present review, no additional quality score for the effect size was given, as no definitive consensus can be found on this subject. However, we noted that no study reported a between-group effect size. Instead, they reported ‘effect sizes’ of intra-group differences, i.e. in fact, the ‘outcome sizes’. Further, two studies failed to report any effect size at all (Table 1). In addition, no study reported the exact formula they used, and only one provided a ‘precise’ reference (Table 2). No additional descriptive information on the ‘effect size’ was given (95% CI or SD (d)). In sum, the reported ‘effect sizes’ were not real effect sizes, not transparent and possibly not comparable.
Fortunately, all studies had exploitable information, with reported numbers of study participants in each group, exact mean values and exact standard deviations, and only one study provided values only in a figure, which made it possible to retrieve information although it, for this reason, lacked precision. Thus, all authors had provided sufficient information to make it possible for us to calculate their between-group effect sizes (Table 3).
Our calculations of effect sizes at each follow-up time
Effect size immediately after spinal manipulation
Four studies had effect sizes immediately after SM, ranging from small [15,16,17] to medium  but only one study (with four different measurements) found these to be statistically significant , ranging from d = 0.56 (95% CI: 0.04–1.08) to d = 0.70 (95% CI: 0.18–1.22). In sum, the immediate effect would be considered as of medium size.
Effect size one minute after spinal manipulation
One study  was found to have a non-significant and small effect size one minute after SM (d = 0.42, 95% CI: - 0.24-1.08) We drew no conclusion for this time interval.
Effect size five minutes after spinal manipulation
Five studies provided information on effect size five minutes after SM. One study  had a small non-significant effect size (g = 0.17, 95% CI: -0.34-0.68) whereas one  was classified as medium (d = 0.51, 95% CI: 0.04–0.98) and three as large [13, 19, 21] (from d = 0.93, 95% CI: 0.24–1.08 to d = 1.24, 95% CI: 0.28–2.20) and all statistically significant. In general, the effect at this time can therefore be considered mainly large.
Effect size ten minutes after spinal manipulation
Two studies had data that could be transformed into ten minutes effect sizes,  defined as medium (d = 0.58, 95% CI: 0.11–1.05) and the other  as borderline large (d = 0.80, 95% CI: 0.12–1.48), both statistically significant. This effect at ten minutes is therefore considered medium.
Effect size fifteen minutes after spinal manipulation
One study  found a non-significant, medium effect size fifteen minutes after SM (d = 0.59, 95% CI: - 0.08-1.26). No conclusion was drawn on this result.
Effect size thirty minutes after spinal manipulation
One study  had a non-significant, small effect size thirty minutes after SM (g = 0.03; 95% CI: - 0.48-0.54). No conclusion was drawn on this, but it is likely that the effect is no more present.
The results are illustrated in Fig. 1.
In this additional analysis of data from a previous systematic review, we were confused by the reported effect sizes. A systematic approach revealed that no study reported between-group size differences, instead using the within-group differences, when reported at all. Further, none provided details on how this ‘effect size’ had been calculated. Therefore, we used information provided in the reviewed articles to produce our own estimates.
According to our own calculations obtained from data available in the eight reviewed studies, the estimated effect size of spinal manipulation on the PPT in asymptomatic subjects is ‘medium’ immediately after the intervention (T0), ‘mainly large’ five minutes after (T + 5) and ‘mainly medium’ ten minutes after the intervention (T + 10). No certain estimation of the effect size can be reported beyond T + 10, but it may be small after 30 min.
Using the examples provided in the introduction [1, 2] to explain the clinical importance of the different effect sizes, the ‘medium’ effect size immediately after SM would thus correspond to a difference in height between 14 and 18 years old girls. The ‘large’ effect size five minutes after SM would correspond to the difference in IQ between holders of a PhD degree and a ‘typical college freshman’. The ‘medium’ effect size ten minutes after the intervention would, again, correspond to a difference in height between 14 and 18 years old girls.
The effect of SM on the PPT in asymptomatic subjects is therefore reported to be a reasonably large but probably short-lasting phenomenon. Whether these changes can also be ‘appreciated’ by study subjects, in such a way that they can differentiate between a small, medium and large effect size, is not known. Nevertheless, it serves as a comparator with other interventions in the same domain. For example, it could be used to compare the effect over time or effects of different types of interventions.
Methodological considerations of our own review
A description of the studies is found in the Table 1. Our quality checklist was established according to the various recommendations in the literature, including those provided by the creator of the Cohen’s d coefficient. There is no definitive consensus on the calculation of the effect size nor on how to assess its quality, so we did not judge the quality of work in the reviewed articles but used our systematic approach to obtain a general understanding of the various effect size values (Table 2). This can be done online to obtain the effect size with its SD (d) and 95% CI, with A Practical Meta-Analysis Effect Size Calculator . A blinded third researcher verified the calculations. Obviously, this approach assumes that the groups that are being compared are fairly similar at base-line. We did not investigate if this was the case.
Other methods of reporting the treatment effect
Depending on the type of data (continuous or categorial), treatment effect can be reported in other ways than with Cohen’s d. Relative Risk, Odds Ratio, Number Needed to Treat (NNT) and Area Under the Curve are other possibilities .
Recommendations regarding effect size reporting
This additional analysis of data from our previous review on the effect of spinal manipulation on the pressure pain threshold in asymptomatic subjects revealed that all reviewed studies that reported an effect size used the within-group rather than the between-group differences. However, the within-group effect size is a purely descriptive outcome, interesting perhaps to understand the full picture of an effect, but it should never be provided alone. Therefore, the between-group calculations need to be calculated properly and in a transparent manner, to ensure that they are correct and comparable to other reports. We provide some information on how to do this in our Additional file 1, and we also provide references for these calculations.
The effect of spinal manipulation on the pressure pain threshold in asymptomatic subjects, as calculated by us, is ‘medium’ immediately after the intervention, has increased to mainly ‘large’ five minutes after and descended to mainly ‘medium’ ten minutes after intervention. The potential effect should be investigated over a longer period of time, and for other comparable interventions, to confirm if this effect is indeed only short lasting and to put it into a clinical perspective.
High Velocity Low Amplitude
Pressure Pain Threshold
Aron. Statistics for psychology. 2nd ed: Prentice Hall; 1999. p. 229–30.
Coe Robert. It’s the effect size, stupid. What effect size is and why it is important. Annual conference of the British Research Association, University of Exeter England, 12–14. September 2002.
Malmqvist S, Leboeuf-Yde C, Ahola T, Andersson O, Ekstrom K, Pekkarinen H, Turpeinen M, Wedderkopp N. The Nordic back pain subpopulation program: predicting outcome among chiropractic patients in Finland. Chiropractic & osteopathy. 2008;16:13.
Millan M, Leboeuf-Yde C, Budgell B, Amorim MA. The effect of spinal manipulative therapy on experimentally induced pain: a systematic literature review. Chiropractic & manual therapies. 2012;20(1):26.
Hübscher M, Moloney N, Leaver A, Rebbeck T, McAuley JH, Refshauge KM. Relationship between quantitative sensory testing and pain or disability in people with spinal pain-a systematic review and meta-analysis. Pain. 2013;154(9):1497–504.
Waller R, Smith AJ, O'Sullivan PB, Slater H, Sterling M, McVeigh JA, Straker LM. Pressure and cold pain threshold reference values in a large, young adult, pain-free population. Scand J Pain. 2016 Oct;13:114–22.
Laura Frank PM, Vaughan B. The repeatability of pressure algometry in asymptomatic individuals over consecutive days. International Journal of Osteopathic Medicine. 2013;16(3):143–52.
Honoré M, Leboeuf-Yde C, Gagey O. The regional effect of spinal manipulation on the pressure pain threshold in asymptomatic subjects: a systematic literature review. Chiropractic & Manual Therapies. 2018;26(1). https://doi.org/10.1186/s12998-018-0181-3.
Cohen J. The earth is round (p < .05). Am Psychol. 1994;49(12):997–1003.
Ray JW, Shadish WR. How interchangeable are different estimators of effect size? J Consult Clin Psychol. 1996;64(6):1316–25. https://doi.org/10.1037/0022-006x.64.6.1316.
Cuijpers P, Weitz E, Cristea IA, Twisk J. Pre-post effect sizes should be avoided in meta-analyses. Epidemiology and Psychiatric Sciences. 2016;26(04):364–8. https://doi.org/10.1017/s2045796016000809.
Baguley T. Standardized or simple effect size: what should be reported? Br J Psychol. 2009;100(3):603–17. https://doi.org/10.1348/000712608x377117.
Fernandez-de-Las-Penas C, Alonso-Blanco C, Cleland JA, Rodriguez-Blanco C, Alburquerque-Sendin F. Changes in pressure pain thresholds over C5-C6 zygapophyseal joint after a cervicothoracic junction manipulation in healthy subjects. J Manip Physiol Ther. 2008;31(5):332–7.
Dong Kyu Lee, 2016 Alternatives to p value : confidence interval and effect size. Korean Journal of Anesthesiology 2016;69(6):555–562. Published online: October 25, 2016 DOI: https://doi.org/10.4097/kjae.2016.69.6.555
Ruiz-Saez M, Fernandez-de-las-Penas C, Blanco CR, Martinez-Segura R, Garcia-Leon R. Changes in pressure pain sensitivity in latent myofascial trigger points in the upper trapezius muscle after a cervical spine manipulation in pain-free subjects. J Manip Physiol Ther. 2007;30(8):578–83.
Thomson O, Haig L, Mansfield H. The effects of high-velocity low-amplitude thrust manipulation and mobilisation techniques on pressure pain threshold in the lumbar spine. Int J Osteopath Med. 2009;12(2):56–62. https://doi.org/10.1016/j.ijosm.2008.07.003.
Fryer G, Mclver JCS. he effect of manipulation and mobilisation on pressure pain thresholds in the thoracic spine. Journal of Osteopathic Medicine. 2004.
Yu X, Wang X, Zhang J, Wang Y. Changes in pressure pain thresholds and basal electromyographic activity after instrument-assisted spinal manipulative therapy in asymptomatic participants: a randomized, controlled trial. J Manip Physiol Ther. 2012;35(6):437–45.
Srbely JZ, Vernon H, Lee D, Polgar M. Immediate effects of spinal manipulative therapy on regional antinociceptive effects in myofascial tissues in healthy young adults. J Manip Physiol Ther. 2013;36(6):333–41.
Luke Hamilton CB, Fryer G. The effects of high-velocity, low-amplitude manipulation and muscle energy technique on suboccipital tenderness. International Journal of Osteopathic Medicine. 2007.
Fernandez-de-las-Penas C, Perez-de-Heredia M, Brea-Rivero M, Miangolarra-Page JC. Immediate effects on pressure pain threshold following a single cervical spine manipulation in healthy subjects. The Journal of orthopaedic and sports physical therapy. 2007;37(6):325–9.
David B. Wilson, Ph.D., George Mason University. Practical Meta-Analysis Effect Size Calculator http://www.campbellcollaboration.org/escalc/html/EffectSizeCalculator-SMD1.php Accessed on Sept 2018.
McGough JJ, Faraone SV. Estimating the size of treatment effects: moving beyond p values. Psychiatry (Edgmont). 2009 Oct;6(10):21–9.
Ellis, P.D. (2009). “Effect size equations,” website: [http://www.polyu.edu.hk/mm/effectsizefaqs/effect_size_equations2.html] accessed on [September 2018].
Cohen. Statistical power analysis for the behavioral sciences. 2nd edition. 1988. Page 43.
Hedges LV. Distribution theory for Glass’s estimator of effect size and related estimators. Journal of Educational Statistics. 1981;6(2):106–28 Page 110.
Hedge LV, Olkin. Statistical methods for meta-analysis. Orlando: Academic Press Inc; 2014. p. 86.
The authors would like to acknowledge Michel Debarle, M.Sc. for editorial assistance.
No external funding was provided.
Availability of data and materials
The articles used for the current study are available from the corresponding author on reasonable request.
Ethics approval and consent to participate
Consent for publication
The authors report that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
General description of data extracted from the abstracts of eight randomized controlled trials on the regional effect of spinal manipulation on the pressure pain threshold in asymptomatic subjects. (DOCX 20 kb)
Search equation, inclusion and exclusion criterion of the previous review. (DOCX 14 kb)
PRISMA flow-chart of the previous review. (DOCX 88 kb)