Systematic review of clinical trials of cervical manipulation: control group procedures and pain outcomes
© Vernon et al; licensee BioMed Central Ltd. 2011
Received: 23 August 2010
Accepted: 11 January 2011
Published: 11 January 2011
To characterize the types of control procedures used in controlled clinical trials of cervical spine manipulation and to evaluate the outcomes obtained by subjects in control groups so as to improve the quality of future clinical trials
A search of relevant clinical trials was performed in PubMed 1966-May 2010 with the following key words: "Chiropractic"[Mesh] OR "Manipulation, Spinal"[Mesh]) AND "Clinical Trial "[Publication Type]. Reference lists from these trials were searched for any additional trials. The reference lists of two prior studies, one review and one original study were also searched. Accepted reports were then rated for quality by 2 reviewers using the PEDro scale. Studies achieving a score of >50% were included for data extraction and analysis. Intra-group change scores on pain outcomes were obtained. For determining clinically important outcomes, a threshold of 20% improvement was used where continuous data were available; otherwise, an effect size of 0.30 was employed
The PubMed search yielded 753 citations of which 13 were selected. Eight (8) other studies were identified by reviewing two systematic reviews and through reference searches. All studies scored >50% on the PEDro scale. There were 9 multi-session studies and 12 single-session studies. The most commonly used control procedure was "manual contact/no thrust". Four (4) studies used a placebo-control (patient blinded). For two of these studies with VAS data, the average change reported was 4.5 mm. For the other control procedures, variable results were obtained. No clinically important changes were reported in 57% of the paired comparisons, while, in 43% of these, changes which would be considered clinically important were obtained in the control groups. Only 15% of trials reported on post-intervention group registration.
Most control procedures in cervical manipulation trials result in small clinical changes, although larger changes are observed in 47% of paired comparisons. The vast majority of studies do not result in subject blinding; the effect of unmasking of control subjects in these studies makes the interpretation of the existing clinical trials challenging. The greatest majority of trials do not report on post-intervention blinding. A small number of candidate procedures for effective control interventions exist. Much more research is required to improve this important aspect of clinical trial methodology in cervical manipulation studies.
Clinical trials of spinal manipulation for neck pain have been published since the early 1980's. Numerous reviews of these trials have been published in the ensuing years [1–3]. The lack of a valid control group has been a consistent criticism of this body of studies [1–5]. In 2005, Vernon et al.  reported on a candidate manoeuvre for a cervical sham manipulation (sham cervical thrust using a "drop" headpiece). In a small group of neck pain patients, 60% mis-registered the sham manoeuvre as a "real treatment". In these subjects, no clinically important changes were obtained post-intervention in paraspinal pressure pain thresholds (R-PPT decreased by an average of 1.2%; L-PPT decreased by an average of 6%) as well as in cervical ranges of motion.
In that report, the literature on studies of manipulation with sham/placebo manoeuvres was briefly reviewed. Of note were the studies of Hawk and her colleagues [7–9] who identified numerous issues attendant with the development and use of sham manipulations. Their work was focused on the lumbar spine and low back pain patients. The review by Ernst and Harkness  was also mentioned as one of the works critical of the extant clinical trials in manipulation for neck pain.
Vernon et al.  conducted a systematic review of the outcome of control groups used in clinical trials of conservative treatments for chronic neck pain. These trials included primarily laser and acupuncture studies; no study of manual therapy was included. In this review, the mean [95% CI] effect size of change in pain ratings in the no-treatment control studies at outcome points up to 10 weeks was 0.18 [-0.05, 0.41] and for outcomes from 12-52 weeks it was 0.4 [0.12, 0.68]. In the placebo control groups it was 0.50 [0.10, 0.90] at up to 10 weeks and 0.33. [-1.97, 2.66] at 12-24 weeks. None of the comparisons between the no-treatment and placebo groups were statistically significant. It was concluded that changes in pain scores in subjects with chronic neck pain not due to whiplash who are enrolled in no-treatment and placebo control groups were similarly small and not significantly different. As well, they do not appear to increase over longer-term follow-up. The placebo and no-treatment control procedures in these trials appeared to be successful in inducing relatively little therapeutic benefit.
There has been no similar review of the control procedures and control group outcomes of trials of manipulation in the cervical spine for neck pain and headaches, although a review of control group outcomes in lumbar spine trials has recently been published . Such a review would assist clinicians and researchers in determining the validity of the existing evidence base as well as the applicability and generalizability of the control procedures which have been employed to date. It would also identify issues for consideration by future clinical trial groups.
A search for randomized controlled clinical trials was performed in PubMed 1966-May 2010 with the following key words: "Chiropractic"[Mesh] OR "Manipulation, Spinal"[Mesh]) AND "Clinical Trial "[Publication Type]. Reference lists from these selected trials were searched for any additional trials. The reference lists of two prior studies, one review  and one original study  were also reviewed. Finally, after reviewing the retrieval lists from these searches, the authors identified some additional trials from the general literature.
randomized clinical trial
cervical spinal manipulation was the index treatment (studies of thoracic manipulation were excluded)
- c)a control group was used in any of the following forms
non-blinded control treatment
no-treatment or waiting list control
the clinical complaint was neck pain, neck and arm pain or headaches
data from a pain-related outcome was provided for each group at relevant times
The inclusion criteria were applied by the senior author to the titles and abstracts of the studies identified in the searches.
Studies included in the review were then subjected to quality rating by two independent raters (not the senior author). Ratings were derived using the PEDro Scale  for a score out of 11. Scores were converted into a percentage figure. Each rater conducted a separate rating. After this, ratings were compared. When exact agreement was not achieved, a consensus method was used to resolve any disagreements in ratings. This method involved the two raters working together first. If any disagreements could not be resolved between them, the senior author joined the discussion and forced a consensus rating. Studies scoring higher than 50% were included in the review.
Categorization of the included studies
Studies were separated into two categories: 1) single and 2) multiple intervention session trials.
Data extraction and analyses
Data were extracted by a single author. The following data was extracted: complaint type, number of subjects in the control group, control intervention type, type of primary outcome measure, whether blinding was checked post-intervention, primary pain-related outcome data for the control group(s) (typically a VAS: means, variance measures, effect sizes). For determining clinically important changes, several criteria were employed. Where continuous data were available, a threshold of 20% improvement was used; otherwise, an effect size of 0.30 was employed [similar to Vernon et al. [10, 13, 14]. Data were not formally pooled, although, when possible, means (sd) of the outcomes of selected groups of trials were computed. Data from the index treatment group was not analyzed.
The PubMed search yielded 753 citations of which 13 were selected [15–26]. The manual search of Ernst and Harkness' study  identified no additional studies. The search of Vernon et al.  identified 3 additional studies [27–29]. The senior author identified 5 additional studies [30–34] for a total of 21 studies. There were four exceptions to the inclusion rules as follows: the trial by Bakris et al  was accepted for its unique approach to creating a control procedure; the trials by Buchman et al. , Tuttle et al.  and Dunning and Rushton  were accepted as manual control studies, even though the outcome measure was not pain-related.
Types of control interventions
Multiple session studies
Single session studies
Placebo: drug only
Placebo: non-manual therapy
Placebo: drug + sham manual therapy
Manual contact: no thrust
Low level manual therapy
Low level non-manual therapy only
Low level manual + non-manual therapy
Review of studies of manipulation and control or placebo comparison
STUDY (First author name)
N (CONTROL GROUP)
PRIMARY OUTCOME MEASURE
PRIMARY OUTCOME FOR CONTROL GROUP
Sloop, 1982 
Diazepam (anamnestic) with no treatment
For neck pain: - VAS
VAS change @ 3 wks = 5 mm (±32 mm)
NVS = 28% subjects reported treatment helped
Nilsson et al., 1997 
Low power laser light and deep friction massage
- headache hrs
Avg HA (hrs/day ± inter4tile range) pre = 4.0; post = 2.4; Δ = -1.6 ± 2.5
- headache intensity
Avg HA intensity (mm ± inter4tile range)
Pre = 41; post = 36; Δ = -4.2 ± 26
- analgesic use
Avg analgesics/day ± inter4tile range
Pre = 1.0; post = 0.7; Δ = -0.3 ± 1
Bove and Nilsson, 1998 
Cervical spine: Headache
Low power laser light and deep friction massage (described as both an "active control" and as the "placebo" group)
pre = 2 wks post = 7 wks follow-up = 19 wks
Avg HA hrs/day (95% CI)
Pre = 3.4 (2.4-4.4)
post = 1.9 (0.9-2.9); Δ = -1.5
follow-up = 2.2 (1.2-3.2); Δ = -1.2
-headache intensity (VAS)
Avg HA intensity (95% CI)
Pre = 37 mm (33-41)
post = 34 mm (26-38); Δ = -3
follow-up = 26 mm (20-32); Δ = -11 *
Avg analgesics/day (95% CI)
Pre = 0.82 (0.5-1.14)
post = 0.59 (0-1.49); Δ = -0.23
follow-up = 0.56 (0.22-0.9); Δ = -0.26
Tuchin et al., 2000 
Cervical spine - migraines
De-tuned interferential therapy
Headaches per month
Avg HA/month (SD)
Pre = 7.3 (6.53)
post = 6.9 (6.6); ES = 0.06
Headache intensity (VAS)
Avg HA intensity VAS (SD)
Pre = 7.89 (1.2)
post = 6.2 (1.7); ES = 1.17 *
Headache duration (hrs) analgesics/month
Avg HA duration in hrs/episode (SD)
Pre = 22.6 (27.4)
post = 19.8 (17.7); ES = 0.12
Avg analgesics/month (SD)
Pre = 20.1 (28.4)
post = 16.2 (12.4); ES = 0.19
Alison et al., 2002 
Neck and arm pain
Median SF-MPQ (inter4ile range) Pre = 10.0 (9.0)
- NPQ - VAS
Post = 7.5 (4.0); Δ = -2.5 *
median NPQ (inter4tile range)
Pre = 12.5 (4.0)
Post = 11.5 (6.0); Δ = -1.0
median VAS (inter4tile range)
Pre = 3.3 (3.5)
Post = 3.8 (3.9); Δ = 0.5
Bakris et al., 2007 
Cervical spine - hypertension
Manual contact, Inappropriate direction of thrust
BP & pulse
Systolic BP: Pre = 145.3; post = 142.1; Δ = -3.2
Diastolic BP: Pre = 91.0; post = 89.2;
Δ = -1.8 Pulse rate Pre = 73.3; post = 73.8, Δ = -0.5
Vernon et al., 2009 
Cervical spine: headache
Sham manipulation with head thrust, but no segmental thrust + placebo meds
Reduction of headache days
Outcome not reported for sham manipulation only
Borusiak et al., 2009 
Cervical spine: pediatric headache
Light touch/no thrust
- % of days with HA
% of days with HA (SD)
Pre = 41.2 (28.5)
Post = 31.8 (28.3); ES = 0.33 *
- Duration of HA
Duration of HA in hours (SD)
Pre = 113.8 (115.1) Post = 107.2 (121.1); ES = 0.06
- Intensity of HA
Intensity of HA VAS (SD)
Pre = 4.9 (1.8)
Post = 5.0 (1.8); ES = 0.06
Haas et al., 2010 
Cervical spine/cervicogenic headache
Heat + light massage:
Numerical rating scales for:
A - Headache intensity
Group 1- 8 sessions
Percentage score - Group 1:
Pre = 56.8 (15.8)
12 wk = 42.0 (20.6); ES = 0.81 *
24 wk = 41.5 (18.2); ES = 0.90 *
Group 2- 16 sessions
B - Neck pain
Percentage score - Group 2:
Pre = 58.7 (17.1)
12 wk = 49.4 (19.0); ES = 0.52 *
24 wk = 48.6 (21.4); ES = 0.52 *
Percentage score - Group 1:
Pre = 60.5 (21.4)
12 wk = 47.1 (24.2); ES = 0.59 *
24 wk = 47.2 (21.8); ES = 0.58 *
Percentage score - Group 2:
Pre = 48.5 (23.6)
12 wk = 42.8 (21.6); ES = 0.25
24 wk = 48.4 (23.1); ES = 0.004
Pikula, 1999 
1. SMT- contralateral
2. Detuned US
VAS neck pain
1. Contralateral manip VAS (SD)
Pre = 44.1 (27.5)
post = 41.4 (28.4)
ES = 0.10
2. Placebo UltraSound VAS (SD)
Pre = 50.4 (22.5)
post = 46.5 (21.8)
ES = 0.18
Haas et al 2003 
Alternate site manipulation
Pain (VAS) Measured immediately after and later that evening (~ 6 hours)
Pain VAS (SD)
Pre = 40.4 (20.9)
post = 24.7 (19.5); ES = 0.78 *
follow-up = 28.7 (19.6); ES = 0.58 *
Buchmann et al., 2005 
Manual contact, no rotation, no thrust
Dysfunctional motion segments
Pre contact = 13 dysfunctional segments
Post contact = 13 dysfunctional segments
Martinez-Segura et al., 2006 
Cervical spine - Neck pain
Manual contact, cervical rotation, no thrust
Resting neck pain (VAS)
Resting neck pain VAS (SD) Pre = 5.5 (1.7) post = 5.1 (1.9) ES = 0.22
Fernandez De Las Penas et al., 2007 
Cervical spine - healthy subjects
Manual contact, cervical rotation, no thrust
PPT at elbow both ipsi and contral.
Ipsi elbow PPT
Pre = 2.3(0.4);
post = 2.3 (0.5); ES = 0
Contra elbow PPT
Pre = 2.3(0.5); post = 2.3 (0.6); ES = 0
Ruiz-Saez et al., 2007 
Cervical spine - trapezius MTrP's
Manual contact, cervical rotation, no thrust
PPT at trapezius trigger points
Pre = 1.34 (0.4)
Post = 1.27 (0.4); ES = -0.18
Post 5 min = 1.15 (0.4); ES = 0.48 *
Post 10 min = 1.1 (0.5); ES = 0.53 *
Fernandez-Carnero et al., 2008 
Cervical spine - tennis elbow
Manual contact, cervical rotation, no thrust
Affected elbow PPT
Pre = 314.4 (11.6); post = 327.7 (18.6) ES = 0.88 *
Contralateral elbow PPT
Pre = 475.2 (78.5); post = 481.2 (84.6) ES = 0.07
- Thermal pain threshold (TPT)
Affected elbow TPT ( o C)
Pre = 41.1 (3.4); post = 41.8 (1.3)
ES = 0.3
Contralateral elbow TPT ( o C)
Pre = 44.3 (1.5); post = 43.4 (0.9)
ES = 0.75
- Pain free grip strength (PFG)
PFG (KG) affected side
Pre = 14.7 (6.0); post = 13.6 (6.2)
ES = 0.18
Fernandez De Las Penas et al., 2008 
Cervical spine- healthy subjects
Manual contact, cervical rotation, no thrust
C5-C6 Z-joint tenderness (PPT)
Left Z-joint PPT
Pre = 316.4 (30.5); post = 311.8 (32.8) ES = 0.15
Right Z-joint PPT
Pre = 315.0 (43.8); post = 312.3 (47.7) ES = 0.06
Tuttle et al., 2008 
Cervical spine - neck pain
Mobilization applied to non-symptomatic level
NS decrease in flex/ext AROM
NS increase in lat flex and rotation
NS decrease in stiffness (data not reported; only graphic data)
Dunning and Rushton, 2009 
Cervical spine - EMG of biceps muscle
Manual contact, cervical rotation, no thrust
Biceps resting EMG
21.12% (±5%) increase in resting EMG of right bicep after sham
17.15% (±7%) increase in resting EMG of left bicep after sham
Mansila-Ferragut et al., 2009 
Upper cervical spine - neck pain
Manual contact, cervical rotation, no thrust
- PPT at the sphenoid
PPT (95% CI)
Pre = 0.8 (0.7 - 0.9)
Post = 0.7 (0.5 - 0.9); Δ = -0.1
- Active mouth opening
Active mouth opening in mm (95% CI)
Pre = 36.2 (34.3 - 38.2)
Post = 35.9 (33.7 - 38.0); Δ = -0.3
Sterling et al., 2010 
Neck pain findings
- PPT at C6
PPT at C6 (SD)
Pre = 216.1 (103.2)
Post = 253.4 (114.2); ES = 0.34 *
- Nociceptive Flexion Reflex (NFR) threshold
NFR threshold (SD)
Pre = 8.0 (5)
Post = 7.9 (5.4); ES = -0.02
- VAS pain from NFR
VAS pain from NFR (SD)
Pre = 4.5 (3.8)
Post = 3.6 (2.8); ES = 0.27
Ten (10) trials [15, 16, 18, 26–30, 32, 34] employed a pain visual analogue scale (VAS) to record neck pain or headache intensity; one [Haas 10] used an 11-point numerical rating scale. Six (6) trials [19–22, 25, 26], all single-session, employed a pressure threshold algometer to measure pressure pain thresholds over the neck, upper back or upper limb.
Pain outcomes by control group type
Four trials employed some form of placebo control. Sloop et al.'s trial for neck pain  employed anamnestic valium in both groups with the control group receiving no actual manipulations. Outcomes were obtained at an average of two weeks post-treatments. Most patients received only one treatment, while some received two. The mean change in VAS scores in the control group was -5 mm; however, the standard deviation for this value was quite large at 32 mm. Vernon et al.'s  trial for tension-type headache employed a factorial design whereby three of four groups received at least one placebo/sham version of the therapies (amitryptiline and spinal manipulation) with one of these groups receiving both placebo treatments. This was the only trial to employ a sham cervical manipulation treatment; however, there was no report of the outcomes for each group separately, so no data were available for this review on the outcome of the double placebo group.
Two trials used de-tuned therapy devices as the control treatment. Tuchin et al.s'  multi-session trial for migraine headaches employed de-tuned interferential therapy and reported an average headache intensity reduction in the control group of 17 mm with an effect size of 1.17. Pikula's single session study  employed two control groups, one of which received du-tuned ultrasound. The immediate pain reduction averaged 4 mm with an effect size of 0.18.
Using the Sloop et al. and Pikula trials for estimating pain reduction on a VAS in the placebo control groups provides an average of 4.5 mm reduction, which is well below the level most often adopted for minimal clinically important difference and is in accord with the values of placebo control group outcomes reported by Vernon et al.  in non-manual therapy trials.
Three trials employed, as the control treatment, cervical manipulation at an "alternate" site. Bakris et al.  employed recoil manipulation at what they defined as an ineffective site. As their study investigated the effect of manipulation on blood pressure, no pain-related outcomes were available. They did report virtually no difference in systolic and diastolic blood pressure pre-post intervention in this control group. Pikula  employed manipulation to the contralateral side in his single-session control group. He reported an average of 3 mm reduction on a pain VAS (effect size = 0.10). Haas et al.  employed a cervical manipulation at an alternate site from the target pain site. This site was determined randomly and compared to sites which had been determined by manual palpation. They reported an average VAS reduction of 16 mm (effect size = 0.78) which is considered a clinically important change.
Data on pain outcomes was available from fourteen (14) trials with non-placebo control groups (5 multi-session; 9 single session). In the five multi-session trials, control groups received either low-level manual contact with no thrust [28, 29, 34, 35] or a waiting list . In three of these studies [16, 28, 29], the average pain reduction on a VAS was 3 mm. Bove and Nilsson  also reported a 7-week follow-up of an average 11 mm reduction in their tension-type headache patient's headache intensity. Haas et al. reported relatively small percentage reductions in headache pain (14.8% for the 8-treatment group, 9.3% for the 16-treatment group at 12 weeks, 15.3% and 9.9% respectively at 24 weeks); however, the effect sizes for these changes were above our threshold (0.90 and 0.52 for 12 weeks; 0.52 and 0.52 for 16 weeks). The fifth trial  reported virtually no change in their pediatric headache control group.
Of the nine (9) single-session studies [18–20, 22, 23, 25, 26, 30, 32], two [30, 32] employed a contra-lateral or alternate site thrusting manipulation; the other 7 studies used a "manual contact/no thrust" procedure. The mean [95% CI] effect size for VAS reduction of neck pain (five groups [Pikula x2 , Haas et al. , Martinez-Segura et al. , Sterling et al. ) was 0.31 [-0.02,0.64]. The mean [95% CI] effect size for neck, trapezius and elbow PPT reduction in 6 groups [19–22, 25, 26] was 0.34 [0,0.68]. These mean ES just exceeded our threshold for minimally important clinical difference.
Three single-session studies which used manual contact/no thrust controls did not report pain-related outcomes. Their results were as follows: in Buchman et al. , the control procedure appeared to have no effect on the presence of palpable cervical segmental dysfunctions; in Tuttle et al. , there was no significant change in active ranges of motion; in Dunning and Rushton's trial , resting EMG of the biceps muscle increased by an average of 21% on the ipsilateral side and 17% on the opposite side. The first two of these studies provided some support for the proposition that non-thrust procedures do not result in changes in the mobility of the cervical spine.
With regard to clinically important changes, the 21 reviewed studies provided 35 comparisons of baseline to post-treatment pain scores (some trials had no pain-related outcomes; some trials had 2 or 3 comparison times for a pain-related outcome (Haas et al.  had 8 such comparisons). In 15 (43%) of these comparisons (8 trials), the control group outcomes exceeded the minimum threshold of 20% reduction of pain/tenderness or effect size greater than 0.30 (See Table 2 for *). In these 8 trials, 5 employed manual contact/no thrust controls [20, 21, 26, 30, 35], 1 employed a waiting list , 1 employed low level laser and deep massage  and 1 employed detuned ultrasound . This latter trial is notable as it was the only one of these 8 trials to use a single-blind placebo treatment; the control group effect size for average headache intensity reduction was 1.17.
The nine (9) multi-session studies merit additional analysis. Seven of these trials [15, 16, 27, 28, 29, 34, 35] provided pain-related outcome comparisons; however, five of them were for headache and only two [16, 27] were for neck pain. Within these trials, the control procedures which did not result in a mean reduction of pain that reached the clinically important threshold included anamnestic valium, low power laser + deep frictions, light touch/no thrust.
With respect to the issue of confirming the success of the blinding or the subject's identification of their group assignment, only 3 trials (15%) reported performing this check [27, 31, 34], all of which used placebo controls. These studies reported that the blinding was generally successful. Two studies which did use a placebo control [32, 33] did not report post-intervention registration. None of the studies which used non-placebo control groups reported on the degree to which subjects in each of their study groups could identify their group registration.
The primary objective of this review was to characterize the types of control procedures used in controlled clinical trials of cervical spinal manipulation and the outcomes obtained by subjects in these groups. The goal of this analysis was to identify areas for improvement in future controlled clinical trials. Twenty-one (21) trials were identified, 9 multi-session trials and 12 single-session trials. The most commonly employed control group procedure was "manual contact/no thrust" (12 groups). The clinical outcomes obtained in these control groups are varied, as discussed below.
Clinical trial theory posits that the ideal control treatment should account for all of the non-specific effects of the index treatment but carry none of the direct therapeutic benefits [5–11]. Machado et al.  used the following terms to describe these attributes: a placebo treatment that has no known or substantiated therapeutic mechanism is termed "inert"; an inert placebo which mimics the index treatment in all aspects, including the replication of any common side effects is termed "indistinguishable"; when placebos cannot be made indistinguishable, researchers should strive to create "structural equivalence". This term refers to the degree to which the control procedures are as similar in nature and delivery as possible to the index treatment.
Any difference between the index and control treatments that is obtained in the trial ("the trial effect size") should, theoretically, result from the therapeutic mechanism purported to exist in the index therapy. Any deviation from this ideal circumstance has important effects on the potential success of a clinical trial by, for example, increasing the therapeutic effect of the control treatment, thus reducing the trial effect size (Type II error) or by increasing the therapeutic effect of the index treatment (Type I error).
Hawk and colleagues [7–9] and Hancock et al.  have noted that the development of placebo manipulation procedures by researchers in manipulative therapy has been challenging. They identified two important objectives of placebo manipulation procedures: 1) the equalization of the non-specific effect of physical touch between groups of subjects, and 2) the blinding of the subject as to the nature of the treatment. Hawk et al. identified the essence of such a placebo manoeuvre in that it "increase(es) the believability of the intervention, thus equalizing the effect of expectation of improvement between groups" .
Strong placebogenic effects of manipulation [4–9, 36] have been hypothesized. Key factors in this regard include the encouragement of patient relaxation in order to facilitate the procedure, the generic or non-specific effects of manual contact, including fulfillment of patients' expectations regarding manual contact on subjectively felt problem sites and the effect of the thrust and cavitation in fulfilling patient expectations that "something important has just happened". This latter point is often reinforced or amplified by positive feedback statement or behavior from the clinician .
Sham cervical manipulation procedures should account for the following issues: tactile contact with the skin, head and neck motions involved in the procedure, mechanical loads applied to the tissues and the sounds associated with them. Differences between a sham and an actual procedure for any one or more of these characteristics might be responsible for cuing the patient as to the nature of the procedure applied. These criteria can be applied to an evaluation of the control procedures described above.
Aside from the control procedure used in Vernon et al., [6, 31], which did not involve an actual thrusting manipulation, all other control procedures identified in this review do not provide for the following important elements: a) simulated manual thrust, b) distracting noise to create ambiguity on the issue of cavitation and, c) proven lack of therapeutic effectiveness a priori. This combination creates the maximum level of "indistinguishability" possible in manual therapy research. However, with Vernon et al.'s clinical trial , the results of the sham manipulation as a control procedure cannot be confirmed separately from the placebo medication, as both were used in the true control group (see below).
The few studies that did employ a single thrusting manipulation control procedure did so on the basis of applying it to sites designated by the investigators as alternative to the "clinically important manipulation site". Whether this was at an alternate segment in the cervical spine [30, 32] or at a supposedly non-effective site at the same segment (Bakris et al.  who also used a supposedly ineffective thrust direction), by actually providing a "real" manipulation (but at a supposedly inert site), these procedures did not accomplish the goals of simulating thrust and cavitation sounds (they actually produced them). Furthermore, these procedures were not tested previously for their inertness; in other words, there may have been some element of "indistinguishability", but the level of inertness was not established a priori.
In the case of Haas et al. , the alternate site procedure produced clinically important changes that were roughly equivalent to the index procedure; thus invalidating this procedure as a useful control manipulation (although the intention of these authors was not to establish the alternate site approach as a "control" procedure). In the case of both Pikula  and Bakris et al. , their control groups' results were only confirmed posteriorly. Pikula's small study provides very limited support for the idea that a real manipulation at a site designated as clinically less important may work as a control procedure in a single session. In the case of Bakris et al., it appears that their control procedure is highly dependent on the model of manipulation and the skill of the chiropractor involved and may not be generalizable to other circumstances.
Distinguishing control procedures which do and don't employ thrusts becomes important for two reasons. These will be discussed with respect to manual-type procedures and then non-manual type procedures. With respect to manual-type procedures, the first issue pertains to the control group subjects who receive non-thrust procedures, particularly those that involve manual contact without thrust (the predominant category in this review). While the strategy of "manual contact without thrust" does account for some similarities with real manipulation in patient positioning and in initial manual contact, such subjects do come to know after-the-fact that they have not received a thrusting procedure (because there is no simulation of thrust and cavitation noise). This may become incongruent with their expectations, especially in multi-session trials, and create a psychological factor superimposed on the more direct treatment-related outcome (which should, theoretically, be minimal). This could even rise to the status of a "nocebo" effect if the subject's posterior knowledge and resultant unmet expectations (especially over several sessions) combine to result in a negative attitude to the circumstances and in a poorer response on clinical outcome measures.
Summary of single-session studies using manual contact/no thrust control procedure
Buchman et al. 
No change in fixations
Martinez-Segura et al. 
No change in neck pain
Fernandez De Las Penas et al. 
No change in elbow PPT
Ruis-Saez et al. 
Moderate change trapezius TP at 5 and 10 minutes follow-up
Fernandez-Carnero et al. 
No change in elbow PPT
Fernandez De Las Penas et al. 
No change in neck PPT
No change in ROM
Dunning and Rushton 
Moderate change in biceps EMG
Mansila-Ferragut et al. 
No change in TMJ PPT
Sterling et al. 
Moderate change in Neck PPT
The second issue applies to those subjects who receive the "real manipulation" in studies where other groups of subjects do not. By experiencing thrusting manipulations (with resultant audible cavitations (i.e., clicking sounds), subjects in these groups automatically receive indications which they would interpret to mean that they did receive the "index" treatment. This may result in an effect opposite to the one described above, where their treatment expectations are strongly confirmed. This may result in a non-specific effect which adds to that of the direct effects of the therapy, leading to a positive augmentation of the clinical outcomes (especially those that require subjective responses, and especially those related to satisfaction ratings). In passing, it should be remarked that in virtually all prior clinical trials of manipulation for neck pain and headaches, this is the situation that prevails in the groups receiving spinal manipulation.
With regard to non-manual control procedures, these do more readily lend themselves to the creation of placebo versions by, for example, de-tuning the equipment or applying very low doses of therapy. However, these procedures account for none of the manipulation-specific issues discussed above, making comparisons between these groups problematic, especially when issues of "mechanism of action" become important for the investigation. On strictly pragmatic grounds, non-manual placebo control procedures (such as those reviewed in Vernon et al. ) may be satisfactory for manipulation trials as they clearly result in clinical outcomes below the threshold of minimal clinically important difference.
With the exception of Vernon et al. , all studies have employed a single control procedure. Even if a single procedure is somewhat successful at masking subjects, it must do so entirely on its own. The "double placebo technique" [37–41] uses two procedures (either in a factorial design or in a simpler 2 or 3-group design) to increase the effectiveness of masking. While there is some evidence from the trial by Vernon et al. that this strategy was successful, more studies on this approach are needed.
It is difficult to summarize the clinical outcomes of the control groups analysed in this review on account of the highly variable methods and results. In 43% of the pain-related comparisons used in approximately one-third of the trials (8/21), the control procedure resulted in mean changes (reductions) that would be deemed clinically important. On the other hand, in some proportion of each of the categories of control procedures, these procedures resulted in mean changes which were below the minimal clinically important threshold.
With regard to the multi-session studies, where three trials did report mean changes in the control groups that did not exceed the minimal clinically important threshold, several caveats are offered. In the case of Sloop et al. , while the control group receiving anamnestic valium + no manipulation reported very little improvement, most of the subjects in that study received only one treatment session and the outcome was obtained two weeks following treatment. In the case of Borusiak et al. , it should be noted that this trial, which employed a light touch/no thrust control procedure, included only children as subjects; this finding makes it difficult to justify the use of this control procedure in multi-session clinical trials with adults.
Beyond the issues of indistinguishability, equivalence and clinical effect, a very serious issue is the lack of determination of the level of blinding in the vast majority of these studies. In many cases, a result in favor of the manipulation group (especially in single-session studies) has been superficially interpreted to mean that a beneficial effect exists for manipulation, when, in fact, the investigators have no idea as to the degree to which subjects in the control group(s) may have become unblinded and how that may have affected their responses. There is a strong possibility of a Type I error which is then ignored. This problem is not confined to spinal manipulation as a therapy nor is it confined to treatments for the neck alone. Machado et al.  and Puhl et al.  have shown that the same problem exists for other pharmacologic and non-pharmacologic treatments for low back pain.
Finally, for single-session studies of clinical efficacy, every attempt should be made to report on randomization concealment and intention-to-treat, as this would make for more complete reporting and easier quality scoring.
This review has limitations. The entry-level search strategy was more broadly defined in order to identify the largest range of potential studies. While a search strategy with a more highly specified algorithm might have been employed, we are confident that our strategy is ultimately replicable by other investigators.
We did not specify the results of each item of the PEDro scale for each study, as all studies achieved a score above our threshold for acceptability. In this review, we felt that such an analysis, typically found in other systematic reviews, was not necessary, as the primary objective was not a methodological review of the studies themselves, but of the control group procedures used.
As noted in the Methods, pooling of outcomes data was not conducted. This was due to several factors. First, contrary to other systematic reviews, it was not our intention to derive a pooled estimate of the effect of a specific treatment on specific clinical entities; rather, we sought to characterize the various control group procedures used in a broad range of studies whose primary objective was the determination of the treatment effect of the index treatment, i.e., spinal manipulation. This lead to our review including a wide variety of studies with considerable heterogeneity on issues of clinical condition, subjects selected, outcomes measures and end points used, etc.
This heterogeneity made the reporting of outcomes more difficult, and less conventional, leading to our narrative account of the results.
The most commonly used control group procedure in clinical trials of cervical manipulation is manual contact/no thrust. Most control procedures in cervical manipulation trials do not result in subject blinding. Clinical outcomes of these groups were varied with about one third of groups demonstrating a clinically important change. The greatest majority of trials do not report on post-intervention blinding. The effect of unmasking of control subjects makes the interpretation of the existing clinical trials challenging. At the very least, future clinical trial reports should include an indication of the post-intervention registration of group allocation by subjects. A small number of candidate procedures exists for effective control interventions which create optimum combinations of inertness and practical equivalence with real manipulative therapies. Much more research is required to improve this important aspect of clinical trial methodology in cervical manipulation studies.
This study was supported by: NIH NCCAM grant R21 AT004396-01A1 to HV.
- Vernon H, Humphreys BK: Manual therapy for neck pain: an overview of randomized clinical trials and systematic reviews. Eura Medicophys. 2007, 43: 91-118.PubMedGoogle Scholar
- Miller J, Gross A, D'Sylva J, Burnie SJ, Goldsmith CH, Graham N, Haines T, Brønfort G, Hoving JL: Manual therapy and exercise for neck pain: a systematic review. Man Ther. 2010, 15 (4): 334-54. 10.1016/j.math.2010.02.007.View ArticlePubMedGoogle Scholar
- D'Sylva J, Miller J, Gross A, Burnie SJ, Goldsmith CH, Graham N, Haines T, Brønfort G, Hoving JL, the Cervical Overview Group: Manual therapy with or without physical medicine modalities for neck pain: a systematic review. Man Ther. 2010, 15 (5): 415-433. 10.1016/j.math.2010.04.003.View ArticlePubMedGoogle Scholar
- Ernst E, Harkness E: Spinal manipulation: A systematic review of sham-controlled, double-blind, randomized clinical trials. J Pain Sympt Manage. 2001, 22: 879-889. 10.1016/S0885-3924(01)00337-2.View ArticleGoogle Scholar
- Hancock MJ, Maher CG, Latimer J, McCauley JH: Selecting an appropriate placebo for a trial of spinal manipulative therapy. Aust J Physiother. 2006, 52: 135-138.View ArticlePubMedGoogle Scholar
- Vernon H, MacAdam K, Marshall V, Pion M, Sadowska M: Validation of a sham manipulative procedure fro the cervical spine for use in clinical trials. J Manipulative Physiol Ther. 2005, 28: 662-666. 10.1016/j.jmpt.2005.07.020.View ArticlePubMedGoogle Scholar
- Hawk C, Long C: Use of a pilot to refine the design of a study to develop a manual placebo treatment. J Neuromusculoskel Sys. 2000, 8: 39-48.Google Scholar
- Hawk C, Azad A, Phongphua C, Long C: Preliminary study of the effects of a placebo chiropractic treatment with sham adjustments. J Manip Physiol Therap. 1999, 22: 436-443. 10.1016/S0161-4754(99)70031-X.View ArticleGoogle Scholar
- Hawk C, Long C, Reiter R, Davis C, Cambron J, Evans R: Issues in planning a placebo-controlled trial of manual methods: results of a pilot study. J Altern Complement Med. 2002, 8 (1): 21-32. 10.1089/107555302753507159.View ArticlePubMedGoogle Scholar
- Vernon H, Humphreys BK, Hagino C: The outcome of control groups in clinical trials of conservative treatments for chronic neck pain: a systematic review. BMC Musculoskel Dis. 2006, 7: 58-10.1186/1471-2474-7-58.View ArticleGoogle Scholar
- Machado LAC, Kamper SJ, Herbert RD, Maher CG, McAuley JH: Imperfect placebos are common in low back pain trials: a systematic review of the literature. Eur J Spine. 2008, 17: 889-904. 10.1007/s00586-008-0664-3.View ArticleGoogle Scholar
- Maher CG, Sherrington C, Herbert R, Moseley A, Elkins M: Reliability of the PEDro scale for rating quality of randomized controlled trials. Phys Ther. 2003, 83: 713-721.PubMedGoogle Scholar
- Cohen J: Statistical Power Analysis for the Behavioral Sciences. 1988, New York, NY: Academic Press, 2Google Scholar
- Ottenbacher KJ, Barrett K: Measures of effect size in the reporting of rehabilitation research. Amer J Phys Med. 1989, 68 (2): 52-58.View ArticleGoogle Scholar
- Tuchin PJ, Pollard H, Bonello R: A randomized controlled trial of chiropractic spinal manipulative therapy for migraine. J Manipulative Physiol Ther. 2000, 23 (2): 91-5. 10.1016/S0161-4754(00)90073-3.View ArticlePubMedGoogle Scholar
- Allison GT, Nagy BM, Hall T: A randomized clinical trial of manual therapy for cervico-brachial pain syndrome - a pilot study. Man Ther. 2002, 7 (2): 95-102. 10.1054/math.2002.0453.View ArticlePubMedGoogle Scholar
- Buchman J, Wende K, Kundt G, Haessler F: Manual treatment effects to the upper cervical apophysial joints before, during, and after endotracheal anesthesia. Am J Phys Med Rehabil. 2005, 84 (4): 251-7. 10.1097/01.PHM.0000156895.80533.C1.View ArticleGoogle Scholar
- Martinez-Segura R, Fernandez-de-las-Penas C, Ruiz-Saez M, Lopez-Jimenez C, Rodriguez-Blanco C: Immediate effects on neck pain and active range of motion after a single cervical high-velocity low-amplitude manipulative in subjects presenting with mechanical neck pain: a randomized controlled trial. J Manipulative Physiol Ther. 2006, 29: 511-17. 10.1016/j.jmpt.2006.06.022.View ArticlePubMedGoogle Scholar
- Fernandez-de-las-Penas C, Alonso-Blanco C, Cleland JA, Rodriguez-Blanco C, Alburquerque-Sendin F: Changes in pressure pain thresholds over C5-C6 zygapophysial joint after a cervicothoracic junction manipulation in healthy subjects. J Manipulative Physiol Ther. 2008, 31: 332-7. 10.1016/j.jmpt.2008.04.006.View ArticlePubMedGoogle Scholar
- Ruiz-Saez M, Fernandez-de-las-Penas C, Rodriguez-Blanco C, Martinez-Segura R, Garcia-Leon R: Changes in pressure pain sensitivity in latent myofascial trigger points in the upper trapezius muscle after a cervical spine manipulation in pain-free subjects. J Manipulative Physiol Ther. 2007, 30: 578-83. 10.1016/j.jmpt.2007.07.014.View ArticlePubMedGoogle Scholar
- Fernandez-Carnero J, Fernandez-de-las-Penas C, Cleland JA: Immediate hypoalgesic and motor effects after a single cervical spine manipulation in subjects with lateral epicondylalgia. J Manipulative Physiol Ther. 2008, 31: 675-81. 10.1016/j.jmpt.2008.10.005.View ArticlePubMedGoogle Scholar
- Fernandez-de-las-Penas C, Perez-de-Heredia M, Brea-Rivero M, Miangolarra-Page JC: Immediate effects on pressure pain threshold following a single cervical spine manipulation in healthy subjects. J Orthop Sports Phys Ther. 2007, 37 (6): 325-9.View ArticlePubMedGoogle Scholar
- Tuttle N, Barrett R, Laakso L: Relation between changes in posteroanterior stiffness and active range of movement of the cervical spine following manual therapy treatment. Spine. 2008, 33: E673-9. 10.1097/BRS.0b013e31817f93f9.View ArticlePubMedGoogle Scholar
- Dunning J, Rushton A: The effects of cervical high-velocity low-amplitude thrust manipulation on resting electromyographic activity of the biceps brachii muscle. Man Ther. 2009, 14: 508-13. 10.1016/j.math.2008.09.003.View ArticlePubMedGoogle Scholar
- Mansila-Ferragut P, Fernandez-de-las-Penas C, Alburquerque-Sendin F, Cleland JA, Bosca-Gandia JJ: Immediate effects of atlanto-occipital joint manipulation on active mouth opening and pressure pain sensitivity in women with mechanical neck pain. J Manipulative Physiol Ther. 2009, 32: 101-6. 10.1016/j.jmpt.2008.12.003.View ArticleGoogle Scholar
- Sterling M, Pedler A, Chan C, Puglisi M, Vuvan V, Vicenzino B: Cervical lateral glide increases nociceptive flexion reflex threshold but not pressure or thermal pain thresholds in chronic whiplash associated disorders: a pilot randomized controlled trial. Man Ther. 2010, 15: 149-53. 10.1016/j.math.2009.09.004.View ArticlePubMedGoogle Scholar
- Sloop PR, Smith DS, Goldenberg E, Dore C: Manipulation for chronic neck pain: a double-blind controlled study. Spine. 1982, 7 (6): 532-5. 10.1097/00007632-198211000-00003.View ArticlePubMedGoogle Scholar
- Nilsson N, Christensen HW, Hartvigsen J: The effect of spinal manipulation in the treatment of cervicogenic headache. J Manipulative Physiol Ther. 1997, 20 (5): 326-30.PubMedGoogle Scholar
- Bove G, Nilsson N: Spinal manipulation in the treatment of episodic tension-type headache: a randomized controlled trial. JAMA. 1998, 280 (18): 1576-9. 10.1001/jama.280.18.1576.View ArticlePubMedGoogle Scholar
- Haas M, Groupp E, Panzer D, Partna L, Lumsden S, Aickin M: Efficacy of cervical endplay assessment as an indicator for spinal manipulation. Spine. 2003, 28: 1091-6. 10.1097/00007632-200306010-00002.PubMedGoogle Scholar
- Vernon H, Jansz G, Goldsmith CH, McDermaid C: A randomized, placebo-controlled clinical trial of chiropractic and medical prophylactic treatment of adults with tension-type headache: results from a stopped trial. J Manipulative Physiol Ther. 2009, 32: 344-51. 10.1016/j.jmpt.2009.04.004.View ArticlePubMedGoogle Scholar
- Pikula JR: The effect of spinal manipulative therapy (SMT) on pain reduction and range of motion in patients with acute unilateral neck pain: a pilot study. J Can Chiropr Assoc. 1999, 43 (2): 111-9.PubMed CentralGoogle Scholar
- Bakris G, Dickholtz M, Meyer PM, Kravitz G, Avery E, Miller M, Brown J, Woodfield C, Bell B: Atlas vertebra realignment and achievement of arterial pressure goal in hypertensive patients: a pilot study. J Hum Hypertens. 2007, 21: 347-52.PubMedGoogle Scholar
- Borusiak P, Biedermann H, BoBerhoff S, Opp J: Lack of efficacy of manual therapy in children and adolescents with suspected cervicogenic headache: results of a prospective, randomized, placebo-controlled, and blinded trial. Headache. 2010, 50 (2): 224-30. 10.1111/j.1526-4610.2009.01550.x.View ArticlePubMedGoogle Scholar
- Haas M, Spegman A, Peterson D, Aickin M, Vavrek D: Dose response and efficacy of spinal manipulation for chronic cervicogenic headache: a pilot randomized controlled trial. Spine J. 2010, 10: 117-128. 10.1016/j.spinee.2009.09.002.PubMed CentralView ArticlePubMedGoogle Scholar
- Coulehan JL: Chiropractic and the clinical art. Soc Sci Med. 1985, 21 (4): 383-90. 10.1016/0277-9536(85)90218-7.View ArticlePubMedGoogle Scholar
- Boutron I, Estallat C, Ravaud P: A review of blinding in randomized controlled clinical trials found results inconsistent and questionable. J Clin Epidemiol. 2005, 58: 1220-6. 10.1016/j.jclinepi.2005.04.006.View ArticlePubMedGoogle Scholar
- Boutron I, Tulbach F, Giraudeau B, Ravaud P: Blinding was judged more difficult to achieve and maintain in nonpharmacologic than pharmacologic trials. J Clin Epidemiol. 2004, 57: 543-50. 10.1016/j.jclinepi.2003.12.010.View ArticlePubMedGoogle Scholar
- Hrobjartsson A: What are the main methodological problems in the estimation of placebo effects?. J Clin Epidemiol. 2002, 55: 43-435. 10.1016/S0895-4356(01)00496-6.View ArticleGoogle Scholar
- Day SJ, Altman DG: Blinding in clinical trials and other studies. Br Med J. 2000, 321: 504-10.1136/bmj.321.7259.504.View ArticleGoogle Scholar
- Morin CM, Colecchi C, Brink D, Astruc M, Mercer J, Remsberg S: How "blind" are double-blind placebo-controlled trials of benzodiazepine hypnotics?. Sleep. 1995, 18: 240-245.PubMedGoogle Scholar
- Puhl Aaron A, Reinhart CJ, Rok ER, Injeyan HS: An Examination of the Observed Placebo Effect Associated with the Treatment of Low Back Pain - A Systematic Review. Pain Res Manage. 2011,Google Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.