Trunk-flexor muscle strength plays a fundamental role in athletic performance,1–3 health of children and adolescents,4,5 posture, and rehabilitation. It is also vital for core stability6,7 and situations such as urinary incontinence and athletes’ rehabilitation.8 Therefore, accurate and objective assessment of trunk-flexor muscle strength is essential in follow-up assessments of general physical activities, rehabilitation, and optimal performance in sports.
Isokinetic dynamometers can be used to directly measure trunk-flexor muscle strength,9 but their high cost and lack of portability limit their use. Thus, several alternative methods and instruments have been adopted.10,11 Endurance tests,10 which do not require specific equipment, are easy to apply for this purpose. However, endurance tests do not provide a direct measurement of strength and are more useful when assessing the resistance and functionality of the trunk-flexor muscles.
Handheld dynamometers have also been adapted to measure trunk-flexor muscle strength.11 However, while they provide direct measurement of strength, they are dependent on several factors, such as dynamometer position, the direction of the force applied by the rater, and even the rater’s experience.12 Moreover, a handheld dynamometer in this situation cannot evaluate the strength used to support the trunk weight. Hence, to the best of the authors’ knowledge, no specific portable instrument has been developed for this purpose.
The aim of this study was to assess (1) the concurrent validity of the Measurement System using calibrated barbells and (2) the reliability of the measurements obtained using the Measurement System by conducting test–retests. We hypothesized that concurrent validity and measurement reliability will obtain high indexes of approval.
Methodology
The BIOMEC research group13 has developed a low-cost device capable of isometrically assessing trunk-flexor muscle strength: the portable, one-dimensional, trunk-flexor muscle strength measurement system (Measurement System). The subject to be assessed lies in a supine position with knees and hips flexed (Figure 1A), then performs an isometric trunk flexion against a bar positioned on his/her chest (Figure 1B).
—The portable, one-dimensional, trunk flexor muscle strength measurement system (Measurement System). (A) The subject is assessed in a supine position with knees and hips flexed. (B) The subject is performing an isometric trunk flexion against a bar positioned on his/her chest.
Citation: Journal of Sport Rehabilitation 29, 6; 10.1123/jsr.2019-0210
The Measurement System consists of a base that supports the apparatus and upon which the subject is positioned, an articulated metal bar to which the forces will be applied, and a load cell that measures the forces applied to the bar (Figure 2A). To accommodate individuals of different sizes, the Measurement System has a height adjustment that allows the bar to be positioned on the subject’s chest (at the level of axilla). The Measurement System was assessed in 4 different conditions, the first 3 involving only calibrated barbells: (1) repeatability (ie, consecutive test–retest); (2) time reproducibility (ie, 24-h-interval test–retest); (3) position reproducibility (ie, tests in 3 different bar positions); and (4) subject repeatability, as a complementary assessment, in which the forces applied to the bar by individual subjects were measured. During the first 3 conditions assessed, the apparatus was positioned upside down, so that the force bar remained horizontal (Figure 2B). A rope and a hook were used to suspend the barbells, thus applying force perpendicularly to the bar.
—Measurement System schema. (A) Main parts and dimensions. (B) The apparatus was positioned upside down during the assessment with barbells.
Citation: Journal of Sport Rehabilitation 29, 6; 10.1123/jsr.2019-0210
The number of times known forces were applied to the bar (ie, the sample size) was calculated according to specific literature for reliability studies.14,15 This assumes the value of the null hypothesis of the intraclass correlation coefficient (ICC) is .40 (ie, any value less than .40 would be considered unacceptable), 80% of power, and a significance level of 95% to detect an ICC value of at least .80. For situations involving 2 repetitions, immediate test–retest (repeatability), and 24-hour-interval test–retest (time reproducibility), a minimum of 15.1 situations were found. For situations involving 3 repetitions, with measurements at different bar positions (position reproducibility), a minimum of 9.6 situations were found.
Five sets of measurements with barbells were carried out, as described below:
- •Set 1: 15 loads were applied to the center of the bar (day 1)
- •Set 2: 15 loads were applied to the center of the bar, immediately after set 1 (day 1)
- •Set 3: 10 loads were applied 15 cm to the left of the center of the bar (day 1)
- •Set 4: 10 loads were applied 15 cm to the right of the center of the bar (day 1)
- •Set 5: 15 loads were applied to the center of the bar (day 2)
In the sets with 15 situations (ie, sets 1, 2, and 5), the loads were applied at 49.05 N (5 kg) increments (from 0 to 686.7 N [70 kg]) with the barbells attached to the center of the bar. In the sets with 10 situations (ie, sets 3 and 4), the loads were applied from 49.05 N to 490.5 N (50 kg), with the barbells attached 15 cm to the left and to the right of center of the bar, respectively.
In addition to the assessments conducted with barbells, 15 healthy women, aged 27.7 years (±7.1), body mass 59.8 kg (±8.2), and height 1.66 m (±0.01) performed 2 consecutive maximum force tests, with a 2-minute interval to minimize any variations resulting from fatigue.
Statistical Analysis
The Shapiro–Wilk test was used to check the data’s normality. When assessing repeatability and time reproducibility, the Student t test was used to analyze the concurrent validity, comparing the difference between the barbell weights and the values obtained using the system with the zero value. When assessing position reproducibility, 1-way ANOVA was used to compare the differences between positions. Effect sizes (d for t test and f for ANOVA) were quantified using standardized differences in means (the mean difference divided by the between-subject SD), where effect sizes of 0.20, 0.60, 1.20, 2.0, and 4.0 represented small, moderate, large, very large, and extremely large effects, respectively.16
The ICC was used to assess reliability. The ICC1,2 (1-way random, 2 situations) was adopted for the repeatability, time reproducibility, and subject-repeatability conditions. The ICC1,3 (1-way random, 3 situations) was adopted for the position-reproducibility condition. Munro’s classification was assumed to interpret the ICC values of the reliability coefficients, in which .26 to .49 indicates low correlation, .50 to .69 indicates moderate correlation, .70 to .89 indicates high correlation, and .90 to 1.00 indicates very high correlation.17
Besides the ICC, the SEM and minimal detectable change (MDC) were adopted to assess the reliability of the measurements. The SEM quantifies the precision of individual scores on a test, thus providing an absolute index of reliability. The SEM was estimated using the SD of the measurements,
In all the tests, the level of significance adopted was α ≤ .05. All procedures were performed with the aid of IBM® SPSS Statistics® software (version 20; Chicago, IL).
Results
No significant differences were found when comparing the values obtained with the Measurement System and those from the barbells weight for any set of measurements (Table 1), indicating that the forces applied to the bar were correctly measured. When the test–retest procedures were carried out with barbells, the ICCs were always higher than .999 (Table 2), demonstrating high levels of reliability for all tested conditions.
Concurrent Validity
Assessment | Statistical test | P | Effect size |
---|---|---|---|
Set 1 × set 2 | t14 = –1.849 | .11 | d = 0.006 |
Set 1 × set 5 | t14 = 0.830 | .44 | d = 0.008 |
Set 1 × set 3 × set 4 | F2,44 = 0.001 | .10 | f = 0.012 |
Reliability Results
Assessment | ICC | SEM, N | MDC, N |
---|---|---|---|
Repeatability (set 1 × set 2) | .9999 | .1 | 0.3 |
Time reproducibility (set 1 × set 5) | .9997 | .3 | 0.7 |
Position reproducibility (set 1 × set 3 × set 4) | .9994 | .5 | 0.9 |
Subject repeatability | .9805 | 1.6 | 3.1 |
Abbreviations: ICC, intraclass correlation coefficient; MDC, minimal detectable change.
In the reliability tests conducted with the barbells (Table 2), a maximal SEM value of 0.5 N was found, indicating a very small error throughout the range of the measurements. In the same tests, the MDC values were lower at 1 N, indicating the maximum differences between measurements, which can be attributed to random error. Regarding the test–retest procedure conducted with the subjects, the ICC was also very high and an MDC of 3.1 N was found, indicating the minimum difference between measurements from the same subject required to identify any real difference (Table 2).
Discussion
The purpose of this study was to assess both the concurrent validity and the reliability of the measurements obtained using the Measurement System by conducting test–retests. Our hypotheses were confirmed, as both the concurrent validity and the reliability obtained high indexes of approval. Regarding the concurrent validity, the absence of difference (P > .05) and trivial effect sizes (<0.1) indicate high levels of accuracy (Table 1). In relation to reliability, the results show nearly perfect correlations and very low SEM and MDC values for all tested conditions (Table 2). Together, these findings demonstrate that the Measurement System can be used to objectively assess the strength of the trunk-flexor muscles.
To the best of our knowledge, this is the first portable dynamometer specifically designed to measure the strength of the trunk-flexor muscles. It can also evaluate the effect of the weight of the HAT (head, arms, and trunk) on the measurements. As with isokinetic dynamometers, it is necessary to previously measure the body segment by suspending it while completely relaxed,18 thus evaluating the effect of HAT. For example, if someone’s HAT offers a resistance of 40 N and the Measurement System identifies a force of 10 N, the total force attributed to muscle effort is in fact 50 N. The combined values of the forces measured during the maximum contraction test and HAT evaluation, although not equal to the abdominal muscular force itself (which could only be measured just by an invasive sensor connected directed to the muscles), can be used as an objective indicator of the capacity of the trunk-flexor muscles.
The MDC values establish a confidence interval to consider the difference between the measurements as real and not by chance. Thus, the MDC values lower than 1 N reveal the high level of reliability of the equipment, and 1 N can be considered the sensibility of the equipment. When the subjects performed the test–retest, the reliability was reduced compared with the barbell conditions, represented by a small decrease in the ICC and a slight increase in the SEM and MDC values (Table 2). When someone is required to repeat a maximal effort, factors such as motivation, disposition, learning effect, and fatigue may affect the results. Even so, the MDC obtained during the subject-repeatability condition indicates that differences greater than 3 N between measurements in the same subject cannot be attributed to chance. It is important to highlight that when applying force to the bar, the subject does not necessarily apply that force exactly in the exact center of the bar, as represented in sets 1, 2, and 5 with barbells. Due to factors such as motor coordination, asymmetric trunk-flexor muscle capacity, or a condition such as scoliosis, the center of pressure of resultant force may not be applied to the center of the bar. For example, when the oblique muscles are recruited bilaterally, they can perform a trunk flexion. However, when only one side is recruited, a trunk rotation will also occur. In the same way, if there are any asymmetries in these muscle strengths, a tendency of rotation could occur and the resultant force would not be considered applied at the center of the bar. Given this possibility, we also conducted assessments in which known forces were applied to the left and right of the center of the bar, and the results for both the concurrent validity and reliability tests were compatible with those obtained at the center of the bar. Therefore, when a human being performs the test, it does not matter where he/she will apply the force because the result will be the same. As a limitation, it should be noted that reproducibility evaluations were performed with dumbbells alone. It is necessary to evaluate the SEM and MDC when subjects perform the test on distinct days.
In conclusion, the portable, one-dimensional, trunk-flexor muscle force measurement system provides valid and reliable measures of trunk-flexor muscle strength. Thus, practitioners can use this relatively inexpensive portable force system to record accurate measures of trunk-flexor muscle strength.
Acknowledgments
The authors are grateful to PhysioPilates for providing the material used to construct the portable, one-dimensional, trunk-flexor muscle force measurement system.
References
- 1.↑
Tanaka NI, Komuro T, Tsunoda N, Aoyama T, Okada M, Kanehisa H. Trunk muscularity in throwers. Int J Sports Med. 2013;34(1):56–61. PubMed ID: 22903318 doi:10.1055/s-0032-1316316
- 2.
O’Connor S, McCaffrey N, Whyte E, Moran K. The development and reliability of a simple field based screening tool to assess core stability in athletes. Phys Ther Sport. 2016;20:40–44. PubMed ID: 27325538 doi:10.1016/j.ptsp.2015.12.003
- 3.↑
De Blaiser C, Roosen P, Willems T, Danneels L, Bossche LV, De Ridder R. Is core stability a risk factor for lower extremity injuries in an athletic population? A systematic review. Phys Ther Sport. 2018;30:48–56. PubMed ID: 29246794 doi:10.1016/j.ptsp.2017.08.076
- 4.↑
Smith JJ, Eather N, Morgan PJ, Plotnikoff RC, Faigenbaum AD, Lubans DR. The health benefits of muscular fitness for children and adolescents: a systematic review and meta-analysis. Sports Med. 2014;44(9):1209–1223. PubMed ID: 24788950 doi:10.1007/s40279-014-0196-4
- 5.↑
Moran J, Sandercock G, Ramirez-Campillo R, Clark CC, Fernandes JF, Drury B. A meta-analysis of resistance training in female youth: its effect on muscular strength, and shortcomings in the literature. Sports Med. 2018;48(7):1661–1671. PubMed ID: 29626334 doi:10.1007/s40279-018-0914-4
- 6.↑
Butcher SJ, Craven BR, Chilibeck PD, Spink KS, Grona SL, Sprigings EJ. The effect of trunk stability training on vertical takeoff velocity. J Orthop Sports Phys Ther. 2007;37(5):223–231. PubMed ID: 17549950 doi:10.2519/jospt.2007.2331
- 7.↑
Willardson JM. Core stability training: applications to sports conditioning programs. J Strength Cond Res. 2007;21(3):979–985. PubMed ID: 17685697 doi:10.1519/R-20255.1
- 8.↑
dos Santos KM, Da Roza T, Mochizuki L, Arbieto ERM, Tonon da Luz SC. Assessment of abdominal and pelvic floor muscle function among continent and incontinent athletes. Int Urogynecol J. 2019;30(5):693–699. PubMed ID: 29934766 doi:10.1007/s00192-018-3701-8
- 9.↑
Tsai YJ, Chia CC, Lee PY, Lin LC, Kuo YL. Landing kinematics, sports performance, and isokinetic strength in adolescent male volleyball athletes: influence of core training [published online ahead of print April 16, 2019]. J Sport Rehabil. 2019;29(1):65–72. doi:10.1123/jsr.2018-0015
- 10.↑
Evans K, Refshauge KM, Adams R. Trunk muscle endurance tests: reliability, and gender differences in athletes. J Sci Med Sport. 2007;10(6):447–455. PubMed ID: 17141568 doi:10.1016/j.jsams.2006.09.003
- 11.↑
De Blaiser C, De Ridder R, Willems T, Danneels L, Roosen P. Reliability and validity of trunk flexor and trunk extensor strength measurements using handheld dynamometry in a healthy athletic population. Phys Ther Sport. 2018;34:180–186. PubMed ID: 30366246 doi:10.1016/j.ptsp.2018.10.005
- 12.↑
Moreland J, Finch E, Stratford P, Balsor B, Gill C. Interrater reliability of six tests of trunk muscle function and endurance. J Orthop Sports Phys Ther. 1997;26(4):200–208. PubMed ID: 9310911 doi:10.2519/jospt.1997.26.4.200
- 13.↑
BIOMEC—Grupo de Investigação da Mecânica do Movimento research group. Retrieved from http://www.ufrgs.br/biomec
- 14.↑
Donner A, Eliasziw M. Sample size requirements for reliability studies. Stat Med. 1987;6(4):441–448. PubMed ID: 3629046 doi:10.1002/sim.4780060404
- 15.↑
Walter SD, Eliasziw M, Donner A. Sample size and optimal designs for reliability studies. Stat Med. 1998;17(1):101–110. PubMed ID: 9463853 doi:10.1002/(SICI)1097-0258(19980115)17:1%3C101::AID-SIM727%3E3.0.CO;2-E
- 16.↑
Hopkins W, Marshall S, Batterham A, Hanin J. Progressive statistics for studies in sports medicine and exercise science. Med Sci Sports Exerc. 2009;41(1):3–13. PubMed ID: 19092709 doi:10.1249/MSS.0b013e31818cb278
- 17.↑
Munro BH. Statistical Methods for Health Care Research. Vol. 1. Lippincott Williams & Wilkins; 2005.
- 18.↑
Finucane SD, Mayhew TP, Rothstein JM. Evaluation of the gravity-correction feature of a Kin-Com® isokinetic dynamometer. Phys Ther. 1994;74(12):1125–1133. PubMed ID: 7991654 doi:10.1093/ptj/74.12.1125