Advertisement
Research Article| Volume 65, 104012, September 2022

Download started.

Ok

Multimethodological validation of the modified fatigue impact scale in a Danish population of people with Multiple Sclerosis

Open AccessPublished:July 02, 2022DOI:https://doi.org/10.1016/j.msard.2022.104012

      Highlights

      • Novel multimethodological validation of the Danish version of modified fatigue impact scale.
      • Danish version of the modified fatigue impact scale showed good reliability, a good structural, cross-cultural, and nomological validity with a general and underlying structure including cognitive and physical subscales, an acceptable fit to a multidimensional Rasch analysis, and a good responsiveness.
      • Psychosocial subscale showed low reliability and low factor loadings and should not be interpreted independently.

      Abstract

      Objective

      To provide a multimethodological validation of MFIS in compliance with the COnsensus-based Standards for the selection of health status Measurement INstruments (COSMIN) guidelines using a Danish population of patients with Multiple Sclerosis (pwMS).

      Methods

      Factor analytic and multidimensional Rasch analytic methods were applied to investigate the structural validity of MFIS. We employed measurement and bi-factor models to address divergence and convergence. McDonalds ω was used to assess reliability. Cross-cultural and nomological validity was assessed in relation to relevant external populations and constructs, respectively. Responsiveness was evaluated following one month of multi-disciplinary rehabilitation (MDR).

      Results

      Based on data from 424 Danish MS patients, the MFIS showed an acceptable fit to both factor analytic and multidimensional Rasch models. MFIS was found to be dominated by a general factor but with considerable substructure present. Sufficient divergence was found between cognitive and physical subdomains, but not the psychosocial subscale. Mc Donald's ωt of 0.96 indicated good reliability of the scale, however, with low specific reliability of the psychosocial subscale. Homogeneity across Danish and European populations of pwMS supported cross-cultural validity. Explorative factor analysis of the nomological network of MFIS revealed expected convergence with cognitive and physical aspects. MFIS showed good responsiveness as indicated by moderate to large effect sizes following MDR.

      Conclusion

      The Danish version of the MFIS showed good reliability, a good structural, cross-cultural, and nomological validity an acceptable fit to a multidimensional Rasch analysis, and a good responsiveness. The psychosocial subscale of the MFIS however should be interpreted with caution.

      1. Introduction

      Multiple sclerosis (MS) is a chronic, progressive inflammatory and degenerative disease affecting the central nervous system, characterized by a complex pathogenesis and heterogenous symptoms across physical, cognitive, and neuropsychological domains (
      • Filippi M.
      • Bar-Or A.
      • Piehl F.
      • et al.
      Multiple sclerosis.
      ;
      • Thrue C.
      • Riemenschneider M.
      • Hvid L.G.
      • et al.
      Time matters: early-phase multiple sclerosis is accompanied by considerable impairments across multiple domains.
      ). Fatigue is the most common symptom reported by patients with multiple sclerosis (MS) (
      • Rommer P.S.
      • Eichstadt K.
      • Ellenberger D.
      • et al.
      Symptomatology and symptomatic treatment in multiple sclerosis: results from a nationwide MS registry.
      ), and it is known to have a detrimental impact on patients perception of health and quality of life (
      • Green R.
      • Cutter G.
      • Friendly M.
      • et al.
      Which symptoms contribute the most to patients' perception of health in multiple sclerosis?.
      ;
      • Amato M.P.
      • Ponziani G.
      • Rossi F.
      • et al.
      Quality of life in multiple sclerosis: the impact of depression, fatigue and disability.
      ). The most commonly used assessment of subjective perceptions of fatigue is based on measurements through self-report questionnaires, i.e. health-related patient-reported outcomes (HR-PROs). For this purpose, the Multiple Sclerosis Council for Clinical Practice Guidelines derived the 21-item Modified Fatigue Impact Scale (MFIS), from the original 40-item Fatigue Impact Scale (FIS), and recommended it for clinical use and research (

      Guidelines MSCfCP. Fatigue and multiple sclerosis: evidence-based management strategies for fatigue in multiple sclerosis. 1998.

      ;
      • Fisk J.D.
      • Ritvo P.G.
      • Ross L.
      • et al.
      Measuring the functional impact of fatigue: initial validation of the fatigue impact scale.
      ). Specifically, the MFIS is a three-factor composite scale evaluating and quantifying the trait level of experienced fatigue impact on physical (9 items), cognitive (10 items) and psychosocial functioning (2 items). The MFIS is an acknowledged outcome measure in clinical trials, and despite being widely used in the Danish MS community (
      • Taul-Madsen L.
      • Dalgas U.
      • Kjolhede T.
      • et al.
      A head-to-head comparison of an isometric and a concentric fatigability protocol and the association with fatigue and walking in persons with multiple sclerosis.
      ;
      • Johansson S.
      • Skjerbæk A.G.
      • Nørgaard M.
      • et al.
      Associations between fatigue impact and lifestyle factors in people with multiple sclerosis – the Danish MS hospitals rehabilitation study.
      ;
      • Oervik M.S.
      • Sejbaek T.
      • Penner I.K.
      • et al.
      Validation of the fatigue scale for motor and cognitive functions in a danish multiple sclerosis cohort.
      ;
      • Riemenschneider M.
      • Hvid L.G.
      • Ringgaard S.
      • et al.
      Study protocol: randomised controlled trial evaluating exercise therapy as a supplemental treatment strategy in early multiple sclerosis: the early multiple sclerosis exercise study (EMSES).
      ;
      • Callesen J.
      • Cattaneo D.
      • Brincks J.
      • et al.
      How do resistance training and balance and motor control training affect gait performance and fatigue impact in people with multiple sclerosis? A randomized controlled multi-center study.
      ), the Danish translation of the questionnaire has not yet been validated. Moreover, previous validation studies of other translations of the MFIS scale have reached different conclusions regarding the validity of the MFIS. Kos et al. found MFIS to be a valid scale in a multinational European study, albeit suggesting that the psychosocial subscale should be interpreted with caution as it showed weak homogeneity and added limited extra information (
      • Kos D.
      • Kerckhofs E.
      • Carrea I.
      • et al.
      Evaluation of the Modified Fatigue Impact Scale in four different European countries.
      ). Another study, using Rasch analysis, concluded that the MFIS is not valid in its current form as it did not fit the unidimensional Rasch model used (
      • Mills R.J.
      • Young C.A.
      • Pallant J.F.
      • et al.
      Rasch analysis of the Modified Fatigue Impact Scale (MFIS) in multiple sclerosis.
      ). The inherent multidimensionality of the MFIS scale however warrants a multidimensional Rasch analysis (
      • Whitehead L.
      The measurement of fatigue in chronic illness: a systematic review of unidimensional and multidimensional fatigue measures.
      ).
      Proper validation of HR-PROs is vital as healthcare systems often base important decisions on diagnosis and treatment options on measurement of health status measurements. Generally, a measurement scale is considered methodologically valid, if it measures what it intends to measure with negligible bias, sufficient precision, and sufficient consistency across varying measurement conditions within a pre-specified range (
      • Squara P.
      • Scheeren T.W.L.
      • Aya H.D.
      • et al.
      Metrology part 1: definition of quality criteria.
      ;
      • Squara P.
      • Scheeren T.W.L.
      • Aya H.D.
      • et al.
      Metrology part 2: procedures for the validation of major measurement quality criteria and measuring instrument properties.
      ). This also applies to psychometric scales. However, validation of psychometric scales is a conceptually and methodologically complex task, and as evidenced by the conflicting results on the validity of the MFIS, the choice of validation methodology is crucial for the precision and conclusion of the validation (
      • Mokkink L.B.
      • Terwee C.B.
      • Patrick D.L.
      • et al.
      The COSMIN checklist for assessing the methodological quality of studies on measurement properties of health status measurement instruments: an international Delphi study.
      ;
      • Boateng G.O.
      • Neilands T.B.
      • Frongillo E.A.
      • et al.
      Best practices for developing and validating scales for health, social, and behavioral research: a primer.
      ). On this basis, a comprehensive standardized checklist has been developed to evaluate measurement properties of health status measurement instruments: the COnsensus-based Standards for the selection of health status Measurement INstruments (COSMIN) (
      • Mokkink L.B.
      • Terwee C.B.
      • Patrick D.L.
      • et al.
      The COSMIN checklist for assessing the methodological quality of studies on measurement properties of health status measurement instruments: an international Delphi study.
      ). According to this checklist, the most important measurement properties when evaluating a HR-PRO is the construct validity (including structural validity, hypotheses testing, and cross-cultural validity), the criterion validity, and the content validity. Moreover, the reliability and the responsiveness of the HR-PRO is also considered important for the quality of a measure (
      • Mokkink L.B.
      • Terwee C.B.
      • Patrick D.L.
      • et al.
      The COSMIN checklist for assessing the methodological quality of studies on measurement properties of health status measurement instruments: an international Delphi study.
      ).
      On this basis, the aims of the present study were to (1) validate the Danish translation of the MFIS, by (2) making a comprehensive validation of the MFIS according to the COSMIN guidelines and thereby addressing the limitations of the previous conflicting conclusions on the validity of the MFIS. The latter will be done by (a) addressing the dimensionality of the construct (structural validation) as well as the convergent and discriminant validity of the three subscales within the MFIS construct, (b) investigating the reliability of the construct and its subscales, (c) investigating test- and item information using multidimensional Rasch analysis, (d) addressing group-invariance and performing a cross-cultural validation by comparison with published MFIS results for European populations (
      • Kos D.
      • Kerckhofs E.
      • Carrea I.
      • et al.
      Evaluation of the Modified Fatigue Impact Scale in four different European countries.
      ), (e) addressing the nomological validity of MFIS by testing hypothesized relationships to relevant constructs and external criteria (criterion validity), and (f) investigating the responsiveness of the scale in patients following one month of multidisciplinary rehabilitation (MDR).

      2. Methods

      2.1 Study population

      The study population for this validation study consisted of a total of 424 Danish MS patients. All patients were part of The Danish MS Hospitals Rehabilitation Study, and a detailed description of inclusion criteria, study design, data collection, and data management procedures can be found in the paper by
      • Boesen F.
      • Norgaard M.
      • Trenel P.
      • et al.
      Longer term effectiveness of inpatient multidisciplinary rehabilitation on health-related quality of life in MS patients: a pragmatic randomized controlled trial - the Danish MS Hospitals Rehabilitation Study.
      ) The study population consisted of two geographical subpopulations corresponding to the two Danish MS hospitals positioned in Haslev, Zealand (n = 192), and Ry, Jutland (n = 232). Data were collected at baseline, i.e., before any treatment received, with the exception of follow-up data (collected after one-month of MDR in a subset of the study population (n = 182)). Patients of this subpopulation had been assigned to the Fatigue main focus area of the MDR program of the Danish MS Hospitals based on an initial consultation prior to hospital admission indicating patient challenges and concerns in relation to fatigue.

      2.2 Item translation

      The 21 original MFIS items in English were translated to Danish, as well as back-translated to English by Sanofi-Aventis. Back-translation was used to evaluate the adequacy of the translation. Table 1 lists the 21 MFIS items and the Danish translation of each item.
      Table 1The 21 MFIS items, their subscale and Danish translation.
      NrSubscaleAbbreviationEnglishDanish
      01Cogn.Alertness01 I have been less alert.01 Jeg har været mindre oplagt
      02Cogn.Attention02 I have had difficulty paying attention for long periods of time.02 Jeg har haft problemer med at koncentrere mig i længere tid ad gangen
      03Cogn.Clear.thinking03 I have been unable to think clearly.03 Jeg har ikke været i stand til at tænke klart
      04Phys.Coordination04 I have been clumsy and uncoordinated.04 Jeg har været klodset og haft besvær med at koordinere mine bevægelser
      05Cogn.Forgetfulness05 I have been forgetful.05 Jeg har været glemsom
      06Phys.Pace.phys.act06 I have had to pace myself in my physical activities.06 Jeg har været nødt til at gøre en ekstra anstrengelse for at udføre fysiske aktiviteter
      07Phys.Motiv.phys.effort07 I have been less motivated to do anything that requires physical effort.07 Jeg har haft mindre lyst til at foretage mig noget, der kræver fysisk anstrengelse
      08PsSoc.Motiv.social.act08 I have been less motivated to participate in social activities.08 Jeg har haft mindre lyst til at deltage i sociale aktiviteter
      09PsSoc.Outdoor.act09 I have been limited in my ability to do things away from home.09 Mine muligheder for at deltage i aktiviteter uden for hjemmet har været begrænsede
      10Phys.Maintain.phys.effort10 I have trouble maintaining physical effort for long periods.10 Jeg har haft svært ved at udføre fysisk krævende aktiviteter i længere tid ad gangen
      11Cogn.Making.decisions11 I have had difficulty making decisions.11 Jeg har haft besvær med at træffe beslutninger
      12Cogn.Motiv.mental.effort12 I have been less motivated to do anything that requires thinking.12 Jeg har haft mindre lyst til at foretage mig noget, der krævede tankevirksomhed
      13Phys.Muscle.weakness13 My muscles have felt weak.13 Mine muskler har været svage
      14Phys.Phys.uncomfortable14 I have been physically uncomfortable.14 Jeg har haft fysisk ubehag
      15Cogn.Finish.tasks15 I have had trouble finishing tasks that require thinking.15 Jeg har haft besvær med at fuldføre opgaver, der krævede tankevirksomhed
      16Cogn.Organize.thoughts16 I have had difficulty organizing my thoughts when doing things at home or at work.16 Jeg har haft besvær med at samle tankerne, når jeg foretog mig noget hjemme eller på arbejde
      17Phys.Complete.phys.tasks17 I have been less able to complete tasks that require physical effort.17 Jeg har haft besvær med at fuldføre opgaver, der krævede fysisk anstrengelse
      18Cogn.Slow.thinking18 My thinking has been slowed down.18 Min tankevirksomhed er blevet mere langsom
      19Cogn.Concentration19 I have had trouble concentrating.19 Jeg har haft problemer med at koncentrerer mig
      20Phys.Limit.phys.act20 I have limited my physical activities.20 Jeg har begrænset mine fysiske aktiviteter
      21Phys.Have.rest21 I have needed to rest more often or for longer periods.21 Jeg har haft behov for at hvile hyppigere og/eller i længere tid ad gangen

      2.3 Statistical analyses

      All of the following validation methods were performed in R version 3.6.3 (2020–02–29) (R Core Team, 2020), especially using R-packages psych, (
      • psych W.R.
      Procedures For Psychological, Psychometric, and Personality Research.
      ) lavaan, (
      • Rosseel Y.
      lavaan: an R Package for Structural Equation Modeling.
      ) and mirt, (
      • Chalmers R.P.
      Mirt: a multidimensional item response theory package for the R environment.
      ) see supplementary material for more details.

      2.4 Data quality and scale acceptability

      We assessed the quality and scale acceptability of the data by analyzing missingness of data, scale range coverage, and flooring and ceiling effects (see supplementary material).

      2.5 Local independence, monotonicity, multivariate normality and tau-equivalence

      The validity of a psychometric scale as well as its assessment depends on a set of assumptions and the degree of deviation from those assumptions. We used non-parametric Mokken scale analysis in order to address the assumptions of local independence and monotonicity of the latent scale (
      • Dima A.L.
      Scale validation in applied health research: tutorial for a 6-step R-based psychometrics protocol.
      ). Local independence states that items are independent contingent on the latent scale, while monotonicity states that the latent construct is a monotone increasing scale. Both factor analytic and Rasch analytic models assume local independence and monotonicity. Factor analytic models are furthermore based on a normal-distributional assumption underlying the variance-covariance matrix of item responses, but estimators relaxing this assumption are available, for example the diagonally weighted least squares (DWLS) estimator. We used Mardia's test to test for multivariate normality among items on the total MFIS scale, as well as on the Cognitive and Physical subscales, but not the Psychosocial subscale due to its limited number of items. Reliability of a psychometric scale is often reported using Cronbach's α, but this reliability measure requires the assumption of tau-equivalence to be met. In factor analytic models, tau-equivalence poses a constraint on the variance-covariance matrix by restricting all covariances to be equal. Rasch models share the equivalent property of constraining all items to share the same slope on the latent construct. We evaluated the plausibility of tau-equivalence using confirmatory factor analysis (CFA) by comparing models with increasingly relaxed variance-covariance constraints: (1) all items share a common variance and all inter-item covariances are constrained to be equal (parallel items model), (2) unrestricted item variances but equal inter-item covariances (tau-equivalence model), and (3) unconstrained variance-covariance matrix (congeneric model).

      2.6 Structural validity and reliability

      We addressed the adequacy of the three-subscale model of MFIS consisting of the Cognitive, Physical, and Psychosocial subscales both in a factor analytic and multidimensional Rasch analytic context using the following two models: (1) a bi-factorial model and (2) a measurement model. The bi-factorial model hypothesizes that each item depends on three sources of variance: (1) a common or general factor, (2) a specific or subconstruct factor and (3) random measurement error. A bi-factor analysis (BFA) therefore reveals the degree of convergence of items towards a single underlying dimension. Furthermore, BFA allows quantification of the degree of convergence of items onto the general factor versus onto their specific subconstruct factor. In contrast, the multi-dimensional measurement model provides a different parametrization, with items depending only on its specific subscale factor and random measurement error, whereas items across subscale factors are related only through parameters describing between-factor correlations. The multi-dimensional measurement model is therefore well-suited to address the sufficiency of convergence and divergence of factors. Sufficient item-on-subscale convergence was evaluated as standardized factor loadings > 0.4. Sufficient divergence among subscales was evaluated as inter-factor correlations < 0.8 (

      Gana K. and Broc G. Structural equation modeling with lavaan. 2019.

      ).
      For comparison with previous studies, we also investigated uni-dimensional (1D) models, ignoring the three-subscale structure of MFIS (
      • Whitehead L.
      The measurement of fatigue in chronic illness: a systematic review of unidimensional and multidimensional fatigue measures.
      ), and two-dimensional (2D) models. In the 2D models, the Psychosocial subscale was combined with the Physical subscale into a single subcomponent following the findings of
      • Kos D.
      • Kerckhofs E.
      • Carrea I.
      • et al.
      Evaluation of the Modified Fatigue Impact Scale in four different European countries.
      . Finally, in the factor analytic context, we also investigated a two-layered structural equation model (3D2L), where the Psychosocial subscale is modelled to be secondary to (i.e. depending on) the Physical subscale, as the latter is more likely to be a primary source of fatigue and as physical and psychosocial fatigue factors have been shown to be associated (
      • Kluger B.M.
      • Krupp L.B.
      • Enoka R.M.
      Fatigue and fatigability in neurologic illnesses: proposal for a unified taxonomy.
      ;
      • Trojan D.A.
      • Arnold D.
      • Collet J.P.
      • et al.
      Fatigue in multiple sclerosis: association with disease-related, behavioural and psychosocial factors.
      ).
      In all factor analytic analyses in the current study, we employed the DWLS estimator and constrained the two Psychosocial items (items 8 and 9) to share the same variance to ensure identifiability. Due to the polytomous nature of the MFIS scale, Rasch analyses were carried out using the Rating Scale Model (RSM) and the Partial Credit Model (PCM). The PCM is a generalization of the RSM by including specific item-category parameters for each item, thereby allowing items to have differing category metrics, i.e. each item and item category has its own difficulty parameter.
      From the BFA output we calculated Mc Donald's ωh, the Explained Common Variance (ECV), and Mc Donald's ωt. Mc Donald's ωh gives the proportion of variance in observed scores accounted for by a single general factor and, hence, is indicative of unidimensionality if high (
      • Mair P.
      Modern Psychometrics With R.
      ). The ECV is the ratio of the general factor eigen value to the sum of all factor eigen values and hence indicates the degree of uni-dimensionality versus substructure. Mc Donald's ωt, on the other hand, is the proportion of variance in scores due to all factors and, hence, expresses the expected reliability of the scale.

      2.7 Fit indices

      The following global fit indices were used in all analyses with abbreviations and adequacy thresholds in parentheses: Comparative Fit Index (CFI, good fit: 0.90 ≤ CFI ≤ 0.95, very good fit: ≥ 0.95), Tucker-Lewis Index (TFI, good fit: 0.90 ≤ CFI ≤ 0.95, very good fit: ≥ 0.95), Standardized Root Mean Squared Residuals (SRMSR, ≤ 0.08), and the Root Mean Squared Error of Approximation (RMSEA, ≤ 0.05, and with RMSEA 90% upper confidence limits 90%-UCL ≤ 0.1) (

      Gana K. and Broc G. Structural equation modeling with lavaan. 2019.

      ). Adequacy thresholds for validation followed (

      Gana K. and Broc G. Structural equation modeling with lavaan. 2019.

      ).
      Group invariance and cross-cultural validity – A psychometric scale should be consistent across varying measurement conditions as for example gender, geography and cultural background. Here, we addressed group invariance among gender and among the two Danish MS hospitals located in Ry (Jutland) and Haslev (Sealand) using Andersen's Likelihood Ratio test of group invariance on the uni-dimensional PCM models for the Cognitive and Physical MFIS subscales. Furthermore, we addressed cross-cultural validity using the Tucker index of factor congruence comparing PCA loadings reported by
      • Kos D.
      • Kerckhofs E.
      • Carrea I.
      • et al.
      Evaluation of the Modified Fatigue Impact Scale in four different European countries.
      for a sample across four different European countries (Belgium, Italy, Slovenia, Spain) with PCA loadings for our Danish sample population, with PC loadings calculated according to
      • Kos D.
      • Kerckhofs E.
      • Carrea I.
      • et al.
      Evaluation of the Modified Fatigue Impact Scale in four different European countries.
      . We also report the RV coefficient of overall matrix correlation.
      Nomological validity – We investigated the nomological net of MFIS by performing an Exploratory Factor Analysis (EFA) including the six subscale scores of the Functional Assessment in Multiple Sclerosis scale (FAMS, ref.), Beck's Depression Inventory II (BDI) and physical performance as measured by the 6 min walking test (6MWT) along with the MFIS Cognitive, Physical and Psychosocial subscale scores.

      2.8 Responsiveness

      For a psychometric scale to be valuable, it must be able to capture hypothesized effects of relevant size in the population (
      • Mokkink L.B.
      • Terwee C.B.
      • Patrick D.L.
      • et al.
      The COSMIN checklist for assessing the methodological quality of studies on measurement properties of health status measurement instruments: an international Delphi study.
      ;
      • Hobart J.C.
      • Riazi A.
      • Lamping D.L.
      • et al.
      Improving the evaluation of therapeutic interventions in multiple sclerosis: development of a patient-based measure of outcome.
      ). Here, we investigated the responsiveness of the MFIS scale and its subscales as Cohen's effect size d for changes from baseline to discharge for patients in the thinking/fatigue-targeted patient subset (n = 181) following one month of fatigue-targeted MDR. When assessing the responsiveness, the intervention must be assumed to be effective, why this fatigue-specific subgroup of patients was chosen for this analysis. See Boesen et al. for more details on the MDR intervention (
      • Boesen F.
      • Norgaard M.
      • Trenel P.
      • et al.
      Longer term effectiveness of inpatient multidisciplinary rehabilitation on health-related quality of life in MS patients: a pragmatic randomized controlled trial - the Danish MS Hospitals Rehabilitation Study.
      ). Effect sizes will be interpreted according to the recent empirically based guidelines for rehabilitation research, with effect sizes of 0.14–0.30 being considered small, effect sizes of 0.31–0.60 moderate, and effect sizes ≥ 0.61 being large (
      • Kinney A.R.
      • Eakman A.M.
      • Graham J.E.
      Novel effect size interpretation guidelines and an evaluation of statistical power in rehabilitation research.
      ).

      3. Results

      Characteristics of the Danish study population are shown in Table 2. The thinking/fatigue-targeted patient subset used for the responsiveness evaluation differed from the main study population by higher MFIS at baseline, slightly higher proportion of younger patients, shorter disease duration, higher proportion of relapsing remitting MS and patients with EDSS scores < 3.5, and longer 6MWT distances, see Table 2.
      Table 2Characteristics of the study population.
      All (n = 424)Thinking/fatigue-targeted subpopulation
      This subpopulation was used to investigate the responsiveness of MFIS as mean marginal changes in MFIS from baseline to discharge after one month of multidisciplinary rehabilitation.
      (n = 182)
      n (%)
      Missing data10 (2%)1 (1%)
      Gender: women290 (68%)132 (73%)
      Age group: 40 years or less66 (16%)37 (20%)
      41–50 years137 (32%)55 (30%)
      51–60 years170 (40%)73 (40%)
      over 60 years51 (12%)17 (9%)
      Disease duration: 5 years or less62 (15%)29 (16%)
      6–10 years86 (20%)45 (25%)
      11–20 years148 (35%)62 (34%)
      over 20 years128 (30%)46 (25%)
      Type of MS: relapsing remitting174 (41%)93 (51%)
      secondary progressive180 (42%)67 (37%)
      primary progressive70 (17%)22 (12%)
      EDSS: EDSS < 3.5124 (29%)70 (38%)
      EDSS 3.5–5.5140 (33%)61 (34%)
      EDSS 5.0–6.5145 (34%)48 (26%)
      EDSS > 6.515 (4%)3 (2%)
      Work situation full-time15 (4%)7 (4%)
      part-time80 (19%)40 (22%)
      unemployed51 (12%)30 (17%)
      retired/early retirement261 (63%)101 (56%)
      otherwise5 (1%)3 (2%)
      Current or former smoker Yes286 (69%)129 (71%)
      Marital status married288 (70%)133 (73%)
      median (Q1, Q3)
      MFIS41 (30–50)43 (33–50)
      BMI24 (22–27)24 (23–28)
      BDI11 (6–17)11 (7–16)
      FAMS114 (96–133)114 (98–130)
      6MWT239 (107–356)270 (119–315)
      a This subpopulation was used to investigate the responsiveness of MFIS as mean marginal changes in MFIS from baseline to discharge after one month of multidisciplinary rehabilitation.

      3.1 Data quality and scale acceptability

      Ten out of the 424 patients (2.4%) did not return the MFIS questionnaire at baseline. Scale coverages for MFIS and its subscales were high, ranging from 88.1% (total MFIS) to 100%, and floor and ceiling effects were found to be small (<8% of the sample). See Table S1 in the supplementary material for full results on data quality and scale acceptability.

      3.2 Local independence, monotonicity, multivariate normality and tau-equivalence

      All MFIS items passed the Mokken scale monotonicity and local independence checks (see supplementary Table S2). The assumption of multivariate normality was rejected for both the total MFIS scale and the Cognitive and Physical subscales, as indicated by Mardia tests for skewness and kurtosis (all, p < 0.001) (see supplementary Table S1). Hence, the DWLS estimator was applied in all factor-analytic analyses. CFA models addressing tau-equivalence rejected both the parallel items model (LRT, p < 0.001; ECVI = 2.98) and the tau-equivalence model (LRT, p < 0.001; ECVI = 2.87), when compared to the congeneric model (ECVI = 1.44). Model fit indices of the congeneric model indicated that this model is an adequate representation of the data (CFI = 0.98, TLI = 0.97, RMSEA = 0.06, RMSEA 90%-UCL = 0.07, SRMR = 0.08) and is superior to the parallel item or tau-equivalence version of the model, see Table 3. This indicates that split-half reliability (requiring the parallel condition) and Cronbach's alpha (requiring tau-equivalence) are inadequate estimators of global test reliability for the MFIS scale, and, hence, McDonald's omega (ω) was calculated in the present study.
      Table 3Model fit statistics. Model fit sufficiency is indicated with a good fit shown in bold and a very good fit shown in bold italics.
      ModelTLICFIRMSEAUpper 95%- CLSRMR
      CFA models
       1D1L0.860.870.150.150.15
       2D1L0.970.970.070.080.08
       3D1L, parallel0.930.930.100.110.12
       3D1L, tau-equivalent0.930.930.100.110.12
       3D1L, congeneric0.970.980.060.070.08
       Bi-factor 3D0.990.990.030.040.05
       3D2L0.970.970.070.080.08
      IRT models
       Rasch models
        1D RSM0.690.610.230.24n.a.
         1D RSMCogn0.960.930.130.14n.a.
         1D RSMPhys0.940.910.110.12n.a.
         1D RSMCogn5
      : Scales suggested by Mills et al.14.
      0.980.950.110.13n.a.
         1D RSMPhys8
      : Scales suggested by Mills et al.14.
      0.960.930.100.11n.a.
        1D PCM0.620.620.260.260.18
        2D PCM0.930.930.110.120.11
        3D PCM0.930.930.110.120.10
        Bi-factor 3D PCM0.920.920.110.120.11
        3D GRM0.960.960.080.090.10
      CFA: confirmatory factor analysis.
      1D: one-dimensional model of MFIS.
      2D: two-dimensional measurement model of MFIS with the Psychosocial subscale combined with the Physical subscale.
      3D: three-dimensional measurement model of MFIS with Cognitive, Physical and Psychosocial subscales.
      1L: Single-layered measurement model, with no hierarchical structure among sub-components.
      2L: Two-layered structural model, with the Psychosocial sub-component secondary to the Physical subcomponent.
      IRT: Item response theory.
      RSM: Rating Scale Model; PCM: Partial Credit Model; GRM: Graded Response Model.
      TLI: Tucker-Lewis Index (TFI, good fit: 0.90 ≤ CFI ≤ 0.95, very good fit: ≥ 0.95).
      CFI: Comparative Fit Index (CFI, good fit: 0.90 ≤ CFI ≤ 0.95, very good fit: ≥ 0.95).
      RMSEA: Root Mean Squared Error of Approximation (RMSEA, ≤ 0.05, and with RMSEA 90% upper confidence limits 90%-UCL ≤ 0.1).
      SRMR: Standardized Root Mean Squared Residuals (SRMSR, ≤ 0.08).
      a : Scales suggested by Mills et al.14.

      3.3 Structural validity and reliability

      MFIS in its original form did show poor fit to one-dimensional models that ignore the subscale structure of MFIS regardless of the model framework (Table 3). However, when acknowledging the subscale structure of MFIS, model fit indices improved to acceptable fits. This was the case for both bi-factor models and 3-dimensional measurement models in the factor-analytic and Rasch-analytic context (Table 3 and Table 4). Rasch-analytic models showed generally poorer fits in line with their higher scale standards according to theory. Standardized loadings are shown in Table 5 and Fig. 1. The bi-factor models supported uni-dimensionality of MFIS as indicated by Mc Donald's ωh = 0.72, however, with considerable substructure present (ECV = 0.47), as also indicated by the eigenvalues of the general and specific factors Cognitive, Physical and Psychosocial were 6.48, 4.81, 2.24, and 0.22, respectively. Reliability of the MFIS scale was found to be high as indicated by a Mc Donald's ωt of 0.96. All items showed loadings onto the general factor greater than 0.30, but not all items showed loadings onto their specific factor exceeding the 0.30 threshold (items 1, 8, 14, and 21; see Table 5). The Psychosocial subscale was poorly supported in the bi-factor model, with its two sole items (8 and 9) showing considerable larger loadings onto the general factor than onto their specific factor (Table 5). Item 8 was even found to be uncorrelated with the Psychosocial subscale conditional on its loading onto the general factor.
      Table 4Item fit statistics for multidimensional and unidimensional Rasch analyses. Infit and outfit mean squared (MS) values between 0.5 and 1.5 are considered satisfactory for measurement (
      • Linacre J.M.
      What do infit and outfit, mean-square and standardized mean.
      ). Deviating items are shown in bold.
      Multidimensional Rasch modelUnidimensional Rasch model
      3D RSM3D PCMBi-factor 3D PCM1D PCMCogn
      Infit MSOutfit MSInfit MSOutfit MSInfit MSOutfit MSInfit MSOutfit MSDiscrimination
      Cogn.01Alertness0.710.691.611.631.501.501.621.650.47
      02Attention0.900.910.820.830.780.800.750.770.82
      03Clear.thinking0.830.810.960.970.930.950.930.950.75
      05Forgetfulness0.950.941.141.131.151.150.960.960.75
      11Making.decisions0.810.820.950.960.920.950.790.810.82
      12Motiv.mental.effort0.850.880.920.960.910.960.560.560.88
      15Finish.tasks0.730.750.670.670.620.620.840.820.69
      16Organize.thoughts0.760.780.70.70.650.650.790.800.73
      18Slow.thinking0.970.990.830.840.80.831.391.390.47
      19Concentration0.790.800.670.660.610.610.610.600.82
      1D PCMPhys
      Phys.04Coordination1.281.241.181.181.261.241.241.190.69
      06Pace.phys.act0.850.860.740.740.70.710.940.970.76
      07Motiv.phys.effort0.760.770.860.850.830.820.640.580.87
      10Maintain.phys.effort1.181.190.800.810.830.840.620.620.87
      13Muscle.weakness1.321.280.940.941.010.991.291.240.47
      14Phys.uncomfortable1.301.321.261.281.321.340.670.680.77
      17Complete.phys.tasks0.890.900.660.660.640.641.000.950.64
      20Limit.phys.act1.051.050.740.750.730.750.710.710.76
      21Have.rest1.021.030.930.940.920.940.970.970.64
      PsSoc.08Motiv.social.act0.830.840.860.840.540.54n.a.n.a.n.a.
      09Outdoor.act1.171.150.790.780.570.58n.a.n.a.n.a.
      3D: three-dimensional MFIS model with Cognitive, Physical and Psychosocial subcomponents.
      RSM: Rating Scale Model; PCM: Partial Credit Model.
      Table 5Results of confirmatory factor analyses (CFA) including 3-dimensional (3D) bi-factor analysis, 3D one-layered (3D1L) and 3D two-layered (3D2L) CFA. Item loadings < 0.40 are indicated in bold.
      Bi-factor 3D3D1L3D2L
      SubscaleItemStd. loadings (95%-CI)R2Std. loadings (95%-CI)R2Std. loadings (95%-CI)R2
      General factor gSpecific factor
      Cogn.01Alertness0.64 (0.56,0.71)0.19 (0.12,0.27)0.600.62 (0.55,0.68)0.380.62 (0.56,0.68)0.38
      02Attention0.42 (0.36,0.48)0.71 (0.64,0.79)0.580.81 (0.75,0.87)0.660.81 (0.75,0.87)0.66
      03Clear.thinking0.43 (0.37,0.49)0.63 (0.55,0.70)0.440.76 (0.70,0.82)0.570.76 (0.70,0.82)0.58
      05Forgetfulness0.36 (0.30,0.42)0.60 (0.53,0.68)0.690.69 (0.63,0.74)0.470.69 (0.63,0.74)0.47
      11Making.decisions0.48 (0.42,0.54)0.60 (0.53,0.67)0.580.78 (0.72,0.84)0.610.78 (0.72,0.84)0.61
      12Motiv.mental.effort0.44 (0.38,0.50)0.62 (0.55,0.70)0.490.76 (0.71,0.82)0.580.76 (0.71,0.82)0.58
      15Finish.tasks0.47 (0.41,0.53)0.73 (0.66,0.81)0.590.86 (0.80,0.93)0.740.86 (0.80,0.92)0.74
      16Organize.thoughts0.50 (0.44,0.56)0.71 (0.63,0.78)0.580.87 (0.81,0.93)0.760.87 (0.81,0.93)0.76
      18Slow.thinking0.42 (0.36,0.48)0.71 (0.64,0.79)0.760.81 (0.75,0.87)0.650.81 (0.75,0.87)0.66
      19Concentration0.44 (0.38,0.50)0.79 (0.71,0.87)0.750.87 (0.81,0.93)0.760.87 (0.81,0.93)0.76
      Phys.04Coordination0.35 (0.29,0.40)0.34 (0.25,0.44)0.580.48 (0.43,0.53)0.230.47 (0.42,0.52)0.22
      06Pace.phys.act0.54 (0.47,0.60)0.58 (0.47,0.69)0.650.75 (0.68,0.81)0.560.73 (0.66,0.79)0.53
      07Motiv.phys.effort0.64 (0.57,0.71)0.38 (0.28,0.48)0.240.76 (0.70,0.83)0.580.75 (0.68,0.81)0.56
      10Maintain.phys.effort0.45 (0.39,0.51)0.63 (0.52,0.74)0.620.68 (0.62,0.74)0.460.66 (0.60,0.71)0.43
      13Muscle.weakness0.36 (0.30,0.42)0.55 (0.44,0.65)0.550.57 (0.52,0.63)0.330.55 (0.50,0.61)0.31
      14Phys.uncomfortable0.56 (0.50,0.62)0.12 (0.03,0.21)0.600.57 (0.52,0.62)0.320.56 (0.51,0.61)0.32
      17Complete.phys.tasks0.55 (0.49,0.62)0.65 (0.53,0.76)0.430.78 (0.72,0.85)0.610.76 (0.70,0.82)0.58
      20Limit.phys.act0.55 (0.48,0.61)0.54 (0.43,0.65)0.330.74 (0.68,0.81)0.550.72 (0.66,0.79)0.52
      21Have.rest0.74 (0.67,0.81)0.19 (0.09,0.29)0.720.78 (0.71,0.84)0.600.76 (0.70,0.82)0.58
      PsSoc.08Motiv.social.act0.76 (0.70,0.83)0.00 (-0.28,0.27)0.690.76 (0.66,0.85)0.570.71 (0.63,0.79)0.51
      09Outdoor.act0.69 (0.63,0.75)0.42 (0.24,0.60)0.810.69 (0.61,0.77)0.480.74 (0.67,0.82)0.55
      Fig 1
      Fig 1The bi-factor 3-dimensional model (3D), the 3D one-layered measurement model (3D1L), and the 3D two-layered (3D2L) structural equation model of MFIS. All estimates are standardized estimates (correlations). Manifest standard deviations are not shown.
      Fit indices for the 3-dimensional measurement model were comparable to the bi-factor model (Table 3). All items showed standardized loadings onto their specific factor that were larger than 0.4, indicating sufficient convergence of items onto their factors (Fig. 1, Table 5). This was also found for the two items 8 and 9 of the Psychosocial subscale. Inter-subscale correlations and their 95%-CI were estimated to be 0.44 (0.42,0.47), 0.81 (0.72,0.9), and 0.57 (0.5,0.64) for Physical-Cognitive, Physical-Psychosocial, and Cognitive-Psychosocial correlations, respectively. This indicates sufficient discrimination between the Physical and Cognitive as well as between the Cognitive and Psychosocial subscales, but not between the Physical and Psychosocial subscales. AVE estimates for Cognitive, Physical, and Psychosocial subscales were found to be 0.63, 0.47, and 0.52, respectively, with AVE = 0.55 for the total scale. This indicates that items in the Physical subscale on average are marginally insufficient representatives of the Physical subconstruct, when taking measurement error into account. This is due to items 4, 13, and 14, that all showed R2 < 0.40. Mc Donald's ωt reliability estimates were 0.94, 0.89, and 0.68 for the Cognitive, Physical, and Psychosocial subscales, respectively, compared to a ωt = 0.95 for the total scale.
      The 3D two-layered structural model showed fit indices comparable to the 3D bi-factor and congeneric 3D one-layered measurement models (Table 3), and standard loadings comparable to the congeneric 3D one-layered model (Table 5).

      3.4 Group invariance and cross-cultural validity

      Results on group invariance within the current study population revealed no differences in group comparisons between hospitalization locality (Ry vs. Haslev) in 1D PCM models of the Cognitive and Physical subdimensions but did reveal a difference between males and females for the 1D PCM model of the Physical subdimension (see supplementary Table S3). The Tucker index of factor congruence between PCA loadings reported by
      • Kos D.
      • Kerckhofs E.
      • Carrea I.
      • et al.
      Evaluation of the Modified Fatigue Impact Scale in four different European countries.
      ) in a multinational European cohort and PCA loadings from the current Danish study sample revealed a high degree of factor congruence between the cross-cultural loadings in the Cognitive and Physical subdomains, respectively (Tucker index: Cognitive, 0.97, and Physical, 0.93), but very limited congruence was observed for the Psychosocial subdomain (Tucker index: 0.25). The overall RV coefficient of matrix correlations was estimated to be 0.81 (p< 0.001).

      3.5 Nomological and criterion validity

      Fatigue, measured by the MFIS total score, was found to be unrelated to age, time since diagnosis, and EDSS score, see supplementary Table S4. Exploratory factor analysis revealed an apparently larger nomological network for cognitive fatigue than the physical subdomain of fatigue, as both FAMS Thinking/Fatigue, BDI, and 6MWT loads into the latent factor associated with MFIS Cognitive whereas only the FAMS Mobility and 6MWT performance loads into the latent factor associated with MFIS Physical (Table 6).
      Table 6Exploratory Factor Analysis loadings of MFIS subscales and its nomological net including Health Related Quality of Life (HRQoL) measured by FAMS subscales, Beck's Depression Inventory II (BDI) and physical performance as measured by the 6 min walking test (6MWT). Factor loadings < 0.20 are not shown.
      Latent factor
      F1 CognitiveF2 EmotionalF3 Physical
      MFISCognitive0.95
      MFISPhysical0.42−0.56
      MFISPsychosocial0.39−0.41
      FAMSMobility0.98
      FAMSSymptoms−0.39
      FAMSEmotional wellbeing0.86
      FAMSGeneral contentment0.88
      FAMSThinking/fatigue−0.84
      FAMSFamily/social wellbeing0.6
      BDI0.31−0.49
      6MWT0.270.52
      Proportion of variance explained0.210.20.16
      Cumulative variance explained0.210.40.56

      3.6 Responsiveness

      The MFIS showed overall good responsiveness, as large effect sizes were found on the total score and in the physical subdomain and moderate effect sizes were found for the cognitive and psychosocial subdomains following one month of MDR (Table 7).
      Table 7Cohen's effect sizes d for the MFIS total scale and the cognitive, physical, and psychosocial subscales after one month of MDR.
      ScaleCohen's d95% CIp-value
      MFIS−0.88−1.02−0.73<0.0001
      MFISCognitive−0.60−0.72−0.48<0.0001
      MFISPhysical−0.90−1.05−0.74<0.0001
      MFISPsychosocial−0.48−0.64−0.33<0.0001

      4. Discussion

      Previous validations of the MFIS have applied different validation methodologies and have reached different conclusions on the validity of the scale (
      • Kos D.
      • Kerckhofs E.
      • Carrea I.
      • et al.
      Evaluation of the Modified Fatigue Impact Scale in four different European countries.
      ;
      • Mills R.J.
      • Young C.A.
      • Pallant J.F.
      • et al.
      Rasch analysis of the Modified Fatigue Impact Scale (MFIS) in multiple sclerosis.
      ;
      • Amtmann D.
      • Bamer A.M.
      • Noonan V.
      • et al.
      Comparison of the psychometric properties of two fatigue scales in multiple sclerosis.
      ;
      • Bakalidou D.
      • Voumvourakis K.
      • Tsourti Z.
      • et al.
      Validity and reliability of the Greek version of the Modified Fatigue Impact Scale in multiple sclerosis patients.
      ). The present study provide the first ever validation of the MFIS following the COSMIN checklist, ensuring a standardized, modern, and comprehensive multimethodological evaluation of the measurement properties of the scale. As this validation is based on the Danish translation of the MFIS, the present study also serves as the first validation of the Danish version of the MFIS.
      The present study found that both 3D one-layered measurement, two-layered structural, and bi-factor CFA models showed acceptable fits of the current MFIS data. The fact that the two-layered structural modeling of the data (psychosocial domain modelled as a secondary structure to the cognitive and physical domains) shows an equally good fit of the data as the one-layered measurement model underline the general finding of a limited benefit of the psychosocial subscale. However, the question whether the perception of psychosocial fatigue is secondary to cognitive and physical fatigue remains unanswered. Moreover, eigenvalues of general and specific factors indicated a structure of a dominant general factor, with underlying specific factors. Generally, all items loaded satisfactory onto the general factor, but the specific psychosocial subscale was poorly supported as its two items showed larger loading onto the general rather than their specific factor. These findings are well in line with the findings of Kos et al. in the multinational European validation of the MFIS and Bakalidou et al. in the Greek validation (
      • Kos D.
      • Kerckhofs E.
      • Carrea I.
      • et al.
      Evaluation of the Modified Fatigue Impact Scale in four different European countries.
      ;
      • Bakalidou D.
      • Voumvourakis K.
      • Tsourti Z.
      • et al.
      Validity and reliability of the Greek version of the Modified Fatigue Impact Scale in multiple sclerosis patients.
      ). Also aligned with previous findings, the reliability of the total score and the cognitive and physical subscales of the MFIS showed good reliability in the current study, whereas the psychosocial subscale did not (
      • Kos D.
      • Kerckhofs E.
      • Carrea I.
      • et al.
      Evaluation of the Modified Fatigue Impact Scale in four different European countries.
      ;
      • Larson R.D.
      Psychometric properties of the modified fatigue impact scale.
      ). The low reliability of the Psychosocial subscales is probably related to only two items constituting the subscale. Overall, the Psychosocial subscale shows low reliability, too high correlation to the Physical subscale, and its items are more related to a general factor than to their own subscale factor. All this indicates that the Psychosocial subscale is poorly defined and cannot be recommended to be used separately. However, the items may still be informative for assessment of the general underlying factor in MFIS as indicated by the BFA.
      One previous study evaluated the MFIS in MS using Rasch analysis, and Mills et al. found that the MFIS did not fit the unidimensional Rasch model applied and concluded that the summed score of MFIS is invalid because of multidimensionality (
      • Mills R.J.
      • Young C.A.
      • Pallant J.F.
      • et al.
      Rasch analysis of the Modified Fatigue Impact Scale (MFIS) in multiple sclerosis.
      ). The authors did find an underlying structure with cognitive and physical subdomains, but was forced to remove items 1, 2, 3, 4, 5, 11, 14, and 17 in order to obtain acceptable model fits. Instead of discarding the MFIS total score based on a unidimensional model we applied multidimensional Rasch analyses, relaxing the assumption of unidimensionality. Using this approach, we found acceptable fits to the models of the total score as well as the cognitive and physical subscales, indicating that these are valid measures of fatigue impact. We also found that correlations between total sum scores and corresponding predicted latent factor values were high for both the CFA models and the more restrictive Rasch models, indicating that the degree of misfit to an ideal Rasch scale influenced MFIS sum scores only to a limited degree in the current data. Cross-cultural PCA loadings revealed a high degree of factor congruence in the cognitive and physical subdomains, but not the psychosocial domain, again adding to the overall validity of the cognitive and physical subscales but also underlining the limited validity of the psychosocial subscale.
      In the assessment of the validity of an instrument, it is also important to consider the nomological network of the construct, and to evaluate theoretically sound associations to relevant other constructs (
      • Larson R.D.
      Psychometric properties of the modified fatigue impact scale.
      ). The findings of relevant loadings of FAMS subdimensions, BDI, and 6MWT onto the latent factor associated with MFIS Cognitive and FAMS Mobility and 6MWT performance onto the latent factor of MFIS physical are confirmative of the nomological validity of MFIS, and also replicate previous findings (
      • Larson R.D.
      Psychometric properties of the modified fatigue impact scale.
      ;
      • Tellez N.
      • Rio J.
      • Tintore M.
      • et al.
      Does the Modified Fatigue Impact Scale offer a more comprehensive assessment of fatigue in MS?.
      ). However, this should also be confirmed by associations to other more fatigue-specific validated constructs (i.e. the Fatigue Severity Scale and/or objective measures of fatigability) in future studies.
      The MFIS has been reported to have limited responsiveness to changes in patients with MS (
      • Rietberg M.B.
      • Van Wegen E.E.
      • Kwakkel G.
      Measuring fatigue in patients with multiple sclerosis: reproducibility, responsiveness and concurrent validity of three Dutch self-report questionnaires.
      ), but important for the use in clinical trials, the MFIS showed good responsiveness over time following a 1-month MDR intervention in the current study.

      5. Conclusion

      The Danish version of the MFIS showed good reliability, a good structural, cross-cultural, and nomological validity with a general and underlying structure including cognitive and physical subscales, an acceptable fit to a multidimensional Rasch analysis, and a good responsiveness. The psychosocial subscale of the MFIS however showed low reliability and low factor loadings and on this basis we do not recommend to interpret the scores of this subscale independently.

      CRediT authorship contribution statement

      Morten Riemenschneider: Conceptualization, Methodology, Validation, Writing – original draft, Writing – review & editing. Philipp Trénel: Conceptualization, Methodology, Data curation, Formal analysis, Software, Validation, Writing – review & editing. Michael Nørgaard: Conceptualization, Data curation, Writing – review & editing. Finn Boesen: Conceptualization, Supervision, Writing – review & editing.

      Declaration of Competing Interest

      The authors have no conflicts of interests associated with the present study.

      Funding

      The present study was not financed by any external funds.

      Appendix. Supplementary materials

      References

        • Amato M.P.
        • Ponziani G.
        • Rossi F.
        • et al.
        Quality of life in multiple sclerosis: the impact of depression, fatigue and disability.
        Mult. Scler. 2001; 7: 340-344
        • Amtmann D.
        • Bamer A.M.
        • Noonan V.
        • et al.
        Comparison of the psychometric properties of two fatigue scales in multiple sclerosis.
        Rehabil. Psychol. 2012; 57 (2012/06/13): 159-166https://doi.org/10.1037/a0027890
        • Bakalidou D.
        • Voumvourakis K.
        • Tsourti Z.
        • et al.
        Validity and reliability of the Greek version of the Modified Fatigue Impact Scale in multiple sclerosis patients.
        Int. J. Rehabil. Res. 2014; 37 (2014/02/22): 271-276https://doi.org/10.1097/MRR.0000000000000057
        • Boateng G.O.
        • Neilands T.B.
        • Frongillo E.A.
        • et al.
        Best practices for developing and validating scales for health, social, and behavioral research: a primer.
        Front. Public Health. 2018; 6 (2018/06/27): 149https://doi.org/10.3389/fpubh.2018.00149
        • Boesen F.
        • Norgaard M.
        • Trenel P.
        • et al.
        Longer term effectiveness of inpatient multidisciplinary rehabilitation on health-related quality of life in MS patients: a pragmatic randomized controlled trial - the Danish MS Hospitals Rehabilitation Study.
        Mult. Scler. 2018; 24 (2017/10/07): 340-349https://doi.org/10.1177/1352458517735188
        • Callesen J.
        • Cattaneo D.
        • Brincks J.
        • et al.
        How do resistance training and balance and motor control training affect gait performance and fatigue impact in people with multiple sclerosis? A randomized controlled multi-center study.
        Mult. Scler. 2020; 26 (2019/07/25): 1420-1432https://doi.org/10.1177/1352458519865740
        • Chalmers R.P.
        Mirt: a multidimensional item response theory package for the R environment.
        J. Stat. Softw. 2012; 48: 1-29https://doi.org/10.18637/jss.v048.i06
        • Dima A.L.
        Scale validation in applied health research: tutorial for a 6-step R-based psychometrics protocol.
        Health Psychol. Behav. Med. 2018; 6 (2018/05/10): 136-161https://doi.org/10.1080/21642850.2018.1472602
        • Filippi M.
        • Bar-Or A.
        • Piehl F.
        • et al.
        Multiple sclerosis.
        Nat. Rev. Dis. Prim. 2018; 4 (2018/11/10): 43https://doi.org/10.1038/s41572-018-0041-4
        • Fisk J.D.
        • Ritvo P.G.
        • Ross L.
        • et al.
        Measuring the functional impact of fatigue: initial validation of the fatigue impact scale.
        Clin. Infectious Dis. 1994; 18: S79-S83
      1. Gana K. and Broc G. Structural equation modeling with lavaan. 2019.

        • Green R.
        • Cutter G.
        • Friendly M.
        • et al.
        Which symptoms contribute the most to patients' perception of health in multiple sclerosis?.
        Mult. Scler. J. Exp. Transl. Clin. 2017; 3 (2017/09/15)2055217317728301https://doi.org/10.1177/2055217317728301
        • Hobart J.C.
        • Riazi A.
        • Lamping D.L.
        • et al.
        Improving the evaluation of therapeutic interventions in multiple sclerosis: development of a patient-based measure of outcome.
        Health Technol. Assess. 2004; 8 (iii2004/02/26): 1-48https://doi.org/10.3310/hta8090
        • Johansson S.
        • Skjerbæk A.G.
        • Nørgaard M.
        • et al.
        Associations between fatigue impact and lifestyle factors in people with multiple sclerosis – the Danish MS hospitals rehabilitation study.
        Mult. Scler. Relat. Disord. 2021; 50102799https://doi.org/10.1016/j.msard.2021.102799
        • Kinney A.R.
        • Eakman A.M.
        • Graham J.E.
        Novel effect size interpretation guidelines and an evaluation of statistical power in rehabilitation research.
        Arch. Phys. Med. Rehabil. 2020; 101 (2020/04/10): 2219-2226https://doi.org/10.1016/j.apmr.2020.02.017
        • Kluger B.M.
        • Krupp L.B.
        • Enoka R.M.
        Fatigue and fatigability in neurologic illnesses: proposal for a unified taxonomy.
        Neurology. 2013; 80 ([doi]): 409-416https://doi.org/10.1212/WNL.0b013e31827f07be
        • Kos D.
        • Kerckhofs E.
        • Carrea I.
        • et al.
        Evaluation of the Modified Fatigue Impact Scale in four different European countries.
        Mult. Scler. 2005; 11 (2005/03/01): 76-80https://doi.org/10.1191/1352458505ms1117oa
        • Larson R.D.
        Psychometric properties of the modified fatigue impact scale.
        Int. J. MS Care. 2013; 15 (2014/01/24): 15-20https://doi.org/10.7224/1537-2073.2012-019
        • Linacre J.M.
        What do infit and outfit, mean-square and standardized mean.
        Rasch Measur. Trans. 2002; 16: 878
        • Mair P.
        Modern Psychometrics With R.
        New York: Springer, 2018
        • Mills R.J.
        • Young C.A.
        • Pallant J.F.
        • et al.
        Rasch analysis of the Modified Fatigue Impact Scale (MFIS) in multiple sclerosis.
        J. Neurol. Neurosurg. Psychiatry. 2010; 81 (2010/06/16): 1049-1051https://doi.org/10.1136/jnnp.2008.151340
        • Mokkink L.B.
        • Terwee C.B.
        • Patrick D.L.
        • et al.
        The COSMIN checklist for assessing the methodological quality of studies on measurement properties of health status measurement instruments: an international Delphi study.
        Qual. Life Res. 2010; 19 (2010/02/20): 539-549https://doi.org/10.1007/s11136-010-9606-8
      2. Guidelines MSCfCP. Fatigue and multiple sclerosis: evidence-based management strategies for fatigue in multiple sclerosis. 1998.

        • Oervik M.S.
        • Sejbaek T.
        • Penner I.K.
        • et al.
        Validation of the fatigue scale for motor and cognitive functions in a danish multiple sclerosis cohort.
        Mult. Scler. Relat. Disord. 2017; 17 (2017/10/23): 130-134https://doi.org/10.1016/j.msard.2017.07.017
        • psych W.R.
        Procedures For Psychological, Psychometric, and Personality Research.
        Northwestern University, Evanston, Illinois2020 (R package version 2.0.12)
        • Riemenschneider M.
        • Hvid L.G.
        • Ringgaard S.
        • et al.
        Study protocol: randomised controlled trial evaluating exercise therapy as a supplemental treatment strategy in early multiple sclerosis: the early multiple sclerosis exercise study (EMSES).
        BMJ Open. 2021; 11 (2021/01/14)e043699https://doi.org/10.1136/bmjopen-2020-043699
        • Rietberg M.B.
        • Van Wegen E.E.
        • Kwakkel G.
        Measuring fatigue in patients with multiple sclerosis: reproducibility, responsiveness and concurrent validity of three Dutch self-report questionnaires.
        Disabil. Rehabil. 2010; 32 (2010/03/30): 1870-1876https://doi.org/10.3109/09638281003734458
        • Rommer P.S.
        • Eichstadt K.
        • Ellenberger D.
        • et al.
        Symptomatology and symptomatic treatment in multiple sclerosis: results from a nationwide MS registry.
        Mult. Scler. 2019; 25 (2018/09/20): 1641-1652https://doi.org/10.1177/1352458518799580
        • Rosseel Y.
        lavaan: an R Package for Structural Equation Modeling.
        J. Stat. Softw. 2012; 48: 1-36https://doi.org/10.18637/jss.v048.i02
        • Squara P.
        • Scheeren T.W.L.
        • Aya H.D.
        • et al.
        Metrology part 1: definition of quality criteria.
        J. Clin. Monit. Comput. 2021; 35 (2020/03/19): 17-25https://doi.org/10.1007/s10877-020-00494-y
        • Squara P.
        • Scheeren T.W.L.
        • Aya H.D.
        • et al.
        Metrology part 2: procedures for the validation of major measurement quality criteria and measuring instrument properties.
        J. Clin. Monit. Comput. 2021; 35 (2020/03/19): 27-37https://doi.org/10.1007/s10877-020-00495-x
        • Taul-Madsen L.
        • Dalgas U.
        • Kjolhede T.
        • et al.
        A head-to-head comparison of an isometric and a concentric fatigability protocol and the association with fatigue and walking in persons with multiple sclerosis.
        Neurorehabil. Neural Repair. 2020; 34 (2020/05/13): 523-532https://doi.org/10.1177/1545968320920250
        • Tellez N.
        • Rio J.
        • Tintore M.
        • et al.
        Does the Modified Fatigue Impact Scale offer a more comprehensive assessment of fatigue in MS?.
        Mult. Scler. 2005; 11 (2005/03/30): 198-202https://doi.org/10.1191/1352458505ms1148oa
        • Thrue C.
        • Riemenschneider M.
        • Hvid L.G.
        • et al.
        Time matters: early-phase multiple sclerosis is accompanied by considerable impairments across multiple domains.
        Mult. Scler. 2020; (13524585209362312020/07/03)https://doi.org/10.1177/1352458520936231
        • Trojan D.A.
        • Arnold D.
        • Collet J.P.
        • et al.
        Fatigue in multiple sclerosis: association with disease-related, behavioural and psychosocial factors.
        Mult. Scler. 2007; 13 (2007/05/01): 985-995https://doi.org/10.1177/1352458507077175
        • Whitehead L.
        The measurement of fatigue in chronic illness: a systematic review of unidimensional and multidimensional fatigue measures.
        J. Pain Symptom Manag. 2009; 37 (2008/12/30): 107-128https://doi.org/10.1016/j.jpainsymman.2007.08.019