Advertisement
Review article| Volume 73, 104674, May 2023

Download started.

Ok

Smartphone monitoring of cognition in people with multiple sclerosis: A systematic review

Open AccessPublished:March 27, 2023DOI:https://doi.org/10.1016/j.msard.2023.104674

      Highlights

      • Smartphone applications are valid and reliable in assessing cognition in pwMS.
      • Smartphone versions of the symbol digit modalities test are most used.
      • Short and long-term practice effects are present.
      • Longitudinal data, predictive and ecological validity are lacking.

      Abstract

      Background

      Current cognitive monitoring of people with multiple sclerosis (pwMS) is sporadic, resource intensive and insensitive for detection of real-world cognitive performance and decline. Smartphone applications may provide us with a more sensitive biomarker for cognitive decline that reflects real-world performance. The goal of this study was to perform a systematic review and qualitative synthesis of all current smartphone apps monitoring cognition in pwMS.

      Methods

      A systematic search of five major online databases (PubMed/Medline, Scopus, Web of Science, Cumulative Index of Nursing and Allied Health Literature and IEEE Xplore) was performed in accordance with the Cochrane Handbook and Preferred Reporting Items for Systematic Reviews and Meta-Analysis (PRISMA) statement. We included all studies with at least one measure of phone-based digital biomarkers for monitoring cognition in pwMS above the age of 18.
      Two authors independently screened the articles retrieved. Data on test-retest reliability, validity coefficients, feasibility and practice effects were extracted from the studies identified. Critical appraisal of the studies was performed using the National Institute of Health quality assessment tool for observational cohort and cross-sectional studies.

      Results

      12 articles covering six smartphone apps were included in this review. All articles had a low risk of bias, though sample size calculation was rarely performed. Of the six apps, five used smartphone versions of the symbol digit modalities test. The final app examined keystroke features passively.
      Test-retest reliability ranged from good to excellent. Concurrent validity was demonstrated through moderate to strong correlation with neuropsychological tests and weak to moderate correlations with EDSS, radiological biomarkers and patient-reported outcomes. Mobile apps performed comparably, and in some cases outperformed established cognitive tests. Whilst reported acceptability was high, significant attrition rates were present in longitudinal cohorts. There were significant short and long-term practice effects.
      Overall, smartphone versions of the SDMT showed strong psychometric properties across multiple apps.

      Conclusion

      Smartphone applications are reliable and valid biomarkers of real-world cognition in pwMS. Further longitudinal data would allow for a better understanding of their predictive and ecological validity.

      Keywords

      Abbreviations:

      9HPT (9-hole peg test), BICAMS (brief international cognitive assessment for MS), BV (brain volume), BVMT-R (brief visuospatial memory test-revised), CCC (Lin's Concordance correlation coefficient), CI (cognitively impaired), CVLT (California verbal learning test), EDSS (expanded disability status scale), FLAIR (fluid attenuated inversion recovery), HC (healthy controls), HCP (healthcare professionals), ICC (intra-class correlation), MCID (minimum clinically important difference), MSFC (MS functional composite), MSIS-29 (MS impact scale), pwMS (people with MS), pwMSwoCI (people with MS without cognitive impairment), SDC (smallest detectable change), SDMT (symbol digit modalities test), T2LV (T2 lesion volume)

      1. Introduction

      Cognitive impairment is a common manifestation of multiple sclerosis (MS), affecting 35–65% of people with MS (pwMS) (
      • Benedict R.H.B.
      • Amato M.P.
      • DeLuca J.
      • Geurts J.J.G.
      Cognitive impairment in multiple sclerosis: clinical management, MRI, and therapeutic avenues.
      ). Implications include unemployment, impaired activities of daily living, carer stress and reduced social participation (
      • Kalb R.
      • Beier M.
      • Benedict R.H.B.
      • Charvet L.
      • Costello K.
      • Feinstein A.
      • Gingold J.
      • Goverover Y.
      • Halper J.
      • Harris C.
      • Kostich L.
      • Krupp L.
      • Lathi E.
      • LaRocca N.
      • Thrower B.
      • DeLuca J.
      Recommendations for cognitive screening and management in multiple sclerosis care.
      ;
      • Raggi A.
      • Covelli V.
      • Schiavolin S.
      • Scaratti C.
      • Leonardi M.
      • Willems M.
      Work-related problems in multiple sclerosis: a literature review on its associates and determinants.
      ;
      • Schiavolin S.
      • Leonardi M.
      • Giovannetti A.M.
      • Antozzi C.
      • Brambilla L.
      • Confalonieri P.
      • Mantegazza R.
      • Raggi A.
      Factors related to difficulties with employment in patients with multiple sclerosis: a review of 2002–2011 literature.
      ). However, it is not routinely evaluated and is often under-reported by pwMS (
      • Lysandropoulos A.
      • Havrdova E.
      ‘Hidden'factors influencing quality of life in patients with multiple sclerosis.
      ). Current neuropsychiatric batteries require significant time and trained staff to administer and interpret the results. Access to neuropsychologists is limited and a formal, comprehensive neuropsychological assessment can take several hours. Whilst brief, simple-to-administer cognitive tests such as the symbol digit modalities test (SDMT) and the Brief International Cognitive Assessment for MS (BICAMS) have been developed to reduce the demand on busy healthcare professionals (HCP) (
      • Goverover Y.
      • Chiaravalloti N.
      • DeLuca J.
      Brief International Cognitive Assessment for Multiple Sclerosis (BICAMS) and performance of everyday life tasks: actual reality.
      ;
      • Strober L.
      • DeLuca J.
      • Benedict R.H.B.
      • Jacobs A.
      • Cohen J.A.
      • Chiaravalloti N.
      • Hudson L.D.
      • Rudick R.A.
      • LaRocca N.G.
      Symbol digit modalities test: a valid clinical trial endpoint for measuring cognition in multiple sclerosis.
      ), they are challenging to perform routinely in the outpatient clinic setting. Testing usually takes place in a clinical setting instead of the home environment, which can underestimate the cognitive demands required in a dynamic, real-world setting (
      • Weber E.
      • Goverover Y.
      • DeLuca J.
      Beyond cognitive dysfunction: relevance of ecological validity of neuropsychological tests in multiple sclerosis.
      ). The current model also lacks the frequency required to detect subtle cognitive changes (
      • Lam K.H.
      • van Oirschot P.
      • den Teuling B.
      • Hulst H.E.
      • de Jong B.A.
      • Uitdehaag B.M.J.
      • de Groot V.
      • Killestein J.
      Reliability, construct and concurrent validity of a smartphone-based cognition test in multiple sclerosis.
      c).
      In response to this, computerization of neuropsychological testing has gathered pace in the past two decades. There are now several computerized neuropsychological assessment devices that are valid and reliable supplements to a full neuropsychological assessment (
      • Wojcik C.M.
      • Beier M.
      • Costello K.
      • DeLuca J.
      • Feinstein A.
      • Goverover Y.
      • Gudesblatt M.
      • Jaworski 3rd, M.
      • Kalb R.
      • Kostich L.
      • LaRocca N.G.
      • Rodgers J.D.
      • Benedict R.H.
      Computerized neuropsychological assessment devices in multiple sclerosis: a systematic review.
      ). However, many of these are computer or tablet-computer-based and validation studies were performed mostly in a clinical, supervised setting (
      • Merlo D.
      • Darby D.
      • Kalincik T.
      • Butzkueven H.
      • van der Walt A.
      The feasibility, reliability and concurrent validity of the MSReactor computerized cognitive screening tool in multiple sclerosis.
      ,
      • Merlo D.
      • Stankovich J.
      • Bai C.
      • Kalincik T.
      • Zhu C.
      • Gresle M.
      • Lechner-Scott J.
      • Kilpatrick T.
      • Barnett M.
      • Taylor B.
      Association between cognitive trajectories and disability progression in patients with relapsing-remitting multiple sclerosis.
      ;
      • Wojcik C.M.
      • Beier M.
      • Costello K.
      • DeLuca J.
      • Feinstein A.
      • Goverover Y.
      • Gudesblatt M.
      • Jaworski 3rd, M.
      • Kalb R.
      • Kostich L.
      • LaRocca N.G.
      • Rodgers J.D.
      • Benedict R.H.
      Computerized neuropsychological assessment devices in multiple sclerosis: a systematic review.
      ). With further technological advances, several smartphone applications have been developed to monitor cognition in pwMS. Given the ubiquity of smartphones, these apps are ideal for ecological assessment of pwMS’ cognition. However, many of these applications lack validation and reliability data. The advent of COVID-19 and its impact on immunocompromised pwMS has further emphasized the need for valid and reliable remote monitoring.
      Our objective was to comprehensively review smartphone applications monitoring cognition in MS and their psychometric properties. In particular, we focused on test–retest reliability, validity (including discriminant, ecological, predictive, concurrent validity), practice effects and feasibility.

      2. Methods

      2.1 Search protocol and inclusion criteria

      The systematic review is reported in accordance with the Cochrane Handbook and Preferred Reporting Items for Systematic Reviews and Meta-Analysis (PRISMA) statement (
      • Cumpston M.
      • Li T.
      • Page M.J.
      • Chandler J.
      • Welch V.A.
      • Higgins J.P.
      • Thomas J.
      Updated guidance for trusted systematic reviews: a new edition of the cochrane handbook for systematic reviews of interventions.
      ;
      • Moher D.
      • Liberati A.
      • Tetzlaff J.
      • Altman D.G.
      Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement.
      ). A systematic literature search in PubMed/Medline, Scopus, Web of Science, Cumulative Index of Nursing and Allied Health Literature (CINAHL), and IEEE Xplore was performed. The search terms included: multiple sclerosis OR MS AND Cognition OR Cognitive Reserve OR cognit* OR Memory OR Executive Function OR Language OR processing speed AND Smartphone OR Cell Phone OR phone OR Mobile Application. The search algorithm was adapted according to each database and filters were used to refine the searches. Searches were made in May 2022. Appendix 1 shows the specific search strategy for each database.
      Two authors (YF) and (FB) independently screened each title and abstract to identify potential articles investigating the use of smartphone to monitor cognition in pwMS. Full text articles were then retrieved and reviewed based on the eligibility criteria to determine articles for inclusion. A manual search in the reference lists (forward and backward searches) of included articles was conducted to identify other relevant studies. Any discrepancies were discussed until consensus was reached with a third author (DM).
      To be included in the systematic review, the following criteria had to be met: (1) Inclusion of pwMS above the age of 18 (2) Included at least one measure of a mobile-phone based digital biomarker for monitoring cognition (3) Includes sufficient data to analyse one or more of: test-retest reliability, validity, practice effects and feasibility. Review articles, abstracts, protocol papers, and patient-reported outcomes were not included in this review.

      2.2 Data extraction and quality assessment

      The following data were extracted: app name, authors, publication year, study design, expanded disability status scale (EDSS), sex, duration of disease, type of MS, age, number of pwMS in the study, smartphone test features, cognitive outcome measures and measures of validity and reliability. The following smartphone test features were used: mean number of correct responses on the smartphone SDMT, passive keystroke features for Neurokeys, median number of correct responses and standard deviation of time to identify a digit for elevateMS. The extraction of smartphone test features was based on the features examined in the selected studies.
      To aid readability, the following cut-offs were used to colour the cells of Table 2:
      • Pearson's r or Spearman's rho correlation coefficients (
        • Mukaka M.M.
        A guide to appropriate use of correlation coefficient in medical research.
        ): Strong was defined as >0.7 (coloured green), moderate between >0.4 and 0.7 (coloured yellow), and weak between >0.1 and 0.4 (coloured red).
      • Intraclass correlation coefficient (ICC) or Lin's Concordance correlation coefficient (CCC) (
        • Fleiss J.L.
        • Levin B.
        • Paik M.C.
        Statistical Methods for Rates and Proportions.
        ): Excellent was defined as >0.9 (coloured green), good as >0.75–0.9 (coloured yellow) and weak as </= 0.75 (coloured red).
      • Area under the receiver operating characteristic (AUC) (
        • Rice M.E.
        • Harris G.T.
        Comparing effect sizes in follow-up studies: ROC area, Cohen's d, and r.
        ): Acceptable was defined as >/= 0.7 (coloured green) and poor as <0.7 (coloured red).
      • T-tests or Wilcoxon's signed-rank tests (
        • Fleiss J.L.
        • Levin B.
        • Paik M.C.
        Statistical Methods for Rates and Proportions.
        ): Acceptable was defined as p-values <0.05 (coloured green) and poor as values >/=0.05 (coloured red).
      Where multiple correlation coefficients were reported for a range of smartphone test features, the best value was taken.
      The methodological quality of the included studies was evaluated using the Quality Assessment Tool for Observational Cohort and Cross-Sectional Studies developed by the

      National Institute of Health, 2014. Quality assessment tool for observational cohort and cross-sectional studies [Internet]. Available from: https://www.nhlbi.nih.gov/health-topics/study-quality-assessment-tools.

      . This tool includes 14 criteria that are rated according to three responses: Yes, No, or other. Studies with greater than 50% of responses (eight or more) being “yes” are considered “good” quality, 30–50% are considered ‘fair quality’ (five to seven), and less than 30% (four or less) are considered “poor” quality based on previous systematic reviews (
      • Abou L.
      • Alluri A.
      • Fliflet A.
      • Du Y.
      • Rice L.A.
      Effectiveness of physical therapy interventions in reducing fear of falling among individuals with neurologic diseases: a systematic review and meta-analysis.
      ;
      • Peters J.
      • Abou L.
      • Wong E.
      • Dossou M.S.
      • Sosnoff J.J.
      • Rice L.A.
      Smartphone-based gait and balance assessment in survivors of stroke: a systematic review.
      ).

      3. Results

      3.1 Study selection

      There were 2189 citations identified from the initial search of pre-specified databases and an additional citation was found through backward and forward searching. 230 duplicates were removed. Following title and abstract screening by two review authors a further 1925 were excluded. The remaining 35 articles were assessed for eligibility after retrieving full text articles. 12 articles were found that met the study inclusion criteria, covering six smartphone applications. The remaining 23 articles were excluded and reasons for exclusion have been specified. The PRISMA flow diagram is shown in Fig. 1.

      3.2 Study analysis

      Table 1 presents the characteristics of the studies included in this review. 12 studies were identified, covering six smartphone apps (FLOODLIGHT, MSSherpa, MSCopilot, Neurokeys, NeuFun, elevateMS). Six were prospective cohort studies, five were cross-sectional and one had both a cross-sectional and prospective cohort component. Both MSCopilot and FLOODLIGHT had two papers arising out of the same cohort, whilst MSSherpa and Neurokeys had initial cross-sectional studies with longitudinal follow-up reports with minor differences in participant demographics between the two timepoints. Six studies utilised the participant's own smartphones. Three studies examined only participants with RRMS, eight included all types of MS and one did not specify the types of MS included. The number of pwMS ranged from 21 to 495. Percentage of females ranged from 53 to 95%.
      Table 1Characteristics of included studies.
      App studiedRefs.Participant characteristicsSmartphone modelStudy designNumber of pwMS
      Floodlight
      • Montalban X.
      • Graves J.
      • Midaglia L.
      • Mulero P.
      • Julian L.
      • Baker M.
      • Schadrack J.
      • Gossens C.
      • Ganzetti M.
      • Scotland A.
      • Lipsmeier F.
      • van Beek J.
      • Bernasconi C.
      • Belachew S.
      • Lindemann M.
      • Hauser S.L.
      A smartphone sensor-based digital outcome assessment of multiple sclerosis.
      -Mean (±SD) EDSS: 2.4 ± 1.4

      -Mean (±SD) Age: 39.5 ± 7.9

      -Female%: 53%

      -Mean time since onset (±SD): 11.3 ± 7

      -Type of MS: all
      Samsung Galaxy S7Prospective cohort76
      • Midaglia L.
      • Mulero P.
      • Montalban X.
      • Graves J.
      • Hauser S.L.
      • Julian L.
      • Baker M.
      • Schadrack J.
      • Gossens C.
      • Scotland A.
      • Lipsmeier F.
      • van Beek J.
      • Bernasconi C.
      • Belachew S.
      • Lindemann M.
      Adherence and satisfaction of smartphone- and smartwatch-based remote active testing and passive monitoring in people with multiple sclerosis: nonrandomized interventional feasibility study.
      • Woelfle T.
      • Pless S.
      • Wiencierz A.
      • Kappos L.
      • Naegelin Y.
      • Lorscheider J.
      Practice effects of mobile tests of cognition, dexterity, and mobility on patients with multiple sclerosis: data analysis of a smartphone-based observational study.
      -Mean (±SD) EDSS: unknown

      -Median (IQR) Age: 50.2 (42.0–58.0)

      -Female%: 70.2%

      -Mean time since onset (±SD): unknown

      -Type of MS: unknown
      Participant's smartphoneProspective cohort262
      MSSherpa
      • Lam K.H.
      • van Oirschot P.
      • den Teuling B.
      • Hulst H.E.
      • de Jong B.A.
      • Uitdehaag B.M.J.
      • de Groot V.
      • Killestein J.
      Reliability, construct and concurrent validity of a smartphone-based cognition test in multiple sclerosis.
      c
      -Median (Range) EDSS: 3.5 (1.5–7)

      -Mean (±SD) Age: 46.9 ± 10.1

      -Female%: 73.9%

      -Median time since onset (IQR): 10.9 (5.3–18.3)

      -Type of MS: all
      Participant's smartphoneCross-sectional92
      • Lam K.H.
      • Bucur I.G.
      • Van Oirschot P.
      • De Graaf F.
      • Weda H.
      • Strijbis E.
      • Uitdehaag B.
      • Heskes T.
      • Killestein J.
      • De Groot V.
      Towards individualized monitoring of cognition in multiple sclerosis in the digital era: a one-year cohort study.
      -Median (IQR) EDSS: 3.5 (2.5–6)

      -Mean (±SD) Age: 46.5 ± 10.3

      -Female%: 74%

      -Median time since onset (IQR): 5.7 (3.1–27.1)

      -Type of MS: all
      Participant's smartphoneProspective cohort100
      • van Oirschot P.
      • Heerings M.
      • Wendrich K.
      • den Teuling B.
      • Martens M.B.
      • Jongen P.J.
      Symbol digit modalities test variant in a smartphone app for persons with multiple sclerosis: validation study.
      -Mean (±SD) EDSS: 3.1 (1.4)

      -Mean (±SD) Age: 40 ± 8

      -Female%: 92%

      -Mean time since onset (±SD): 6 ± 4.4

      -Type of MS: RRMS
      Participant's smartphoneCross-sectional25
      MSCopilot
      • Maillart E.
      • Labauge P.
      • Cohen M.
      • Maarouf A.
      • Vukusic S.
      • Donzé C.
      • Gallien P.
      • De Sèze J.
      • Bourre B.
      • Moreau T.
      • Louapre C.
      • Mayran P.
      • Bieuvelet S.
      • Vallée M.
      • Bertillot F.
      • Klaeylé L.
      • Argoud A.L.
      • Zinaï S.
      • Tourbah A.
      MSCopilot, a new multiple sclerosis self-assessment digital solution: results of a comparative study versus standard tests.
      -Mean (±SD) EDSS: 3.6 (1.6)

      -Mean (±SD) Age: 46 ± 10

      -Female%: 61.2%

      -Mean time since onset (±SD): 12 ± 7

      -Type of MS: RRMS
      Not specifiedCross-sectional116
      • Tanoh I.C.
      • Maillart E.
      • Labauge P.
      • Cohen M.
      • Maarouf A.
      • Vukusic S.
      • Donzé C.
      • Gallien P.
      • De Sèze J.
      • Bourre B.
      • Moreau T.
      • Louapre C.
      • Vallée M.
      • Bieuvelet S.
      • Klaeylé L.
      • Argoud A.L.
      • Zinaï S.
      • Tourbah A.
      MSCopilot: new smartphone-based digital biomarkers correlate with expanded disability status scale scores in people with multiple sclerosis.
      Neurokeys
      • Lam K.H.
      • Twose J.
      • McConchie H.
      • Licitra G.
      • Meijer K.
      • de Ruiter L.
      • van Lierop Z.
      • Moraal B.
      • Barkhof F.
      • Uitdehaag B.
      • de Groot V.
      • Killestein J.
      Smartphone-derived keystroke dynamics are sensitive to relevant changes in multiple sclerosis.
      -Median (IQR) EDSS: 3.5 (2.5–4.0)

      -Mean (±SD) Age: 46.7 ± 10.4

      -Female%: 72.3%

      -Median time since diagnosis (IQR): 6.0 (3.0–12.4)

      -Type of MS: all
      Participant's smartphoneProspective cohort94
      • Lam K.H.
      • Meijer K.A.
      • Loonstra F.C.
      • Coerver E.M.E.
      • Twose J.
      • Redeman E.
      • Moraal B.
      • Barkhof F.
      • de Groot V.
      • Uitdehaag B.M.J.
      • Killestein J.
      Real-world keystroke dynamics are a potentially valid biomarker for clinical disability in multiple sclerosis.
      -Median (IQR) EDSS: 3.5 (2.5–4.0)

      -Mean (±SD) Age: 46.4 ± 10.1

      -Female%: 75.3%

      -Median time since onset (IQR): 11.3 (5.1–17.7)

      -Type of MS: all
      Participant's smartphoneCross-sectional85
      elevateMS
      • Pratap A.
      • Grant D.
      • Vegesna A.
      • Tummalacherla M.
      • Cohan S.
      • Deshpande C.
      • Mangravite L.
      • Omberg L.
      Evaluating the utility of smartphone-based sensor assessments in persons with multiple sclerosis in the real-world using an App (elevateMS): observational, prospective pilot digital health study.
      -Mean (±SD) EDSS: NA

      -Mean (±SD) Age: 46.2 ± 11.6

      -Female%: 76.8%

      -Mean time since onset (±SD): 12.0 ± 9.0

      -Type of MS: all
      Participant's smartphone (iPhone 5 or newer)Prospective cohort495
      Neufun
      • Pham L.
      • Harris T.
      • Varosanec M.
      • Morgan V.
      • Kosa P.
      • Bielekova B.
      Smartphone-based symbol-digit modalities test reliably captures brain damage in multiple sclerosis.
      -Mean (Range) EDSS (for subset of 112 pwMS): 4.9 (0–7)

      -Mean (Range) Age: 54.3 (18.5 - 77.2)

      -Female%: 58%

      -Mean time since onset (±SD): NA

      -Type of MS: all
      Google Pixel XL/2XL phones running Android 9 and aboveCross-sectional (154 pwMS) & longitudinal (15 pwMS)154
      Abbreviations: EDSS: Expanded Disability Status Scale, IQR: Interquartile Range, MS: Multiple Sclerosis, NA: Not Available, PP: Primary Progressive, pwMS: people with multiple sclerosis, RIS: Radiologically Isolated Syndrome, RR: Relapsing-Remitting, SD: Standard Deviation, SP: Secondary Progressive.
      Of the six apps, five used smartphone versions of the SDMT to assess cognition. A figure showcasing the subtle differences between different versions of smartphone SDMTs are attached in Figure Appendix 1. Of these, only elevateMS utilised a voice-controlled version. Neurokeys examined passive keystroke features.

      3.3 Psychometric properties of smartphone applications

      3.3.1 Smartphone SDMTs

      The MSSherpa version of the smartphone SDMT has been shown to correlate moderate to strongly with paper-based SDMT (r = 0.622 to 0.784), although there were concerns regarding incorrectly administered SDMT in one of the validation studies (
      • van Oirschot P.
      • Heerings M.
      • Wendrich K.
      • den Teuling B.
      • Martens M.B.
      • Jongen P.J.
      Symbol digit modalities test variant in a smartphone app for persons with multiple sclerosis: validation study.
      ). Correlation with other measures of cognition such as the CVLT-II (r = 0.451 to 0.516) and BVMT-R (r = 0.532 to 0.599) were moderate (
      • Lam K.H.
      • van Oirschot P.
      • den Teuling B.
      • Hulst H.E.
      • de Jong B.A.
      • Uitdehaag B.M.J.
      • de Groot V.
      • Killestein J.
      Reliability, construct and concurrent validity of a smartphone-based cognition test in multiple sclerosis.
      c). Test-retest reliability values were good to excellent (ICC 0.874 to 0.934) (
      • Lam K.H.
      • Bucur I.G.
      • Van Oirschot P.
      • De Graaf F.
      • Weda H.
      • Strijbis E.
      • Uitdehaag B.
      • Heskes T.
      • Killestein J.
      • De Groot V.
      Towards individualized monitoring of cognition in multiple sclerosis in the digital era: a one-year cohort study.
      ,
      • Lam K.H.
      • van Oirschot P.
      • den Teuling B.
      • Hulst H.E.
      • de Jong B.A.
      • Uitdehaag B.M.J.
      • de Groot V.
      • Killestein J.
      Reliability, construct and concurrent validity of a smartphone-based cognition test in multiple sclerosis.
      c;
      • van Oirschot P.
      • Heerings M.
      • Wendrich K.
      • den Teuling B.
      • Martens M.B.
      • Jongen P.J.
      Symbol digit modalities test variant in a smartphone app for persons with multiple sclerosis: validation study.
      ). Discriminative validity was demonstrated, with AUC as high as 0.922 for distinguishing pwMS with and without CI (
      • Lam K.H.
      • van Oirschot P.
      • den Teuling B.
      • Hulst H.E.
      • de Jong B.A.
      • Uitdehaag B.M.J.
      • de Groot V.
      • Killestein J.
      Reliability, construct and concurrent validity of a smartphone-based cognition test in multiple sclerosis.
      c). However, the ability to predict longitudinal change in BICAMS and traditional SDMT was less impressive, with AUCs below 0.7 (
      • Lam K.H.
      • Bucur I.G.
      • Van Oirschot P.
      • De Graaf F.
      • Weda H.
      • Strijbis E.
      • Uitdehaag B.
      • Heskes T.
      • Killestein J.
      • De Groot V.
      Towards individualized monitoring of cognition in multiple sclerosis in the digital era: a one-year cohort study.
      ). The minimum clinically important difference (MCID) was also found to be smaller than the smallest detectable change (SDC) (
      • Lam K.H.
      • Bucur I.G.
      • Van Oirschot P.
      • De Graaf F.
      • Weda H.
      • Strijbis E.
      • Uitdehaag B.
      • Heskes T.
      • Killestein J.
      • De Groot V.
      Towards individualized monitoring of cognition in multiple sclerosis in the digital era: a one-year cohort study.
      ). Both pwMS and healthy controls (HC) scored lower on the smartphone versions on the SDMT, likely due to the response time required in between symbols (
      • Lam K.H.
      • van Oirschot P.
      • den Teuling B.
      • Hulst H.E.
      • de Jong B.A.
      • Uitdehaag B.M.J.
      • de Groot V.
      • Killestein J.
      Reliability, construct and concurrent validity of a smartphone-based cognition test in multiple sclerosis.
      c;
      • van Oirschot P.
      • Heerings M.
      • Wendrich K.
      • den Teuling B.
      • Martens M.B.
      • Jongen P.J.
      Symbol digit modalities test variant in a smartphone app for persons with multiple sclerosis: validation study.
      ).
      The MS Copilot version was shown to correlate strongly with MSFC (r = 0.81) (250) and moderately with EDSS (r = 0.65) as part of a combined test battery (
      • Maillart E.
      • Labauge P.
      • Cohen M.
      • Maarouf A.
      • Vukusic S.
      • Donzé C.
      • Gallien P.
      • De Sèze J.
      • Bourre B.
      • Moreau T.
      • Louapre C.
      • Mayran P.
      • Bieuvelet S.
      • Vallée M.
      • Bertillot F.
      • Klaeylé L.
      • Argoud A.L.
      • Zinaï S.
      • Tourbah A.
      MSCopilot, a new multiple sclerosis self-assessment digital solution: results of a comparative study versus standard tests.
      ;
      • Tanoh I.C.
      • Maillart E.
      • Labauge P.
      • Cohen M.
      • Maarouf A.
      • Vukusic S.
      • Donzé C.
      • Gallien P.
      • De Sèze J.
      • Bourre B.
      • Moreau T.
      • Louapre C.
      • Vallée M.
      • Bieuvelet S.
      • Klaeylé L.
      • Argoud A.L.
      • Zinaï S.
      • Tourbah A.
      MSCopilot: new smartphone-based digital biomarkers correlate with expanded disability status scale scores in people with multiple sclerosis.
      ), with comparable performance to the traditional MSFC. It also displayed comparable discriminative validity as part of MSCopilot compared to MSFC with significant differences between HC, pwMS with mild disability and pwMS with moderate disability (
      • Maillart E.
      • Labauge P.
      • Cohen M.
      • Maarouf A.
      • Vukusic S.
      • Donzé C.
      • Gallien P.
      • De Sèze J.
      • Bourre B.
      • Moreau T.
      • Louapre C.
      • Mayran P.
      • Bieuvelet S.
      • Vallée M.
      • Bertillot F.
      • Klaeylé L.
      • Argoud A.L.
      • Zinaï S.
      • Tourbah A.
      MSCopilot, a new multiple sclerosis self-assessment digital solution: results of a comparative study versus standard tests.
      ;
      • Tanoh I.C.
      • Maillart E.
      • Labauge P.
      • Cohen M.
      • Maarouf A.
      • Vukusic S.
      • Donzé C.
      • Gallien P.
      • De Sèze J.
      • Bourre B.
      • Moreau T.
      • Louapre C.
      • Vallée M.
      • Bieuvelet S.
      • Klaeylé L.
      • Argoud A.L.
      • Zinaï S.
      • Tourbah A.
      MSCopilot: new smartphone-based digital biomarkers correlate with expanded disability status scale scores in people with multiple sclerosis.
      ). On its own, it had an AUC of 0.72 for predicting pwMS with EDSS >3.5. When included as part of the MScopilot battery the AUC increased to 0.92 (
      • Maillart E.
      • Labauge P.
      • Cohen M.
      • Maarouf A.
      • Vukusic S.
      • Donzé C.
      • Gallien P.
      • De Sèze J.
      • Bourre B.
      • Moreau T.
      • Louapre C.
      • Mayran P.
      • Bieuvelet S.
      • Vallée M.
      • Bertillot F.
      • Klaeylé L.
      • Argoud A.L.
      • Zinaï S.
      • Tourbah A.
      MSCopilot, a new multiple sclerosis self-assessment digital solution: results of a comparative study versus standard tests.
      ;
      • Tanoh I.C.
      • Maillart E.
      • Labauge P.
      • Cohen M.
      • Maarouf A.
      • Vukusic S.
      • Donzé C.
      • Gallien P.
      • De Sèze J.
      • Bourre B.
      • Moreau T.
      • Louapre C.
      • Vallée M.
      • Bieuvelet S.
      • Klaeylé L.
      • Argoud A.L.
      • Zinaï S.
      • Tourbah A.
      MSCopilot: new smartphone-based digital biomarkers correlate with expanded disability status scale scores in people with multiple sclerosis.
      ). There was no significant difference between the discriminative validity of the smartphone SDMT compared to traditional SDMT. Test-retest reliability was good (ICC = 0.9). Acceptability was high amongst both participants and HCP, with 85% of participants willing to use it at least monthly (
      • Maillart E.
      • Labauge P.
      • Cohen M.
      • Maarouf A.
      • Vukusic S.
      • Donzé C.
      • Gallien P.
      • De Sèze J.
      • Bourre B.
      • Moreau T.
      • Louapre C.
      • Mayran P.
      • Bieuvelet S.
      • Vallée M.
      • Bertillot F.
      • Klaeylé L.
      • Argoud A.L.
      • Zinaï S.
      • Tourbah A.
      MSCopilot, a new multiple sclerosis self-assessment digital solution: results of a comparative study versus standard tests.
      ).
      The FLOODLIGHT version correlates strongly with traditional SDMT (r = 0.85) and moderately with EDSS (r = −0.43), magnetic resonance imaging (MRI) measures (r = 0.42 to 0.54) and multiple sclerosis impact scale (MSIS-29) (r = −0.52) (
      • Montalban X.
      • Graves J.
      • Midaglia L.
      • Mulero P.
      • Julian L.
      • Baker M.
      • Schadrack J.
      • Gossens C.
      • Ganzetti M.
      • Scotland A.
      • Lipsmeier F.
      • van Beek J.
      • Bernasconi C.
      • Belachew S.
      • Lindemann M.
      • Hauser S.L.
      A smartphone sensor-based digital outcome assessment of multiple sclerosis.
      ). Test-retest reliability was good (ICC = 0.85) (
      • Montalban X.
      • Graves J.
      • Midaglia L.
      • Mulero P.
      • Julian L.
      • Baker M.
      • Schadrack J.
      • Gossens C.
      • Ganzetti M.
      • Scotland A.
      • Lipsmeier F.
      • van Beek J.
      • Bernasconi C.
      • Belachew S.
      • Lindemann M.
      • Hauser S.L.
      A smartphone sensor-based digital outcome assessment of multiple sclerosis.
      ). It was highly acceptable to pwMS, with 84.7% of participants reporting at least acceptable impact of the FLOODLIGHT battery on daily activities (
      • Midaglia L.
      • Mulero P.
      • Montalban X.
      • Graves J.
      • Hauser S.L.
      • Julian L.
      • Baker M.
      • Schadrack J.
      • Gossens C.
      • Scotland A.
      • Lipsmeier F.
      • van Beek J.
      • Bernasconi C.
      • Belachew S.
      • Lindemann M.
      Adherence and satisfaction of smartphone- and smartwatch-based remote active testing and passive monitoring in people with multiple sclerosis: nonrandomized interventional feasibility study.
      ). However, adherence decreased over time, with only 60.5% of pwMS adhering to at least 3 days of testing per week after 24 weeks. There was evidence of both short and long-term practice effects extending beyond five repetitions (
      • Woelfle T.
      • Pless S.
      • Wiencierz A.
      • Kappos L.
      • Naegelin Y.
      • Lorscheider J.
      Practice effects of mobile tests of cognition, dexterity, and mobility on patients with multiple sclerosis: data analysis of a smartphone-based observational study.
      ).
      Correlation of the NeuFun version with traditional SDMT was good (CCC = 0.84) and weak to moderate with clinical composite measures (Rho = −0.24 to −0.49), MRI measures (Rho = 0.45–0.46) and unique smartphone-based measures of upper limb function (r = 0.45)(
      • Pham L.
      • Harris T.
      • Varosanec M.
      • Morgan V.
      • Kosa P.
      • Bielekova B.
      Smartphone-based symbol-digit modalities test reliably captures brain damage in multiple sclerosis.
      ). The magnitude of the correlations was comparable with traditional SDMT and exceeded that of the paced auditory serial addition test (PASAT). Test-retest reliability was good (ICC 0.87 to 0.9) (
      • Pham L.
      • Harris T.
      • Varosanec M.
      • Morgan V.
      • Kosa P.
      • Bielekova B.
      Smartphone-based symbol-digit modalities test reliably captures brain damage in multiple sclerosis.
      ). There was a significant difference between test scores of HC and pwMS and a practice effect was noted at up to eight repetitions.
      The only voice-controlled digit substitution test is utilized by Elevate MS. This has shown limited discriminative validity only - it was only able to distinguish pwMS with severe impairment, and not those with mild or moderate impairment (
      • Pratap A.
      • Grant D.
      • Vegesna A.
      • Tummalacherla M.
      • Cohan S.
      • Deshpande C.
      • Mangravite L.
      • Omberg L.
      Evaluating the utility of smartphone-based sensor assessments in persons with multiple sclerosis in the real-world using an App (elevateMS): observational, prospective pilot digital health study.
      ). Other forms of validity and reliability were not examined in the study.
      Predictors of smartphone based SDMT performance is an emerging area of research. Pham et al. showed through elastic net regression that age, vision, eye movement, MRI measures (brain volume and T2 lesion volume (T2LV)) and cerebellar function predict smartphone SDMT performance (
      • Pham L.
      • Harris T.
      • Varosanec M.
      • Morgan V.
      • Kosa P.
      • Bielekova B.
      Smartphone-based symbol-digit modalities test reliably captures brain damage in multiple sclerosis.
      ).

      3.3.2 Passive keystroke monitoring

      A passive monitoring approach towards cognitive testing has been tested with the Neurokeys app. This app continuously collected keystroke data whenever the keyboard was used. Cross-sectional analysis of eight passive keystroke features revealed test-retest reliability in the good to excellent range. Correlation with SDMT, EDSS and 9-hole peg test (9HPT) was moderate and Release-Release Latency had the strongest correlation with SDMT with a Pearson's r value of −0.553 (
      • Lam K.H.
      • Meijer K.A.
      • Loonstra F.C.
      • Coerver E.M.E.
      • Twose J.
      • Redeman E.
      • Moraal B.
      • Barkhof F.
      • de Groot V.
      • Uitdehaag B.M.J.
      • Killestein J.
      Real-world keystroke dynamics are a potentially valid biomarker for clinical disability in multiple sclerosis.
      ). In the follow-up longitudinal analysis, a further seven features were included and a total of 15 features were analysed. In longitudinal analysis, the variables examined such as emoji sentiments had an AUC <0.7 for prediction of clinically relevant change in SDMT (
      • Lam K.H.
      • Twose J.
      • McConchie H.
      • Licitra G.
      • Meijer K.
      • de Ruiter L.
      • van Lierop Z.
      • Moraal B.
      • Barkhof F.
      • Uitdehaag B.
      • de Groot V.
      • Killestein J.
      Smartphone-derived keystroke dynamics are sensitive to relevant changes in multiple sclerosis.
      ). It should be noted that this exceeded that of the traditional clinical benchmark (EDSS) used in the study. Other test features predicted clinically relevant change in the EDSS and new or enhancing MRI lesions with AUC exceeding 0.7 and again outperforming the traditional clinical benchmarks of 9HPT and EDSS respectively. This remains an extremely promising field given its ability to convert real-world smartphone use into a surrogate for cognitive function without the need for active participation by pwMS.
      A summary of the validity and reliability of the aforementioned apps can be found in Table 2. For each app, if multiple studies examining reliability and validity were found, each study was allocated a separate row. Detailed information including duration in-between test and retest and exact correlation coefficients are included in Appendix Table 1.
      Table 2Validity and reliability of smartphone apps monitoring cognition.
      Smartphone appConcurrent validity with established cognitive testsConcurrent validity with clinical composite measuresOther forms of concurrent validityDiscriminant/predictive validity *Test-retest reliability
      FloodlightStrong with SDMT (
      • Montalban X.
      • Graves J.
      • Midaglia L.
      • Mulero P.
      • Julian L.
      • Baker M.
      • Schadrack J.
      • Gossens C.
      • Ganzetti M.
      • Scotland A.
      • Lipsmeier F.
      • van Beek J.
      • Bernasconi C.
      • Belachew S.
      • Lindemann M.
      • Hauser S.L.
      A smartphone sensor-based digital outcome assessment of multiple sclerosis.
      )
      Moderate with EDSS (
      • Montalban X.
      • Graves J.
      • Midaglia L.
      • Mulero P.
      • Julian L.
      • Baker M.
      • Schadrack J.
      • Gossens C.
      • Ganzetti M.
      • Scotland A.
      • Lipsmeier F.
      • van Beek J.
      • Bernasconi C.
      • Belachew S.
      • Lindemann M.
      • Hauser S.L.
      A smartphone sensor-based digital outcome assessment of multiple sclerosis.
      )
      Moderate with T2/FLAIR LV and BV

      Moderate with MSIS-29 (
      • Montalban X.
      • Graves J.
      • Midaglia L.
      • Mulero P.
      • Julian L.
      • Baker M.
      • Schadrack J.
      • Gossens C.
      • Ganzetti M.
      • Scotland A.
      • Lipsmeier F.
      • van Beek J.
      • Bernasconi C.
      • Belachew S.
      • Lindemann M.
      • Hauser S.L.
      A smartphone sensor-based digital outcome assessment of multiple sclerosis.
      )
      ICC 0.85 (
      • Montalban X.
      • Graves J.
      • Midaglia L.
      • Mulero P.
      • Julian L.
      • Baker M.
      • Schadrack J.
      • Gossens C.
      • Ganzetti M.
      • Scotland A.
      • Lipsmeier F.
      • van Beek J.
      • Bernasconi C.
      • Belachew S.
      • Lindemann M.
      • Hauser S.L.
      A smartphone sensor-based digital outcome assessment of multiple sclerosis.
      )
      MSSherpaModerate with CVLT, BVMT-R, SDMT (
      • Lam K.H.
      • van Oirschot P.
      • den Teuling B.
      • Hulst H.E.
      • de Jong B.A.
      • Uitdehaag B.M.J.
      • de Groot V.
      • Killestein J.
      Reliability, construct and concurrent validity of a smartphone-based cognition test in multiple sclerosis.
      c)
      Moderate with EDSS (
      • Lam K.H.
      • van Oirschot P.
      • den Teuling B.
      • Hulst H.E.
      • de Jong B.A.
      • Uitdehaag B.M.J.
      • de Groot V.
      • Killestein J.
      Reliability, construct and concurrent validity of a smartphone-based cognition test in multiple sclerosis.
      c)
      - pwMS & HC

      - CIpwMS & pwMSwoCI (
      • Lam K.H.
      • van Oirschot P.
      • den Teuling B.
      • Hulst H.E.
      • de Jong B.A.
      • Uitdehaag B.M.J.
      • de Groot V.
      • Killestein J.
      Reliability, construct and concurrent validity of a smartphone-based cognition test in multiple sclerosis.
      c)
      ICC 0.875 to 0.934 (
      • Lam K.H.
      • van Oirschot P.
      • den Teuling B.
      • Hulst H.E.
      • de Jong B.A.
      • Uitdehaag B.M.J.
      • de Groot V.
      • Killestein J.
      Reliability, construct and concurrent validity of a smartphone-based cognition test in multiple sclerosis.
      c
      - 3 month SDMT change

      - 12 month BICAMS change (
      • Lam K.H.
      • Bucur I.G.
      • Van Oirschot P.
      • De Graaf F.
      • Weda H.
      • Strijbis E.
      • Uitdehaag B.
      • Heskes T.
      • Killestein J.
      • De Groot V.
      Towards individualized monitoring of cognition in multiple sclerosis in the digital era: a one-year cohort study.
      )
      ICC 0.9 (
      • Lam K.H.
      • Bucur I.G.
      • Van Oirschot P.
      • De Graaf F.
      • Weda H.
      • Strijbis E.
      • Uitdehaag B.
      • Heskes T.
      • Killestein J.
      • De Groot V.
      Towards individualized monitoring of cognition in multiple sclerosis in the digital era: a one-year cohort study.
      )
      Strong with SDMT (
      • van Oirschot P.
      • Heerings M.
      • Wendrich K.
      • den Teuling B.
      • Martens M.B.
      • Jongen P.J.
      Symbol digit modalities test variant in a smartphone app for persons with multiple sclerosis: validation study.
      )
      - pwMS & matched HC (
      • van Oirschot P.
      • Heerings M.
      • Wendrich K.
      • den Teuling B.
      • Martens M.B.
      • Jongen P.J.
      Symbol digit modalities test variant in a smartphone app for persons with multiple sclerosis: validation study.
      )
      ICC 0.874 (
      • van Oirschot P.
      • Heerings M.
      • Wendrich K.
      • den Teuling B.
      • Martens M.B.
      • Jongen P.J.
      Symbol digit modalities test variant in a smartphone app for persons with multiple sclerosis: validation study.
      )
      MSCopilotStrong with MSFC as part of MSCopilot (
      • Maillart E.
      • Labauge P.
      • Cohen M.
      • Maarouf A.
      • Vukusic S.
      • Donzé C.
      • Gallien P.
      • De Sèze J.
      • Bourre B.
      • Moreau T.
      • Louapre C.
      • Mayran P.
      • Bieuvelet S.
      • Vallée M.
      • Bertillot F.
      • Klaeylé L.
      • Argoud A.L.
      • Zinaï S.
      • Tourbah A.
      MSCopilot, a new multiple sclerosis self-assessment digital solution: results of a comparative study versus standard tests.
      )
      - pwMS & HC (as part of MSCopilot) (
      • Maillart E.
      • Labauge P.
      • Cohen M.
      • Maarouf A.
      • Vukusic S.
      • Donzé C.
      • Gallien P.
      • De Sèze J.
      • Bourre B.
      • Moreau T.
      • Louapre C.
      • Mayran P.
      • Bieuvelet S.
      • Vallée M.
      • Bertillot F.
      • Klaeylé L.
      • Argoud A.L.
      • Zinaï S.
      • Tourbah A.
      MSCopilot, a new multiple sclerosis self-assessment digital solution: results of a comparative study versus standard tests.
      )
      ICC 0.90 (
      • Maillart E.
      • Labauge P.
      • Cohen M.
      • Maarouf A.
      • Vukusic S.
      • Donzé C.
      • Gallien P.
      • De Sèze J.
      • Bourre B.
      • Moreau T.
      • Louapre C.
      • Mayran P.
      • Bieuvelet S.
      • Vallée M.
      • Bertillot F.
      • Klaeylé L.
      • Argoud A.L.
      • Zinaï S.
      • Tourbah A.
      MSCopilot, a new multiple sclerosis self-assessment digital solution: results of a comparative study versus standard tests.
      )
      Moderate with EDSS as part of MSCopilot (
      • Tanoh I.C.
      • Maillart E.
      • Labauge P.
      • Cohen M.
      • Maarouf A.
      • Vukusic S.
      • Donzé C.
      • Gallien P.
      • De Sèze J.
      • Bourre B.
      • Moreau T.
      • Louapre C.
      • Vallée M.
      • Bieuvelet S.
      • Klaeylé L.
      • Argoud A.L.
      • Zinaï S.
      • Tourbah A.
      MSCopilot: new smartphone-based digital biomarkers correlate with expanded disability status scale scores in people with multiple sclerosis.
      )
      - pwMS with EDSS </=3.5 & pwMS with EDSS >3.5 (
      • Tanoh I.C.
      • Maillart E.
      • Labauge P.
      • Cohen M.
      • Maarouf A.
      • Vukusic S.
      • Donzé C.
      • Gallien P.
      • De Sèze J.
      • Bourre B.
      • Moreau T.
      • Louapre C.
      • Vallée M.
      • Bieuvelet S.
      • Klaeylé L.
      • Argoud A.L.
      • Zinaï S.
      • Tourbah A.
      MSCopilot: new smartphone-based digital biomarkers correlate with expanded disability status scale scores in people with multiple sclerosis.
      )
      NeufunGood with SDMT (
      • Pham L.
      • Harris T.
      • Varosanec M.
      • Morgan V.
      • Kosa P.
      • Bielekova B.
      Smartphone-based symbol-digit modalities test reliably captures brain damage in multiple sclerosis.
      )
      Weak to moderate with NeurEx features 

      (
      • Pham L.
      • Harris T.
      • Varosanec M.
      • Morgan V.
      • Kosa P.
      • Bielekova B.
      Smartphone-based symbol-digit modalities test reliably captures brain damage in multiple sclerosis.
      )
      Moderate with BV and T2LV 

      Moderate with tapping score (
      • Pham L.
      • Harris T.
      • Varosanec M.
      • Morgan V.
      • Kosa P.
      • Bielekova B.
      Smartphone-based symbol-digit modalities test reliably captures brain damage in multiple sclerosis.
      )
      -pwMS & HC (
      • Pham L.
      • Harris T.
      • Varosanec M.
      • Morgan V.
      • Kosa P.
      • Bielekova B.
      Smartphone-based symbol-digit modalities test reliably captures brain damage in multiple sclerosis.
      )


      ICC 0.87-0.9 (
      • Pham L.
      • Harris T.
      • Varosanec M.
      • Morgan V.
      • Kosa P.
      • Bielekova B.
      Smartphone-based symbol-digit modalities test reliably captures brain damage in multiple sclerosis.
      )
      Neurokeys- change in SDMT over 3 months (
      • Lam K.H.
      • Bucur I.G.
      • Van Oirschot P.
      • De Graaf F.
      • Weda H.
      • Strijbis E.
      • Uitdehaag B.
      • Heskes T.
      • Killestein J.
      • De Groot V.
      Towards individualized monitoring of cognition in multiple sclerosis in the digital era: a one-year cohort study.
      )
      Moderate with SDMT (
      • Lam K.H.
      • Meijer K.A.
      • Loonstra F.C.
      • Coerver E.M.E.
      • Twose J.
      • Redeman E.
      • Moraal B.
      • Barkhof F.
      • de Groot V.
      • Uitdehaag B.M.J.
      • Killestein J.
      Real-world keystroke dynamics are a potentially valid biomarker for clinical disability in multiple sclerosis.
      )
      Moderate with EDSS (
      • Lam K.H.
      • Meijer K.A.
      • Loonstra F.C.
      • Coerver E.M.E.
      • Twose J.
      • Redeman E.
      • Moraal B.
      • Barkhof F.
      • de Groot V.
      • Uitdehaag B.M.J.
      • Killestein J.
      Real-world keystroke dynamics are a potentially valid biomarker for clinical disability in multiple sclerosis.
      )
      Moderate with 9HPT (
      • Lam K.H.
      • Meijer K.A.
      • Loonstra F.C.
      • Coerver E.M.E.
      • Twose J.
      • Redeman E.
      • Moraal B.
      • Barkhof F.
      • de Groot V.
      • Uitdehaag B.M.J.
      • Killestein J.
      Real-world keystroke dynamics are a potentially valid biomarker for clinical disability in multiple sclerosis.
      )
      -pwMS & HC (
      • Lam K.H.
      • Meijer K.A.
      • Loonstra F.C.
      • Coerver E.M.E.
      • Twose J.
      • Redeman E.
      • Moraal B.
      • Barkhof F.
      • de Groot V.
      • Uitdehaag B.M.J.
      • Killestein J.
      Real-world keystroke dynamics are a potentially valid biomarker for clinical disability in multiple sclerosis.
      )
      ICC = 0.601 to 0.965 (
      • Lam K.H.
      • Meijer K.A.
      • Loonstra F.C.
      • Coerver E.M.E.
      • Twose J.
      • Redeman E.
      • Moraal B.
      • Barkhof F.
      • de Groot V.
      • Uitdehaag B.M.J.
      • Killestein J.
      Real-world keystroke dynamics are a potentially valid biomarker for clinical disability in multiple sclerosis.
      )
      Elevate MS-pwMS with moderate disability & other pwMS

      -pwMS with gait impairment & other pwMS

      - pwMS with severe CI & other pwMS

      (
      • Pratap A.
      • Grant D.
      • Vegesna A.
      • Tummalacherla M.
      • Cohan S.
      • Deshpande C.
      • Mangravite L.
      • Omberg L.
      Evaluating the utility of smartphone-based sensor assessments in persons with multiple sclerosis in the real-world using an App (elevateMS): observational, prospective pilot digital health study.
      )
      Abbreviations: 9HPT, 9-hole peg test; BICAMS, brief international cognitive assessment for MS; BV, brain volume; BVMT-R, brief visuospatial memory test-revised; CI, cognitively impaired; CVLT, California verbal learning test; EDSS, expanded disability status scale; FLAIR, fluid attenuated inversion recovery; HC, healthy controls; ICC, intra-class correlation; MSFC, MS functional composite; MSIS-29, MS impact scale; pwMS, people with MS;  pwMSwoCI, people with MS without cognitive impairment; SDMT, symbol digit modalities test; T2LV, T2 lesion volume.
      For the purposes of this study, strong correlation was defined as  Pearson's r or Spearman's rho >0.7 (coloured green), moderate between 0.4 and 0.7 (coloured yellow), and weak between 0.1 and 0.4 (coloured red).
      With regards to ICC, strong was defined as >0.9 (coloured green), good as >0.75–0.9 (coloured yellow) and weak as </= 0.75.
      With regards to AUC, any score >/= 0.7 was considered acceptable (coloured green), scores <0.7 were considered poor (coloured red).
      With regards to t-tests or Wilcoxon's signed-rank test used to discriminate pwMS VS controls, a p-value <0.05 was considered acceptable (coloured green), scores >/=0.05 was considered poor (coloured red).
      Where a range of values were reported, the best possible value was taken for the colour coding.
      *With regards to discriminant/known-groups validity, the known groups are separated by the symbol &.

      3.3.3 Practice effects

      Practice effects were examined for FLOODLIGHT, MSSherpa and Neufun. For MSSherpa, prominent practice effects were noted between the first and second test, with a cohen's d value of 0.477 reported (
      • van Oirschot P.
      • Heerings M.
      • Wendrich K.
      • den Teuling B.
      • Martens M.B.
      • Jongen P.J.
      Symbol digit modalities test variant in a smartphone app for persons with multiple sclerosis: validation study.
      ). Subsequent repetitions revealed cohen's d values of less than 0.2. However, with the increase in frequency of testing, it has become apparent that significant practice effects continue for longer than previously thought. In the longitudinal one year follow up study of the same MSSherpa cohort, a local linear trend model was applied and this showed that practice effects lasted for up to two months before plateauing (
      • Lam K.H.
      • Bucur I.G.
      • Van Oirschot P.
      • De Graaf F.
      • Weda H.
      • Strijbis E.
      • Uitdehaag B.
      • Heskes T.
      • Killestein J.
      • De Groot V.
      Towards individualized monitoring of cognition in multiple sclerosis in the digital era: a one-year cohort study.
      ).
      In FLOODLIGHT Open, an average improvement of 25.4% was noted between the first and last scores (
      • Woelfle T.
      • Pless S.
      • Wiencierz A.
      • Kappos L.
      • Naegelin Y.
      • Lorscheider J.
      Practice effects of mobile tests of cognition, dexterity, and mobility on patients with multiple sclerosis: data analysis of a smartphone-based observational study.
      ). The majority (19.7%) occurred within the first five repetitions but there was still a significant long-term practice effect noted beyond the first five repetitions. In fact, in a multivariate model the number of repetitions had the largest beta coefficient for the difference between the fifth to last score. However, the R-squared value was only 0.3, suggesting that other factors may be at play. A long-term learning curve analysis predicted that with more tests performed, further improvement in scores can occur with an eventual average improvement over baseline of 40.8% (
      • Woelfle T.
      • Pless S.
      • Wiencierz A.
      • Kappos L.
      • Naegelin Y.
      • Lorscheider J.
      Practice effects of mobile tests of cognition, dexterity, and mobility on patients with multiple sclerosis: data analysis of a smartphone-based observational study.
      ).
      In Neufun, practice effects were examined in 15 pwMS who had longitudinal data with over 20 test sittings. Non-linear regression was utilised to show that on average, practice effects plateaued at eight repetitions, although it should be noted that two participants continued to experience ongoing practice effects beyond 20 repetitions.
      It should be noted that the possibility of participants practising in between tests was not explicitly excluded. Details including short versus long-term practice effects is included in Appendix Table 1.

      3.3.4 Feasibility

      Feasibility was reported for FLOODLIGHT, MSSherpa, elevateMS and MSCopilot. In FLOODLIGHT, 84.7% of participants reported at least acceptable impact on their daily activities. However actual adherence to testing (defined as at least 3 days per week of valid testing) was lower, with 70% to active and 79% to passive testing. After six months, 63.9% of participants indicated willingness to continue using the app. Of the test battery, the 2 min walk test (2MWT) was the least acceptable with 45% indicating that it was the one component of the battery they would remove. It should be noted that this cohort was asked to perform the tests daily. The actual frequency required to detect a clinically significant change is likely to be much less frequent, and thus real-world adherence and acceptability may be higher (
      • Midaglia L.
      • Mulero P.
      • Montalban X.
      • Graves J.
      • Hauser S.L.
      • Julian L.
      • Baker M.
      • Schadrack J.
      • Gossens C.
      • Scotland A.
      • Lipsmeier F.
      • van Beek J.
      • Bernasconi C.
      • Belachew S.
      • Lindemann M.
      Adherence and satisfaction of smartphone- and smartwatch-based remote active testing and passive monitoring in people with multiple sclerosis: nonrandomized interventional feasibility study.
      ).
      In elevateMS, only 46.1 to 50.0% of pwMS were compliant (defined as at least one sensor-based test per week in this study) over a 12-week follow up period (
      • Pratap A.
      • Grant D.
      • Vegesna A.
      • Tummalacherla M.
      • Cohan S.
      • Deshpande C.
      • Mangravite L.
      • Omberg L.
      Evaluating the utility of smartphone-based sensor assessments in persons with multiple sclerosis in the real-world using an App (elevateMS): observational, prospective pilot digital health study.
      ). Participants who were referred from MS clinics had a significantly higher retention rate compared to self-referred pwMS.
      In MSCopilot, 67.6% preferred the smartphone app to the traditional MSFC and 87.1% indicated that the test was easy to use. Acceptable frequency was at least once a month in 85.5% (
      • Maillart E.
      • Labauge P.
      • Cohen M.
      • Maarouf A.
      • Vukusic S.
      • Donzé C.
      • Gallien P.
      • De Sèze J.
      • Bourre B.
      • Moreau T.
      • Louapre C.
      • Mayran P.
      • Bieuvelet S.
      • Vallée M.
      • Bertillot F.
      • Klaeylé L.
      • Argoud A.L.
      • Zinaï S.
      • Tourbah A.
      MSCopilot, a new multiple sclerosis self-assessment digital solution: results of a comparative study versus standard tests.
      ). Finally, seven participants interviewed in the MSSherpa study revealed positive attitudes about the app (
      • van Oirschot P.
      • Heerings M.
      • Wendrich K.
      • den Teuling B.
      • Martens M.B.
      • Jongen P.J.
      Symbol digit modalities test variant in a smartphone app for persons with multiple sclerosis: validation study.
      ).
      Details with regards to feasibility is included in Appendix Table 1.

      3.3.5 Quality of studies

      The results of the methodological quality assessment of the included studies are presented in Appendix Table 2. Of the 12 included studies, all were classified as having good methodological quality with a low risk of bias. Blinding, levels of exposure and assessment of exposure prior to outcome assessment were not relevant to any of the studies. A lack of sample size calculation was common, affecting 10 out of the 12 included studies.

      4. Discussion

      Remote smartphone-based testing of cognition represents an attractive option for ecological assessment of cognition in pwMS. This is particularly relevant in the COVID-19 pandemic, which has emphasized the need for valid, reliable remote assessment of immunocompromised pwMS. This review shows that these smartphone apps are generally valid and reliable. Concurrent validity was comparable, if not better than contemporary cognitive tests. They were highly acceptable to pwMS, although actual compliance was lower across all apps with significant attrition rates. The ability to predict longitudinal change has not yet been demonstrated, as many of the studies have been cross-sectional. Some studies have also shown that MCID was smaller than SDC, suggesting that further work can be done to increase the sensitivity of these monitoring tools, particularly at an individual level.
      A smartphone version of the SDMT is utilised by the FLOODLIGHT, MSSherpa, MSCopilot, ElevateMS and NeuFun apps. Smartphone SDMTs lack standardization across different apps. There are also subtle yet key differences between smartphone and traditional SDMTs. Depending on the app, some smartphone SDMTs are not restricted to six keys for first 26 symbols, include no practice keys and have different symbols. Pixel sizes for symbols differ, and sizes of mobile buttons for answering are not standardized. The impact of these minor variations in smartphone SDMTs are unclear, and future studies may examine correlations between different versions of smartphone SDMTs. The raw overall score for smartphone SDMTs was significantly lower than the traditional written SDMT. The primary reason for this is likely due to the delay in display and reaction time as smartphone SDMTs only display one symbol at a time. Other possible reasons suggested include memorization from prior SDMT attempts and the use of only six symbols for the first 26 symbols in the traditional SDMT (
      • Pham L.
      • Harris T.
      • Varosanec M.
      • Morgan V.
      • Kosa P.
      • Bielekova B.
      Smartphone-based symbol-digit modalities test reliably captures brain damage in multiple sclerosis.
      ). In the Neufun version, a correction was added to the smartphone SDMT to better correlate with traditional SDMT scores (
      • Pham L.
      • Harris T.
      • Varosanec M.
      • Morgan V.
      • Kosa P.
      • Bielekova B.
      Smartphone-based symbol-digit modalities test reliably captures brain damage in multiple sclerosis.
      ).
      Concurrent validity with traditional SDMT and other established neuropsychological tests were examined in four out of the six apps and were generally strong. Concurrent validity with other clinical measures of MS such as EDSS, radiological features and other clinical measures was examined in five out of the six apps. This revealed moderate correlation coefficients, and crucially when compared to other traditional neuropsychiatric tests such as the PASAT and traditional SDMT, smartphone SDMT outperformed the PASAT and performed comparably, if not slightly better than the traditional SDMT.
      Discriminant validity was examined in five out of six apps and has shown the ability to discriminate not just between HC and pwMS, but also different levels of disability in pwMS. In particular, the sensitivity and specificity to distinguish pwMS with mild disability from those with more severe disability was greater than traditional measures such as PASAT and oral SDMT in MSCopilot (
      • Tanoh I.C.
      • Maillart E.
      • Labauge P.
      • Cohen M.
      • Maarouf A.
      • Vukusic S.
      • Donzé C.
      • Gallien P.
      • De Sèze J.
      • Bourre B.
      • Moreau T.
      • Louapre C.
      • Vallée M.
      • Bieuvelet S.
      • Klaeylé L.
      • Argoud A.L.
      • Zinaï S.
      • Tourbah A.
      MSCopilot: new smartphone-based digital biomarkers correlate with expanded disability status scale scores in people with multiple sclerosis.
      ). Test-retest reliability was reported for five out of the six apps and were generally in the good to excellent range, though specific test features on passive keystroke testing were less reliable. Given the exponential increase in the number of test features that can be derived from smartphone apps, further validity and reliability testing will allow us to select the best test features from these apps.
      The quality of the studies were generally good as assessed by the critical appraisal tool. Major limitations of the studies include the lack of a sample size calculation in 10 of the 12 studies. This would be particularly relevant when assessing longitudinal outcomes as sample sizes must be sufficient to have enough participants with a clinically significant change.
      There are still significant gaps in the literature with regards to the validity of smartphone apps monitoring cognition in MS. Ecological validity (correlation with employment and activities of daily living) has not been demonstrated. Much of the literature is cross-sectional and longitudinal data is lacking at this stage due to the short follow-up periods and small proportion of participants experiencing clinically significant improvement or deterioration. For example, in the FLOODLIGHT study, only seven participants experienced a significant change in their EDSS score over six months. Of the apps where longitudinal data was available, predictive validity was not shown, with AUC below 0.7 for prediction of a clinically significant change in the benchmark cognitive test (BICAMS and SDMT) in MSSherpa and Neurokeys. Furthermore, MCID exceeded SDC at the group level only in the Neurokeys app. Nevertheless, the results were still promising given that the AUC for Neurokeys still exceeded that of the clinical benchmark used in the study, the EDSS. Future studies with larger cohorts and longer follow-up periods will be able to investigate the ability of smartphone apps to predict clinically relevant outcomes longitudinally, and whether they have sufficient sensitivity for MCID to exceed the SDC. By proving ecological and predictive validity, HCPs would be more likely to base their clinical decision-making on smartphone-based digital biomarkers.
      The volume of data generated by smartphone applications have also allowed for novel methods to adjust for potential confounders of cognitive function in MS. This has been an area of ongoing controversy with conflicting findings on the effect of mood and fatigue on cognition (
      • Hoffmeister J.
      • Basso M.R.
      • Reynolds B.
      • Whiteside D.
      • Mulligan R.
      • Arnett P.A.
      • Combs D.R.
      Effects of diminished positive mood and depressed mood upon verbal learning and memory among people with multiple sclerosis.
      ;
      • Morrow S.A.
      • Weinstock-Guttman B.
      • Munschauer F.E.
      • Hojnacki D.
      • Benedict R.H.B.
      Subjective fatigue is not associated with cognitive impairment in multiple sclerosis: cross-sectional and longitudinal analysis.
      ;
      • Ruet A.
      • Deloire M.S.A.
      • Charré-Morin J.
      • Hamel D.
      • Brochet B.
      A new computerised cognitive test for the detection of information processing speed impairment in multiple sclerosis.
      ;
      • Whitehouse C.E.
      • Fisk J.D.
      • Bernstein C.N.
      • Berrigan L.I.
      • Bolton J.M.
      • Graff L.A.
      • Hitchon C.A.
      • Marriott J.J.
      • Peschken C.A.
      • Sareen J.
      • Walker J.R.
      • Stewart S.H.
      • Marrie R.A.
      Comorbid anxiety, depression, and cognition in MS and other immune-mediated disorders.
      ). Traditionally, researchers have tried to measure the effect of these confounders through lengthy questionnaires (
      • Diamond B.J.
      • Johnson S.K.
      • Kaufman M.
      • Graves L.
      Relationships between information processing, depression, fatigue and cognition in multiple sclerosis.
      ;
      • Hu M.
      • Muhlert N.
      • Robertson N.
      • Winter M.
      Perceived fatigue and cognitive performance change in multiple sclerosis: uncovering predictors beyond baseline fatigue.
      ;
      • Ruet A.
      • Deloire M.S.A.
      • Charré-Morin J.
      • Hamel D.
      • Brochet B.
      A new computerised cognitive test for the detection of information processing speed impairment in multiple sclerosis.
      ), which ignores the day-to-day fluctuations of mood and fatigue. The use of an in-app generated fatigue score and upper limb function surrogate as demonstrated by NeuFun allowed for adjustment of this at the time of testing without the need for collection of additional data. The passive monitoring of emoji sentiments demonstrated by
      • Lam K.H.
      • Twose J.
      • McConchie H.
      • Licitra G.
      • Meijer K.
      • de Ruiter L.
      • van Lierop Z.
      • Moraal B.
      • Barkhof F.
      • Uitdehaag B.
      • de Groot V.
      • Killestein J.
      Smartphone-derived keystroke dynamics are sensitive to relevant changes in multiple sclerosis.
      ) is another promising method for real-world mood monitoring and subsequent adjustment. Other apps such as FLOODLIGHT have a daily mood question as part of its battery. Further studies may examine the effect of adjusting for a smartphone app-based mood surrogate and its effect on cognitive performance.
      Long-term practice effects can occur with the smartphone SDMT as shown by FLOODLIGHT, NeuFun and MSSherpa. With new modelling techniques and increased test repetitions, it is now apparent that practice effects last longer than previously thought. Whilst the largest practice effect is still within the first two to three repetitions (consistent with studies in traditional SDMT), it has been demonstrated that eight or more repetitions may be required prior to stabilization of practice effects. A future study may examine the use of a non-linear correction to counteract the practice effect. There may also be a proportion of patients where this improvement in test score is a real effect, such as a response to their treatment regimen or recovery from a relapse. This is a known phenomenon with the EDSS, where a significant proportion of patients experience regression of their EDSS (
      • Kappos L.
      • Butzkueven H.
      • Wiendl H.
      • Spelman T.
      • Pellegrini F.
      • Chen Y.
      • Dong Q.
      • Koendgen H.
      • Belachew S.
      • Trojano M.
      Greater sensitivity to multiple sclerosis disability worsening and progression events using a roving versus a fixed reference value in a prospective cohort study.
      ), thus reducing the sensitivity of the test to detect a clinically significant change. A potential solution may be the application of a roving smartphone SDMT score, such as that suggested with the EDSS. A future study may use this to determine if this increases the sensitivity of smartphone apps to detect clinically significant cognitive change.
      Whilst acceptability was largely excellent amongst pwMS, compliance with testing decreased significantly with time and was lower than self-reported acceptability. In the FLOODLIGHT study, only 60.5% of pwMS adhered to at least three days of testing per week after 24 weeks. Real-world compliance is likely to be even lower than trial settings. However, it should be noted that test frequency in these studies were high – most required a minimum of two to three tests every week. It is likely that minimum test frequency required to detect a clinically relevant change in cognition will be less frequent, and this may reduce the risk of non-compliance. For example, analysis of longitudinal MSSherpa data suggests that testing frequency of once every 12 days is sufficient to detect a 4-point change in SDMT scores (
      • Lam K.H.
      • Bucur I.G.
      • Van Oirschot P.
      • De Graaf F.
      • Weda H.
      • Strijbis E.
      • Uitdehaag B.
      • Heskes T.
      • Killestein J.
      • De Groot V.
      Towards individualized monitoring of cognition in multiple sclerosis in the digital era: a one-year cohort study.
      ). Other strategies such as automated text reminders and passive monitoring may also improve compliance. Further studies may also explore the acceptability to HCP and healthcare organisations, particularly in light of the potential cost savings highlighted by a recent study (
      • Cloosterman S.
      • Wijnands I.
      • Huygens S.
      • Wester V.
      • Lam K.H.
      • Strijbis E.
      • den Teuling B.
      • Versteegh M.
      The potential impact of digital biomarkers in multiple sclerosis in The Netherlands: an early health technology assessment of MS sherpa.
      ).
      Other apps that did not meet criteria for inclusion in this study include the MS Screening test (
      • Cohen M.
      • Mondot L.
      • Fakir S.
      • Landes C.
      • Lebrun C.
      Digital biomarkers can highlight subtle clinical differences in radiologically isolated syndrome compared to healthy controls.
      ). This test involves a novel cognitive test where a number or letter is displayed on the screen from a random selection of 10 letters and five digits. The subject has to tap the screen when a letter appears, but do nothing if a number appeared instead. Whilst this was only subject to a small pilot study of 21 participants with radiologically isolated syndrome (RIS), it was able to discriminate HC from patients with RIS. This ability to detect subclinical cognitive dysfunction is promising and a follow-up study on early MS patients is planned. Other smartphone-based cognitive tests with a similar mechanism include the choice reaction test (part of the MSReactor battery) and further studies on this platform will be of interest (
      • Merlo D.
      • Darby D.
      • Kalincik T.
      • Butzkueven H.
      • van der Walt A.
      The feasibility, reliability and concurrent validity of the MSReactor computerized cognitive screening tool in multiple sclerosis.
      ,
      • Merlo D.
      • Stankovich J.
      • Bai C.
      • Kalincik T.
      • Zhu C.
      • Gresle M.
      • Lechner-Scott J.
      • Kilpatrick T.
      • Barnett M.
      • Taylor B.
      Association between cognitive trajectories and disability progression in patients with relapsing-remitting multiple sclerosis.
      ).
      Limitations of this study include the lack of a specific critical appraisal tool for smartphone apps. Whilst critical appraisal tools exist for validity and reliability studies, they are based largely on traditional in-clinic assessments and are not optimized for remote assessment tools such as smartphone apps with frequent repetitions. Another limitation specific to the Neurokeys app was that the most promising keystroke features for prediction of cognitive change were emoji sentiments, and the potential for depression and mood as a confounder for cognitive function was not considered in the study. Finally, none of the included studies specifically examined the effect of device type on results. Future studies could consider sensitivity analyses to confirm validity across different smartphone models.

      5. Conclusion

      This review shows that remote monitoring via smartphone apps is a feasible, valid and reliable method to collect quantifiable, objective data on cognition. Compared to contemporary cognitive tests, it may represent a more sensitive method to detect subclinical cognitive dysfunction. This may pave the way for more efficient trial recruitment in progressive MS. Whilst this does not replace formal neuropsychological testing by a trained neuropsychologist, it offers a low-cost, widely accessible way to assess cognition in pwMS. Further research should investigate the predictive and ecological validity of these smartphone apps and integrate them with other biomarkers to develop more accurate predictive tools.

      Funding

      This work was supported by the National Health and Medical Research Council (NHMRC), Multiple Sclerosis Research Australia (MSRA), AVANT Foundation, Australia and New Zealand Association of Neurologists (ANZAN).

      Declaration of Competing Interest

      The authors declare the following financial interests/personal relationships which may be considered as potential competing interests:
      Yi Chao Foong reports a relationship with Biogen that includes: travel reimbursement. Francesca Bridge reports a relationship with Biogen that includes: travel reimbursement. Anneke van der Walt reports a relationship with Novartis Pharmaceuticals Corporation that includes: consulting or advisory, funding grants, speaking and lecture fees, and travel reimbursement. Anneke van der Walt reports a relationship with Biogen that includes: consulting or advisory, funding grants, speaking and lecture fees, and travel reimbursement. Anneke van der Walt reports a relationship with Merck & Co Inc that includes: consulting or advisory, funding grants, speaking and lecture fees, and travel reimbursement. Anneke van der Walt reports a relationship with F Hoffmann-La Roche Ltd that includes: consulting or advisory, funding grants, speaking and lecture fees, and travel reimbursement. Anneke van der Walt reports a relationship with National Health and Medical Research Council that includes: funding grants. Anneke van der Walt reports a relationship with Multiple Sclerosis Research Australia that includes: funding grants. Melissa Gresle reports a relationship with Biogen that includes: funding grants. Melissa Gresle reports a relationship with F Hoffmann-La Roche Ltd that includes: funding grants. Daniel Merlo reports a relationship with Novartis that includes: speaking and lecture fees. Helmut Butzkueven reports a relationship with Novartis that includes: consulting or advisory, funding grants, and speaking and lecture fees. Helmut Butzkueven reports a relationship with Biogen Inc that includes: consulting or advisory, funding grants, and speaking and lecture fees. Helmut Butzkueven reports a relationship with Merck & Co Inc that includes: consulting or advisory, funding grants, and speaking and lecture fees. Helmut Butzkueven reports a relationship with F Hoffmann-La Roche Ltd that includes: consulting or advisory, funding grants, and speaking and lecture fees. Helmut Butzkueven reports a relationship with UCB Pharma SA that includes: consulting or advisory and speaking and lecture fees. Helmut Butzkueven reports a relationship with National Health and Medical Research Council that includes: funding grants. Helmut Butzkueven reports a relationship with Medical Research Future Fund that includes: funding grants. Helmut Butzkueven reports a relationship with Monash partners that includes: funding grants. Helmut Butzkueven reports a relationship with Trish MS research foundation that includes: funding grants. Helmut Butzkueven reports a relationship with Pennycook Foundation that includes: funding grants. Helmut Butzkueven reports a relationship with Multiple Sclerosis Research Australia that includes: funding grants. Helmut Butzkueven reports a relationship with MSBase Foundation that includes: consulting or advisory. Helmut Butzkueven reports a relationship with Oxford Health Policy Forum Brain Health Initiative that includes: consulting or advisory. Yi Chao Foong reports a relationship with National Health and Medical Research Council that includes: funding grants. Yi Chao Foong reports a relationship with Avant Foundation that includes: funding grants. Yi Chao Foong reports a relationship with Multiple Sclerosis Research Australia that includes: funding grants. Yi Chao Foong reports a relationship with Australian and New Zealand Association of Neurologists that includes: funding grants.

      Appendix. Supplementary materials

      References

        • Abou L.
        • Alluri A.
        • Fliflet A.
        • Du Y.
        • Rice L.A.
        Effectiveness of physical therapy interventions in reducing fear of falling among individuals with neurologic diseases: a systematic review and meta-analysis.
        Arch. Phys. Med. Rehabil. 2021; 102: 132-154
        • Benedict R.H.B.
        • Amato M.P.
        • DeLuca J.
        • Geurts J.J.G.
        Cognitive impairment in multiple sclerosis: clinical management, MRI, and therapeutic avenues.
        Lancet Neurol. 2020; 19: 860-871
        • Cloosterman S.
        • Wijnands I.
        • Huygens S.
        • Wester V.
        • Lam K.H.
        • Strijbis E.
        • den Teuling B.
        • Versteegh M.
        The potential impact of digital biomarkers in multiple sclerosis in The Netherlands: an early health technology assessment of MS sherpa.
        Brain Sci. 2021; 11: 1305https://doi.org/10.3390/brainsci11101305
        • Cohen M.
        • Mondot L.
        • Fakir S.
        • Landes C.
        • Lebrun C.
        Digital biomarkers can highlight subtle clinical differences in radiologically isolated syndrome compared to healthy controls.
        J. Neurol. 2021; 268: 1316-1322
        • Cumpston M.
        • Li T.
        • Page M.J.
        • Chandler J.
        • Welch V.A.
        • Higgins J.P.
        • Thomas J.
        Updated guidance for trusted systematic reviews: a new edition of the cochrane handbook for systematic reviews of interventions.
        Cochrane Database Syst. Rev. 2019; 10https://doi.org/10.1002/14651858.ED000142
        • Diamond B.J.
        • Johnson S.K.
        • Kaufman M.
        • Graves L.
        Relationships between information processing, depression, fatigue and cognition in multiple sclerosis.
        Arch. Clin. Neuropsychol. 2008; 23: 189-199
        • Fleiss J.L.
        • Levin B.
        • Paik M.C.
        Statistical Methods for Rates and Proportions.
        John Wiley & Sons, 2013
        • Goverover Y.
        • Chiaravalloti N.
        • DeLuca J.
        Brief International Cognitive Assessment for Multiple Sclerosis (BICAMS) and performance of everyday life tasks: actual reality.
        Mult. Scler. J. 2015; 22: 544-550
      1. National Institute of Health, 2014. Quality assessment tool for observational cohort and cross-sectional studies [Internet]. Available from: https://www.nhlbi.nih.gov/health-topics/study-quality-assessment-tools.

        • Hoffmeister J.
        • Basso M.R.
        • Reynolds B.
        • Whiteside D.
        • Mulligan R.
        • Arnett P.A.
        • Combs D.R.
        Effects of diminished positive mood and depressed mood upon verbal learning and memory among people with multiple sclerosis.
        J. Clin. Exp. Neuropsychol. 2021; 43: 117-128
        • Hu M.
        • Muhlert N.
        • Robertson N.
        • Winter M.
        Perceived fatigue and cognitive performance change in multiple sclerosis: uncovering predictors beyond baseline fatigue.
        Mult. Scler. Relat. Disord. 2019; 32: 46-53
        • Kalb R.
        • Beier M.
        • Benedict R.H.B.
        • Charvet L.
        • Costello K.
        • Feinstein A.
        • Gingold J.
        • Goverover Y.
        • Halper J.
        • Harris C.
        • Kostich L.
        • Krupp L.
        • Lathi E.
        • LaRocca N.
        • Thrower B.
        • DeLuca J.
        Recommendations for cognitive screening and management in multiple sclerosis care.
        Mult. Scler. J. 2018; 24: 1665-1680
        • Kappos L.
        • Butzkueven H.
        • Wiendl H.
        • Spelman T.
        • Pellegrini F.
        • Chen Y.
        • Dong Q.
        • Koendgen H.
        • Belachew S.
        • Trojano M.
        Greater sensitivity to multiple sclerosis disability worsening and progression events using a roving versus a fixed reference value in a prospective cohort study.
        Mult. Scler. 2018; 24: 963-973
        • Lam K.H.
        • Bucur I.G.
        • Van Oirschot P.
        • De Graaf F.
        • Weda H.
        • Strijbis E.
        • Uitdehaag B.
        • Heskes T.
        • Killestein J.
        • De Groot V.
        Towards individualized monitoring of cognition in multiple sclerosis in the digital era: a one-year cohort study.
        Mult. Scler. Relat. Disord. 2022; 60103692
        • Lam K.H.
        • Twose J.
        • McConchie H.
        • Licitra G.
        • Meijer K.
        • de Ruiter L.
        • van Lierop Z.
        • Moraal B.
        • Barkhof F.
        • Uitdehaag B.
        • de Groot V.
        • Killestein J.
        Smartphone-derived keystroke dynamics are sensitive to relevant changes in multiple sclerosis.
        Eur. J. Neurol. 2022; 29: 522-534
        • Lam K.H.
        • Meijer K.A.
        • Loonstra F.C.
        • Coerver E.M.E.
        • Twose J.
        • Redeman E.
        • Moraal B.
        • Barkhof F.
        • de Groot V.
        • Uitdehaag B.M.J.
        • Killestein J.
        Real-world keystroke dynamics are a potentially valid biomarker for clinical disability in multiple sclerosis.
        Mult. Scler. J. 2020; 27: 1421-1431
        • Lam K.H.
        • van Oirschot P.
        • den Teuling B.
        • Hulst H.E.
        • de Jong B.A.
        • Uitdehaag B.M.J.
        • de Groot V.
        • Killestein J.
        Reliability, construct and concurrent validity of a smartphone-based cognition test in multiple sclerosis.
        Mult. Scler. J. 2022; 28: 300-308
        • Lysandropoulos A.
        • Havrdova E.
        ‘Hidden'factors influencing quality of life in patients with multiple sclerosis.
        Eur. J. Neurol. 2015; 22: 28-33
        • Maillart E.
        • Labauge P.
        • Cohen M.
        • Maarouf A.
        • Vukusic S.
        • Donzé C.
        • Gallien P.
        • De Sèze J.
        • Bourre B.
        • Moreau T.
        • Louapre C.
        • Mayran P.
        • Bieuvelet S.
        • Vallée M.
        • Bertillot F.
        • Klaeylé L.
        • Argoud A.L.
        • Zinaï S.
        • Tourbah A.
        MSCopilot, a new multiple sclerosis self-assessment digital solution: results of a comparative study versus standard tests.
        Eur. J. Neurol. 2020; 27: 429-436
        • Merlo D.
        • Darby D.
        • Kalincik T.
        • Butzkueven H.
        • van der Walt A.
        The feasibility, reliability and concurrent validity of the MSReactor computerized cognitive screening tool in multiple sclerosis.
        Ther. Adv. Neurol. Disord. 2019; 12 (1756286419859183)
        • Merlo D.
        • Stankovich J.
        • Bai C.
        • Kalincik T.
        • Zhu C.
        • Gresle M.
        • Lechner-Scott J.
        • Kilpatrick T.
        • Barnett M.
        • Taylor B.
        Association between cognitive trajectories and disability progression in patients with relapsing-remitting multiple sclerosis.
        Neurology. 2021; 97: e2020-e2031
        • Midaglia L.
        • Mulero P.
        • Montalban X.
        • Graves J.
        • Hauser S.L.
        • Julian L.
        • Baker M.
        • Schadrack J.
        • Gossens C.
        • Scotland A.
        • Lipsmeier F.
        • van Beek J.
        • Bernasconi C.
        • Belachew S.
        • Lindemann M.
        Adherence and satisfaction of smartphone- and smartwatch-based remote active testing and passive monitoring in people with multiple sclerosis: nonrandomized interventional feasibility study.
        J. Med. Internet Res. 2019; 21: e14863
        • Moher D.
        • Liberati A.
        • Tetzlaff J.
        • Altman D.G.
        Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement.
        Ann. Intern. Med. 2009; 151: 264-269
        • Montalban X.
        • Graves J.
        • Midaglia L.
        • Mulero P.
        • Julian L.
        • Baker M.
        • Schadrack J.
        • Gossens C.
        • Ganzetti M.
        • Scotland A.
        • Lipsmeier F.
        • van Beek J.
        • Bernasconi C.
        • Belachew S.
        • Lindemann M.
        • Hauser S.L.
        A smartphone sensor-based digital outcome assessment of multiple sclerosis.
        Mult. Scler. 2021; 28: 654-664
        • Morrow S.A.
        • Weinstock-Guttman B.
        • Munschauer F.E.
        • Hojnacki D.
        • Benedict R.H.B.
        Subjective fatigue is not associated with cognitive impairment in multiple sclerosis: cross-sectional and longitudinal analysis.
        Mult. Scler. J. 2009; 15: 998-1005
        • Mukaka M.M.
        A guide to appropriate use of correlation coefficient in medical research.
        Malawi Med. J. 2012; 24: 69-71
        • Peters J.
        • Abou L.
        • Wong E.
        • Dossou M.S.
        • Sosnoff J.J.
        • Rice L.A.
        Smartphone-based gait and balance assessment in survivors of stroke: a systematic review.
        Disabil. Rehabil. Assist. Technol. 2022; : 1-11https://doi.org/10.1080/17483107.2022.2072527
        • Pham L.
        • Harris T.
        • Varosanec M.
        • Morgan V.
        • Kosa P.
        • Bielekova B.
        Smartphone-based symbol-digit modalities test reliably captures brain damage in multiple sclerosis.
        NPJ Digit. Med. 2021; 4: 36
        • Pratap A.
        • Grant D.
        • Vegesna A.
        • Tummalacherla M.
        • Cohan S.
        • Deshpande C.
        • Mangravite L.
        • Omberg L.
        Evaluating the utility of smartphone-based sensor assessments in persons with multiple sclerosis in the real-world using an App (elevateMS): observational, prospective pilot digital health study.
        JMIR Mhealth Uhealth. 2020; 8: e22108
        • Raggi A.
        • Covelli V.
        • Schiavolin S.
        • Scaratti C.
        • Leonardi M.
        • Willems M.
        Work-related problems in multiple sclerosis: a literature review on its associates and determinants.
        Disabil. Rehabil. 2016; 38: 936-944
        • Rice M.E.
        • Harris G.T.
        Comparing effect sizes in follow-up studies: ROC area, Cohen's d, and r.
        Law Hum. Behav. 2005; 29: 615-620
        • Ruet A.
        • Deloire M.S.A.
        • Charré-Morin J.
        • Hamel D.
        • Brochet B.
        A new computerised cognitive test for the detection of information processing speed impairment in multiple sclerosis.
        Mult. Scler. J. 2013; 19: 1665-1672
        • Schiavolin S.
        • Leonardi M.
        • Giovannetti A.M.
        • Antozzi C.
        • Brambilla L.
        • Confalonieri P.
        • Mantegazza R.
        • Raggi A.
        Factors related to difficulties with employment in patients with multiple sclerosis: a review of 2002–2011 literature.
        Int. J. Rehabil. Res. 2013; 36: 105-111
        • Strober L.
        • DeLuca J.
        • Benedict R.H.B.
        • Jacobs A.
        • Cohen J.A.
        • Chiaravalloti N.
        • Hudson L.D.
        • Rudick R.A.
        • LaRocca N.G.
        Symbol digit modalities test: a valid clinical trial endpoint for measuring cognition in multiple sclerosis.
        Mult. Scler. J. 2018; 25: 1781-1790
        • Tanoh I.C.
        • Maillart E.
        • Labauge P.
        • Cohen M.
        • Maarouf A.
        • Vukusic S.
        • Donzé C.
        • Gallien P.
        • De Sèze J.
        • Bourre B.
        • Moreau T.
        • Louapre C.
        • Vallée M.
        • Bieuvelet S.
        • Klaeylé L.
        • Argoud A.L.
        • Zinaï S.
        • Tourbah A.
        MSCopilot: new smartphone-based digital biomarkers correlate with expanded disability status scale scores in people with multiple sclerosis.
        Mult. Scler. Relat. Disord. 2021; 55103164
        • van Oirschot P.
        • Heerings M.
        • Wendrich K.
        • den Teuling B.
        • Martens M.B.
        • Jongen P.J.
        Symbol digit modalities test variant in a smartphone app for persons with multiple sclerosis: validation study.
        JMIR Mhealth Uhealth. 2020; 8: e18160
        • Weber E.
        • Goverover Y.
        • DeLuca J.
        Beyond cognitive dysfunction: relevance of ecological validity of neuropsychological tests in multiple sclerosis.
        Mult. Scler. J. 2019; 25: 1412-1419
        • Whitehouse C.E.
        • Fisk J.D.
        • Bernstein C.N.
        • Berrigan L.I.
        • Bolton J.M.
        • Graff L.A.
        • Hitchon C.A.
        • Marriott J.J.
        • Peschken C.A.
        • Sareen J.
        • Walker J.R.
        • Stewart S.H.
        • Marrie R.A.
        Comorbid anxiety, depression, and cognition in MS and other immune-mediated disorders.
        Neurology. 2019; 92: e406
        • Woelfle T.
        • Pless S.
        • Wiencierz A.
        • Kappos L.
        • Naegelin Y.
        • Lorscheider J.
        Practice effects of mobile tests of cognition, dexterity, and mobility on patients with multiple sclerosis: data analysis of a smartphone-based observational study.
        J. Med. Internet Res. 2021; 23: e30394
        • Wojcik C.M.
        • Beier M.
        • Costello K.
        • DeLuca J.
        • Feinstein A.
        • Goverover Y.
        • Gudesblatt M.
        • Jaworski 3rd, M.
        • Kalb R.
        • Kostich L.
        • LaRocca N.G.
        • Rodgers J.D.
        • Benedict R.H.
        Computerized neuropsychological assessment devices in multiple sclerosis: a systematic review.
        Mult. Scler. 2019; 25: 1848-1869