Introduction

Music and cochlear implants

With the contemporary cochlear implants (CIs), users of all ages can achieve high levels of speech perception [1]. Nonetheless, musical perception is particularly challenging for these individuals. This challenge arises from biological constraints imposed by their auditory pathology, and the spectral intricacies of music, as opposed to speech, makes music perception a demanding task [2].

CI users employ different strategies to deal with deficits in musical perception. While many children or adolescents with congenital or prelingual hearing loss view music positively [3] and are actively involved in musical activities [4], adult with CI tend to avoid music due to its perceived unpleasantness [5, 6]. This issue can affect the overall quality of life for CI users, as music is present in countless social and cultural contexts in daily life [7].

In two studies carried out by Lassaletta et al., a positive correlation was found between the perceived quality of music and overall quality of life. The studies indicated that users who spend more time listening to music after implantation tend to exhibit higher quality of life scores [6, 8]. These findings further support the importance of music.

In this context, various programs aimed at training and evaluating musical perception following cochlear implantation have been developed [9,10,11,12,13,14], as musical training appears to have significant and lasting effects, particularly when initiated at an early developmental stage [9]. This training can enhance musical perception mediated by CIs. However, many of the studies conducted so far involve in-person musical training programs that are time-consuming and less accessible. Online resources are a growing potential resource. In Jiam’s study [11], timbre identification scores improved to a greater extent after online musical training. Smith et al. [13] found that scores on post-training diagnostic tests using self-administered musical training software were better compared to pre-training scores, with significant improvements observed in the perception of musical patterns.

Musical training with platform may be used as supplement to the standard speech therapy rehabilitation [15] in patients following cochlear implantation. It serves as an adjunctive therapy training program, rather than replacing traditional rehabilitation methods. However, a “Gold Standard” for evaluating musical experience has not been adopted [16], which would allow for the individualization of auditory training for each patient.

The Meludia platform (Meludia, Paris, FR) is among the various tools designed for musical training. A recent study has proposed Meludia as a suitable means for objectively evaluating musical perception and as a tool for musical training in CI users over 6 years old [17]. However, access to Meludia and similar platforms is not widespread across all Centers where CI surgeries and rehabilitation are conducted. Consequently, alternative tools such as questionnaires play a crucial role in assessing the perception and enjoyment of music in these patients. The Munich Music Questionnaire (MUMU) and the Music Related Quality of Life Questionnaire (MuRQoL) have been validated for evaluating these aspects [7].

Need for the validation of MuRQoL to Spanish

This questionnaire, originally in English, is highly comprehensive and evaluates the impact of music on quality of life across two dimensions: the frequency of perception and musical engagement (questions 1–18, part I), as well as musical importance (questions 1–18, part II). It is a questionnaire that can be easily administered during a routine visit to the Clinic. Recently, versions in Turkish [18] and Italian [19] have been validated. Spanish is the world's second-most spoken native language after Mandarin Chinese. To the best of our knowledge, the MuRQoL has not been validated in Spanish before.

Objectives

The primary goal of this study was to translate and validate a Spanish version of the MuRQoL (sMuRQoL), making it widely available in clinical settings and allowing us to better understand the needs of adult CI users.

In the original MuRQoL’s instructions for use [7], it is also suggested that, for a comprehensive assessment of music, it is preferable to combine the MuRQoL questionnaire with music perception tests. We deemed it pertinent to explore whether this subjective level of musical perception (high or low) corresponded to outcomes (adequate or insufficient) in objective musical training tasks on the Meludia platform. In this manner the sMuRQoL could be utilized both for its initial purpose—assessing Music-Related Quality of Life- and for identifying individuals who may be “at risk” for poorer musical perception. This application of the MuRQoL was suggested in the original article [7]. Consequently, we could identify these patients much earlier and introduce them to musical training programs following cochlear implantation. Hence, a secondary objective was to gather evidence related to the convergent validity and discriminative capacity concerning musical perception using the sMuRQoL. This was achieved by comparing the scores of the sMuRQoL’s Frequency Test with the outcomes of the same patients in the objective musical perception test, Meludia.

Materials and methods

Translation of MuRQoL

The MuRQoL consists of two sections, each comprising 18 questions. The first part -Frequency Test- examines subjective musical perception abilities, while the second part -Importance Test- evaluates the perceived importance of musical perception and attitudes towards involvement with music [7]. Responses are rated on a 5-point Likert scale ranging from 1 (never) to 5 (always). The scores for both frequency and importance of each item can be compared to create a visual representation of how each of the 18 music perception/engagement tasks impacts the quality of life of individual CI users. When music is regarded as important but the frequency score for the perception or engagement subscale (or the overall frequency scale score) is low, it suggests that music may have a significant negative effect on quality of life. A patient scoring within this critical range for a music perception/engagement item indicates a need for music training, targeting specific musical aspects. This approach enables the MuRQoL measure to serve as a diagnostic tool for identifying individuals requiring music-related support and for guiding music training efforts.

The items and the overall score of the sMuRQoL underwent validation based on the psychometric criteria of classical test theory [20]. The translation of the “Music Related Quality of Life Questionnaire” into a Spanish version was carried out with the approval of the author of the original questionnaire. This involved the translation of the MuRQoL into Spanish by bilingual individuals, followed by a back-translation from Spanish to English conducted by professional translators. The back-translation was compared with the original questionnaire and the final version of the sMuRQoL (see Supplementary material) was approved.

Meludia

Currently, Meludia is available in up to 20 languages, some of them are English, French, German, Italian or Spanish. For this study, the Spanish version of the Meludia Discovery module [21] was employed for assessment, comprising five tasks, each with five levels of difficulty, totaling 25 exercises:

  • Rhythm: Determining the number of percussive hits heard.

  • Spatialization: Identifying whether the second note is higher or lower.

  • Melody: Recognizing whether the melody is ascending or descending.

  • Stable/unstable: Assessing whether the sound feels stable or unstable.

  • Density: Estimating how many sounds are played simultaneously—whether it’s one or many.

Before the assessment, a Spanish version program demonstration was provided to all participants in a quiet setting. NH subjects used headphones connected to a laptop.

CI users, utilized the direct audio input (DAI) cable to ensure that assessment hearing was conducted solely through the CI, eliminating input from the contralateral ear in cases of residual hearing. Bilaterally implanted participants were assessed with both implants simultaneously.

For each exercise, Meludia generates a numerical score ranging from 0 to 3. A score of 0 signifies an incomplete attempt, while 3 indicates the fastest and most accurate performance on the exercise. Participants were allowed to restart each level up to four times. If a level remained incomplete after four attempts, that task was considered unfinished. The program proceeded to the next exercise when the 5 levels of each task were completed or unfinished before.

The tasks were presented in the following sequence: Rhythm, Spatialization, Melody, Stable/unstable and Density (See Supplementary material for an example of Meludia Testing Procedure). The Meludia testing procedure require an average time of 60 min to be completed.

Participants

The minimum sample size was calculated considering the maximum variance of the original study scale [7] and considering a 95% confidence interval. Therefore, with the variance of the original study being around 16 and considering a 95% confidence interval, a minimum sample size of 123 (122.93) individuals is estimated to be necessary.

We recruited two groups of participants: 55 individuals with CI and 74 subjects with NH. The group of CI users consisted of postlingually deaf individuals aged ≥ 17 years. All of them had been implanted with a MEDEL CI system (MED-EL GmbH, Innsbruck, Austria), either unilaterally or bilaterally. Additionally, they had a minimum of 10 active electrodes and had maintained a stable fitting (at least one year without any changes in the fitting of the CI).

The individuals with NH were selected from the Otolaryngology Outpatient clinics, specifically among those seeking consultation for conditions unrelated to hearing. The participants in this group did not have a formal musical academic background, and their ages were matched to those in the CI user group. Additionally, they underwent an audiometric evaluation and only those with a PTA < 30 dB were included in the study.

All participants were proficient in Spanish and did not have simultaneous visual or cognitive impairments that could interfere with task performance.

Ethics committee

The design of the study was approved by the local Ethics Committee (approval number HULP PI-4447) and was registered in ClinicalTrials.gov (identifier NCT05319678). No adverse events were reported during the study.

The research was conducted in accordance with the principles of the Declaration of Helsinki [22] and informed consent was obtained from each participant.

Data collection

The final version of the sMuRQoL was completed firstly in-person under supervision during Otolaryngology consultations, both for NH subjects and CI users. For those subjects who took the test in two occasions − 58 NH subjects and 37 CI users-, some answered it again in-person, while others did so through an online self-administered form.

Data analysis

Only fully completed questionnaires with fewer than 3 N/A responses were considered. For those who underwent the test–retest, only patients who completed it within the specific time frame (15 days after the initial assessment) were selected.

  1. 1.

    Firstly, descriptive statistics (mean, standard deviation, skewness, and kurtosis) of the instrument’s items were analyzed. The discrimination indices of the items (corrected item-test correlation) were analyzed, considering them adequate when they were above 0.20 [23].

  2. 2.

    The internal structure of the sMuRQoL was analyzed through an Exploratory Factor Analysis (EFA). This procedure was conducted for both the first part of the questionnaire—Frequency test—and the second part—Importance test. As all items exhibited suitable values of skewness and kurtosis (within a range of ± 1), and the number of response alternatives was five, the EFA was conducted on the Pearson correlation matrix [24]. Unweighted Least Square (ULS) was employed as the estimation method. The KMO index and Bartlett’s test of sphericity were used to assess the data’s suitability for the EFA. A value of KMO above 0.70 and a statistically significant Bartlett’s test value (p < 0.05) indicate that the data are suitable for factor analysis. The Measure of Sampling Adequacy (MSA) was also calculated, where values of MSA below 0.50 suggest that the item does not measure the same domain as the remaining items in the pool, and so it should be removed. Promin was used as the rotation method since it deals with oblique factors. The percentage of explained variance was assessed and fit indices such as Comparative Fit Index (CFI), and Root Mean Square of Residuals (RMSR) were used, establishing a good fit when CFI > 0.95 and RMSEA < 0.08 [25].

  3. 3.

    The reliability of the sMuRQoL scores were examined. For the study of internal consistency, Cronbach alfa’s coefficient and McDonald’s omega coefficient were used. Additionally, the stability of the scores (test–retest reliability) was studied through the intraclass correlation coefficient. Values above 0.70 indicate good internal consistency and stability of the scores.

  4. 4.

    The mean score was calculated for the frequency and importance tests, as well as the perception and engagement subscales for each group of participants (CI and NH). Similar to the original MuRQoL validation study and based on literature indicating significantly poorer musical performance of adult CI users compared to adults with NH in pitch perception, timbre, and recognition of familiar melodies, the hypothesis was formulated that the Frequency test scores of the sMuRQoL for adults with NH was significantly higher than those of CI users for the overall scale and subscales. Regarding the scores in the Importance test, we hypothesized that no significant differences would be expected between both groups, as there is no evidence of a different level of importance of music between NH subjects and CI users. To study potential differences in sMuRQoL scores between NH subjects and CI users, an independent samples t-test was conducted and Cohen’s d was used as the effect size. Effect sizes were categorized as small for values between 0.2 and 0.5, medium for values between 0.5 and 0.8 and largo values for above 0.8 [26].

  5. 5.

    The potential convergent validity of the sMuRQoL in relation to other tools was examined. As the first part of the MuRQoL-Frequency Test- includes a series of questions about individual ability to perceive music—especially items 1–11—, the hypothesis was formulated that those subjects who scored higher in the Frequency scale would also achieve better results in the Meludia Discovery module. In the Importance scale, no differences are expected between groups in the Meludia results since the subjective importance of music for each individual does not confer greater musical perception ability. For this analysis, the Pearson correlation was calculated between the scores of the instrument and the score of the first attempt in each of the five categories of Meludia (Rhythm, Spatialization, Stable/unstable, Melody and Density). Of the 129 participants who completed the sMuRQoL, 79 of them completed Meludia (35 CI users and 44 NH individuals).

  6. 6.

    Finally, ROC curves were calculated to study the discriminative capacity of the sMuRQoL in detecting subjects with heightened musical perception. In the original validation, it was concluded that the MuRQoL has the potential as a tool for pinpointing those who could benefit from musical training. However, this had to be demonstrated in clinical practice and future experience. In this regard, we hypothesized a potential discriminative capacity in the first part of the test (Frequency), and to substantiate this, we used the five Meludia Discovery tasks completed in a single attempt as the “Gold Standard”. The discriminative capacity was assessed using the area under the curve (AUC). Values between 0.5 and 0.6 are considered inadequate, acceptable between 0.6 and 0.8, and about 0.8 are considered quite good.

EFA were conducted using the FACTOR 12.03.02 program [27]. The rest of the statistical analyses were performed using the SPSS v27 (IBM Corp, 2021).

Results

A final sample of 129 patients was obtained, of which 95 of them (74%) completed the questionnaire on two occasions with a 15-day interval between determinations.

The demographic and clinical data (gender, age, patient type, educational level, time and type of cochlear implantation), as well as the performed instruments, are recorded and summarized in Table 1.

Table 1 Demographic and clinical data and instruments performed

Analysis of the sMuRQoL items

Descriptive statistics, discrimination indices and appropriateness of the sMuRQoL items were examined. Each item exhibited suitable values in skewness and kurtosis (Table 2), ranging between ± 1. The discriminative capacity, evaluated according to its presumed dimension, is exceptionally high for each item (D.I. [0.412–0.821]).

Table 2 Descriptive statistics, discrimination indices and appropriateness of the items

Exploratory factor analysis (EFA)

The internal structure of the sMuRQoL was explored through EFA, conducted separately for the first part of the test—Frequency test—and the second part—Importance test. With respect to the Frequency test, a two-factor structure (perception and engagement) was explored. Both KMO (0.933) and the Bartlett statistic (p < 0.001) indicate that the data are suitable for factor analysis. In addition, the MSA (Table 2) for each item was very high, well above the cutoff point of 0.5. The percentage of variance explained by the two factors was 57.54%. The model fit was very good (CFI = 999; RMSR = 0.045), and the factorial loadings were appropriately distributed considering the two theoretical factors (Table 3). Regarding the Importance test, a two-factor structure (perception and engagement) was also explored. Once again, KMO (0.916) and Bartlett (p < 0.001) indicated that the data were suitable for conducting the factorial analysis, and the MSA (Table 2) for each item was very high. The percentage of variance explained by the two factors was 59.78% and the model fit was adequate (CFI = 0.999; RMSR = 0.059). Regarding the factorial loadings, they were appropriately distributed, except for item 10, which showed a low factor loading on its reference factor (Table 3). Additionally, in both parts of the test (Frequency and Importance), it would be possible to consider a total score for each, given the high correlation between the factors (frequency: r = 0.827; importance: r = 0.805).

Table 3 Factorial loadings of the items in the exploratory factor analysis

Reliability and stability of the scores of sMuRQoL

Regarding the reliability of the sMuRQoL, the internal consistency of the scores across different dimensions of the test was very high, as well as the stability of the scores (test–retest) for the four dimensions, as reflected in the results of the intraclass correlation coefficient (Table 4).

Table 4 Reliability and stability of the Scores of the sMuRQoL

Differences in sMuRQoL scores between CI users and NH individuals

It was observed that NH subjects exhibit significantly higher scores in the Frequency test compared to CI users. Similar to the original study [7], the effect was more pronounced in the perception and the overall frequency subscales than in the engagement subscale. In line with this, no statistically significant differences were found regarding the Importance test, even though participants in the control group exhibited higher scores than those with CI (Table 5).

Table 5 Study of differences in scores of the sMuRQoL between Implanted individuals and the Control group

Convergent validity analysis of the sMuRQoL

Regarding the convergent validity of the sMuRQoL in relation to other instruments, MuRQoL scores exhibited positive correlations with scores of Meludia Discovery module tasks on the first attempt for each of its categories (Table 6). Specifically, the Frequency test showed high correlations with all scores on Meludia, particularly standing out in Stable/unstable, Melody and Density.

Table 6 Pearson correlations between sMuRQoL scores and scores of Meludia on the first attempt (n = 76)

Analysis of the discriminative capacity of the instrument regarding musical perception

The discriminative capacity of sMuRQoL was assessed, considering the completion of the five Meludia Discovery module tasks in a single attempt as the “Gold Standard”. The ROC curves are depicted in Fig. 1, calculated using the total scores of Frequency and Importance. Additionally, the area under the curve (AUC) for each Meludia Discovery module task was calculated, showing adequacy across all tasks in relation to the Frequency test. Within these, a notable discriminative capacity was observed, particularly in Spatialization, Stable/Unstable and Density. There was no significant association in Meludia tasks with the Importance test, as expected. (Rhythm: AUCfrequency = 0.645; AUCimportance = 0.534. Spatialization: AUCfrequency = 0.816; AUCimportance = 0.547. Stable/Unstable: AUCfrequency = 0.783; AUCimportance = 0.675. Melody: AUCfrequency = 0.764; AUCimportance = 0.629. Density: AUCfrequency = 0.749; AUCimportance = 0.567).

Fig. 1
figure 1

Discriminative capacity of the sMuRQoL based on completing Meludia tasks in a single attempt

Discussion

The validation in Turkish and Italian languages of the MuRQoL confirmed the initial findings regarding the reliability and validity of the questionnaire [18, 19]. Our results in 55 CI users and 74 individuals with NH are similar to those obtained in the original MuRQoL and in its two validations in Turkish and Italian, supporting the instrument’s strong cross-cultural validity, as previously reflected in other validations [19].

On the other hand, in the validation of the original MuRQoL [7], the questionnaire’s ability to predict aspects of quality of life was assessed. This was evaluated by demonstrating convergent validity between the MuRQoL Frequency test and the SF12v2 quality of life questionnaire. The SF12v2 is a quality of life questionnaire that can be used to produce utility scores for economic evaluations [7]. In this regard, we aimed to assess the potential convergent validity of the sMuRQoL and its discriminative capacity regarding musical perception using the Meludia tool as a reference. The obtained results have been satisfactory, revealing a positive correlation—showing moderate to high correlations (r = 0.24–0.65)—where individuals with high scores in the Frequency Test are capable of achieving better outcomes in Meludia’s Discovery level. This observation aligns with our preliminary findings, suggesting Meludia as a useful tool in assessing and training musical perception. It is noteworthy that these analyses demonstrate the strong discriminative ability of the sMuRQoL instrument—specifically, the Frequency Test—based on good or poor musical perception. However, this questionnaire, being a self-report measure, will never function as a substitute for an objective measure like Meludia, but rather as a complementary tool, cause music training is an intervention aimed at improving auditory perception and processing through various tools of musical activities [9, 10, 13, 17], while the diagnosis of musical perception is a clinical assessment of an individual’s ability to perceive and cognitively process music. The discriminative capacity of the sMuRQoL could prove valuable in clinical practice, as specific results from the AUC of the sMuRQoL Frequency test could be used to establish a cutoff score. This cutoff score would help identify CI users with poorer musical perception, indicating those who would benefit from more extensive musical training. This is particularly relevant since it has been demonstrated that musical training also enhances speech perception, reinforcing the benefits of CI use in other auditory domains [13, 14]. Specifically, in the case of using Meludia, where sessions last at least 1 h, the sMuRQoL would enable us to improve CI users with poorer musical perception selection to optimize available resources.

The MuRQoL questionnaire [7], sound quality assessments [28], and musical perception evaluation platforms like Meludia [17], are some of the suitable tools for assessing the outcomes of music perception in CI users. Therefore, we believe that the sMuRQoL is valuable in evaluating the quality of life in already implanted patients. It serves as a useful tool to discriminate and assist in selecting those CI users who are likely to have a poorer musical perception. In the future, it would be interesting to explore potential differences in questionnaire scores between patients with unilateral or bilateral implantation and those with bimodal implantation. Additionally, an adaptation of the questionnaire for pediatric populations should be considered.

Limitations

We acknowledge that this study has several limitations inherent to the characteristics of our center and the participant’s sample:

Firstly, the sample of CI users consists solely of patients implanted with a MED-EL System.

Secondly, the age range of the participants is quite broad. Nonetheless, we believe that both groups—CI users and NH—represent highly homogeneous populations with no statistically significant differences between them.

Furthermore, it would be interesting to increase the sample size in future studies. Although the sample size is statistically sufficient for the validation procedure, being a single-center study has limited the sample. In the same sense, the test–retest was carried out with 95 individuals from the total of 129 participants, representing nearly 75% of the sample. The main reason for not applying it to all 129 participants is that those who did not undergo the test–retest did not meet the established temporal criterion, thus preventing the second evaluation from being conducted. However, a sample size of 95 individuals is sufficient to provide evidence of score stability when it accounts for 75% of the total sample obtained.

Finally, there are some inherent limitations of learning and training music listening. Thus, some potential participants declined to participate in the research. This was primarily due to two situations: on one hand, the unpleasant perception of music [5, 6]—in the CI users group—or lack of interest in this field, not perceiving poorer musical perception as a problem—both NH and CI users -; and on the other hand, the completion of Meludia and self-administered online questionnaires require a minimum level of computer literacy, which some elderly participants lacked and were therefore excluded from the study.

Conclusions

The sMuRQoL is a reliable instrument with adequate evidence of validity based on internal structure. The present study offers an accessible, low-cost, and quickly executable tool in Spanish. The sMuRQoL has proven to be a reliable instrument in discriminating among patients with greater difficulties in musical perception. Therefore, it constitutes a valuable tool for selecting individuals within CI users who might benefit from musical training platforms such as Meludia.