Evaluation of the palliative symptom burden score (PSBS) in a specialised palliative care unit of a university medical centre - a longitudinal study

Abstract

Background

The implementation of standardised, valid and reliable measurements in palliative care is subject to practical and methodological challenges. One aspect of ongoing discussion is the value of systematic proxy-based assessment of symptom burden in palliative care. In 2011, an expert-developed proxy-based instrument for the assessment of symptom burden in palliative patients, the Palliative Symptom Burden Score (PSBS), was implemented at the Specialised Palliative Care Unit of the University Medical Centre in Dusseldorf, Germany. The present study investigated its feasibility, acceptance and psychometric properties.

Methods

The PSBS was rated by nursing staff three times a day over 5 years (N = 820 patients). Feasibility and nurses’ acceptance of PSBS were analysed. Structural validity was investigated by principal component analysis. Construct validity was examined via cross-validation with the Hospice and Palliative Care Evaluation checklist. Discriminative validity of the PSBS was analysed by means of Kruskal-Wallis test of patients’ performance score. Reliability of the PSBS was evaluated by internal consistency analysis, test-retest and split-half-reliability. Inter-rater reliability was investigated by observer agreement of nurses’ ratings of symptom burden within a day. Sensitivity to change was analysed by Wilcoxon test with repeated measures of the PSBS before and after palliative complex treatment.

Results

A high degree of acceptance and the feasibility of a high-frequency proxy-based symptom burden assessment approach were demonstrated. There were low rates of missing values and no indications of the adoption of prior ratings. PSBS in its present form demonstrates good structural and construct validity (rs = .27–.79, p’s < .001) and high sensitivity to changes in symptom burden (p’s < .01, except sweating), but unsatisfactory reliability (α = .41–.67; test-retest: rs = .30–.88; p’s < .001; split-half: rs = .69; p < .001; inter-rater: n.s.).

Conclusions

The study presents a framework for the post hoc validation of an already existing documentation tool in palliative care. This study supports the notion that PSBS might not be reflective of an overall construct and will therefore require further development and critical comparison to other already established symptom burden instruments in palliative care.

Background

Palliative care deals, by definition, with human beings in a very complex and difficult situation. This situation includes, for instance, multi-morbidity at a very late stage of treatment, with a multitude of different pharmacological treatments resulting in both physical and mental suffering. Consequently, patients in a palliative care setting exhibit a broad range of physical and psychological symptoms. The documentation of patients’ symptom development at short intervals by means of standardised documentation systems can improve patient care, clinical decision-making, quality assurance and evaluation of treatment delivery.

Due to the identified need for standardised documentation instruments, the last few years have seen a growing interest in the measurement of symptom development in palliative care patients. Several national and international collaborations have been founded to foster and harmonise research on this topic (European PRISMA Group [1]). A recently published consensus paper by the European Association for Palliative Care Task Force on outcome measurement highlights the critical importance of implementing standardised, psychometrically evaluated outcome measures into the daily clinical routine in specialised palliative care units (SPCU) [2].

Nevertheless, there are several practical and methodological challenges to consider when examining methods of documenting symptoms [3]. The first challenge is determining an adequate outcome measure for the end-of-life setting and the frequency of measurement. The numerous existing outcome measurement systems were not developed for palliative care populations, and most of them have not yet been validated [4]. Given the high temporal variability in patients’ symptom presentation, there is no consensus on the appropriate frequency of symptom measurement [4, 5]. Even though self-report is considered the gold standard for obtaining information on patient symptom burden [4], many palliative care patients are not able to complete questionnaires or answer questions due to fatigue, decreased alertness or delirium [6]. For other patients, a high-frequency self-assessment approach might constitute an undue burden. Additionally, being confronted with a terminal disease and the prospect of approaching death results in severe distress and a broad range of affective reactions, which in turn trigger various coping and self-defence mechanisms [7, 8]. Those mechanisms might also lead to bias through under-reporting of symptoms. Consequently, especially in the end-of-life setting, proxy-based symptom documentation appears to be a promising additional source of information to complement self-reported measurement instruments. Because high-frequency proxy-based symptom measurement approaches always entail an increased workload for hospital staff, the successful implementation of such an approach depends to a great degree on its practical feasibility in daily clinical routine and its acceptance by nurses and physicians [9].

Currently, only a few validated proxy-based assessment tools for symptom burden in palliative care patients are available in the German language: the Basic Documentation for Psycho-Oncology [10], the Hospice and Palliative Care Evaluation checklist (HOPE) [11] and the Edmonton Symptom Assessment System (ESAS) [12].

The Basic Documentation for Psycho-Oncology focuses on the psycho-social burden of cancer patients. HOPE measures the symptom burden of the previous three days of palliative patients. ESAS was originally developed as a self-assessment tool for symptom burden in cancer patients but is also used as a proxy-based assessment tool in some cases.

Nevertheless, a large number of SPCUs and hospices still use non-validated, unpublished, self-administered and non-expert-developed documentation tools to assess patients’ symptom burden. To acquire additional knowledge regarding outcome measures in palliative care, it would be advisable to evaluate these instruments in terms of their psychometric validity and to share experiences in the implementation of such approaches with the scientific community in addition to palliative care practitioners.

One outcome instrument used within the interdisciplinary palliative care centre at the University Hospital Dusseldorf (see methods for details) is the Palliative Symptom Burden Score (PSBS) [13], which measures physical and psychosomatic symptom burden. The PSBS items are alertness, confusion, restlessness/anxiety, sweating, itching, weakness, nausea, vomiting, dyspnoea, coughing, pain and constipation. Symptom intensity is rated on a five-point verbal rating scale three times a day by nursing staff. The data collected by PSBS were first reported in a study on high-dependency palliative care patients dying in a tertiary hospital inpatient unit [13]. To date, no studies have examined the psychometric properties of PSBS. The tool was heuristically developed by clinical experts in palliative care. The most frequent symptoms in palliative care patients according to experts’ clinical impressions were included as items for the instrument (see Table 2); there has been no further psychometric validation.

Table 1 Setting of the SPCU

In 2011, the SPCU of the University Medical Centre in Dusseldorf, Germany implemented a proxy-based measurement instrument embedded in the electronic patient record (EPR) for high-frequency assessments of symptom burden in palliative patients [14]. Physical and psychological symptom burden is now assessed by means of the PSBS [13]. To our knowledge, no studies have yet been conducted on implementing a longitudinal proxy-rated, high-frequency assessment system in daily clinical care. Considering the importance of such an approach for the quality assurance of treatment delivery and clinical evaluation of therapy outcomes, the present paper intends to share empirical knowledge concerning the feasibility and acceptance of longitudinal high-frequency proxy-based assessments of symptom burden in palliative patients. Given the demand for reliable and valid instruments [2], this study reports data concerning the psychometric properties of the PSBS and presents a framework for the validation of expert-created tools in palliative care. Based on the experience gained during the implementation process, the paper also presents practical recommendations for the development, implementation and evaluation of proxy-based assessments in SPCUs.

Methods

Study design

This study was an observational cohort study with a retrospective analysis of longitudinal data on symptom burden assessment in an inpatient palliative care setting. The study reporting follows the STROBE [15] guidelines for reporting observational cohort studies. This study was approved by the Ethics Committee of the Medical Faculty of Heinrich Heine University Dusseldorf, Germany (protocol number 5287, approved 09 November 2015).

Setting

The Interdisciplinary Centre for Palliative Medicine is an SPCU at a university hospital in an urban area of Germany. It offers inpatient palliative care treatment on a ward with eight beds. Furthermore, there is a liaison service for inpatients at other hospitals. A detailed overview of the setting, i.e., the SPCU Dusseldorf, is presented in Table 1. The data for this study were collected only from inpatients of the SPCU (not from inpatients of other hospitals).

Implementation

High-frequency proxy-based symptom assessment by means of the PSBS was implemented in August 2011. To train nurses in the utilisation of the new documentation system and to foster their acceptance for the new approach, a training course was offered. Subsequently, a pilot phase was conducted in which the nurses were asked to evaluate the documentation system and share their experiences with it in daily clinical routine. As a result of the nurses’ feedback, the user interface was adjusted to enhance its ease-of-use.

Data

Dependent variables

The palliative symptom burden score

The PSBS was developed as a high-frequency documentation tool for medical professionals to measure the symptom burden of palliative care patients. Symptom burden is rated three times daily, each rating covering the previous 8 h. An overview of its 11 items and their assessment is given in Table 2. Symptom burden indicators were originally developed by an expert panel of two palliative care physicians and one senior palliative care nurse in a heuristic process including a narrative literature search and iterative discussion. The original development of the instrument took place in one specialised palliative care centre in Berlin during the pioneer phase of palliative medicine in the 1990s in Germany and did not follow traditional tool development guidelines. The final set of items used for the instrument was based on expert opinion and had not initially been tested in a pilot phase. Patients or carers were not involved in the development phase. The items in the PSBS represent the most common symptoms of patients in SPCUs as defined by the original expert panel: alertness, confusion, restlessness/anxiety, sweating, itching, weakness, nausea, vomiting, dyspnoea, coughing, pain and constipation. Each symptom is measured with one item. The intensity of the symptom is rated by means of a five-point verbal rating scale ranging from zero points (no symptom burden) to four points (strong symptom burden). Pain was rated via a verbal rating scale ranging from zero points to ten points. For reasons of comparability to the other items and for the statistical analyses, it was converted into a 5-point verbal rating scale after data collection. Constipation was originally measured dichotomously, reported as yes or no. Consequently, this item was excluded from the PSBS because conversion into a 5-point verbal rating scale was not possible. In total, the PSBS therefore consists of 11 items. Symptom assessment and operationalisation of the items are described in Table 2. Overall symptom burden is reflected by the sum of single items (Min = 0; Max = 44).
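The arithmetic behind the sum score range (11 items, each rated 0 to 4, hence 0 to 44) can be illustrated with a minimal sketch. The function name, the item keys and the dictionary layout are illustrative assumptions and not part of the instrument; the sketch also assumes that pain has already been converted to the 0 to 4 range, since the conversion rule is not given in the text.

```python
# Item keys follow the PSBS description in the text (constipation excluded);
# the names and the data layout are illustrative assumptions.
PSBS_ITEMS = (
    "alertness", "confusion", "restlessness_anxiety", "sweating", "itching",
    "weakness", "nausea", "vomiting", "dyspnoea", "coughing", "pain",
)
MAX_ITEM_SCORE = 4  # five-point scale: 0 (no burden) to 4 (strong burden)


def psbs_sum_score(ratings):
    """Return the PSBS sum score (0 to 44) for one complete rating."""
    missing = [item for item in PSBS_ITEMS if item not in ratings]
    if missing:
        raise ValueError(f"missing items: {missing}")
    total = 0
    for item in PSBS_ITEMS:
        score = ratings[item]
        if not 0 <= score <= MAX_ITEM_SCORE:
            raise ValueError(f"{item} out of range: {score}")
        total += score
    return total


# Upper and lower bounds of the sum score
highest = psbs_sum_score({item: MAX_ITEM_SCORE for item in PSBS_ITEMS})
lowest = psbs_sum_score({item: 0 for item in PSBS_ITEMS})
print(lowest, highest)  # 0 44
```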

Table 2 Items and operationalisation of the PSBS

Regarding component structure, it was proposed by the authors that the items alertness, confusion, restlessness/anxiety and weakness may constitute a component indicating a psychosomatic symptom complex subscale. In addition, nausea and vomiting were allocated to a gastrointestinal subscale and dyspnoea and cough to a respiratory subscale. There were no further groupings regarding pain, sweating and itching. The authors therefore expected six components of symptom burden.

Hope

The symptom and problem checklist HOPE [11, 16] consists of 16 items for the documentation of symptom burden of the previous 3 days. Eight items (pain, nausea, vomiting, dyspnoea, constipation, weakness, loss of appetite, tiredness) measure physical symptoms, four items (feeling depressed, anxiety, tension, disorientation/confusion) measure psychological issues, two items (wound care, activities of daily living) measure nursing issues, and two items (organisation of care, overburdening of the family) measure social issues [16]. Additionally, one free entry is provided for possible further issues, e.g., symptoms that are not assessed in the instrument. The symptom intensity is measured on a 4-point verbal rating scale (0 = no, 1 = mild, 2 = moderate, 3 = severe). Sum scores are calculated for the four subscales and a global sum score ranging from a minimum of zero points to a maximum of 51 points.

HOPE’s item structure is similar to that of the PSBS: anxiety, confusion, weakness, nausea, vomiting, dyspnoea and pain are measured in both HOPE and PSBS. HOPE’s item tension can be compared to the PSBS item restlessness. In both instruments, symptom burden is rated via verbal rating scales (HOPE: 4-point Likert scale; PSBS: 5-point Likert scale). Due to these similarities, HOPE was chosen for cross-validation and investigation of the construct validity of PSBS. Nevertheless, the instruments are not completely interchangeable. While PSBS covers alertness as an important cognitive parameter within psychological symptoms, HOPE includes feeling depressed as an important psycho-affective component, which is not measured by PSBS. Additionally, HOPE includes items for loss of appetite and tiredness, which are not included in PSBS, while PSBS includes coughing, itching and sweating. Thus, PSBS and HOPE can be considered two distinct measures of symptom burden that overlap substantially but also cover different aspects of symptom burden in palliative care patients.

The ECOG scale of performance status

The Eastern Cooperative Oncology Group (ECOG) scale of performance status [17] is a widely used prognostic tool to quantify functional status in cancer patients. In palliative care, it has also been used to report the functional status of non-cancer patients with life-limiting illness [18]. The ECOG describes patients’ functional status regarding ambulatory status and need for care. The scale categorises functional status via five classes (0–4). A score of zero indicates normal activity; a score of one point indicates that the patient is able to walk and that light activity is possible. A score of two points means the patient is < 50% bedridden, with self-care being possible; a score of three points means the patient is > 50% bedridden with limited self-care capability, while a score of four points indicates the patient is completely bedridden and in need of care [19].

Independent variables

Palliative complex treatment

Between day one and day seven of treatment at the SPCU, patients received a specialised palliative complex treatment that included a set of interventions performed by palliative care professionals focusing on patient stabilisation and the reduction of symptom burden.

Data collection

Data collection was performed between August 2011 and August 2015. Symptom burden assessment by means of the PSBS was conducted three times a day by trained palliative care nurses of the SPCU. The results were documented digitally via a standardised documentation interface. An assessment took two to three minutes. HOPE and ECOG were measured on admission and at discharge. For deceased patients, assessments for PSBS, HOPE and ECOG were performed post-mortem by nurses within a day after death. Among the patients, 476 (58%) died at the SPCU, 298 (36.30%) were discharged, 27 (3.30%) were moved to another ward within the university hospital and 9 (1.10%) were moved to another institution (e.g., hospice). Patients’ palliative stage was reported for day one of admission for those patients in whom initial assessment of performance stage and clinical survival prediction was deemed reliable. A majority of patients needed a longer period of assessment and were discussed during our weekly multidisciplinary team meetings to improve prognostic accuracy, as suggested by White et al. [20].

Sample

Sample characteristics regarding age, sex, diagnosis group, palliative stage and ECOG performance status are shown in Table 3.

Table 3 Sample characteristics (N = 820)

Statistical analyses

Patient data were extracted from the clinic’s electronic medical records and anonymised before transferring the data into SPSS. All statistical analyses were performed using IBM SPSS 22 for Windows (IBM Corp. in Armonk, NY). The data were checked for plausibility prior to inferential analyses. Descriptive statistics are reported.

For each analysis, the data timepoint is reported, where the first number indicates the day of data collection and the second number indicates the time of day (morning, noon, evening). For example, t1_3 is day 1 (admission), measure 3 (evening) and t7_1 is day 7 (1 week after admission), measure 1 (morning). We used measure 3 (evening) for the analyses wherever possible because of its low rate of missing values. For comparisons within a day, data for day 7 instead of day 1 were used for the same reason.
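As a small illustration of this labelling scheme, the following sketch splits a label into its day and time of day. The helper name and the lookup table are hypothetical; only the label format itself is taken from the text.

```python
import re

# Hypothetical helper: only the label format "t<day>_<measure>" comes from
# the text; the function and dictionary names are illustrative.
MEASURES = {1: "morning", 2: "noon", 3: "evening"}


def parse_timepoint(label):
    """Split a label such as 't7_1' into (day, time of day)."""
    match = re.fullmatch(r"t(\d+)_([123])", label)
    if match is None:
        raise ValueError(f"unrecognised timepoint label: {label!r}")
    day = int(match.group(1))
    measure = int(match.group(2))
    return day, MEASURES[measure]


print(parse_timepoint("t1_3"))  # (1, 'evening')
print(parse_timepoint("t7_1"))  # (7, 'morning')
```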

Feasibility and acceptance of the PSBS

To evaluate the feasibility and acceptance of the PSBS, high-frequency documentation data were investigated regarding rates of missing values. In addition, the data were checked for potential bias caused by the adoption of prior ratings by means of the Kendall-W coefficient of concordance [21]. High and statistically significant Kendall-W values were assumed to be an indicator of systematic adoption of prior ratings.
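For readers unfamiliar with the coefficient, the following is a minimal sketch of Kendall's W with the usual tie correction. The data layout is an assumption: each row holds one patient's three ratings of a single item within a day, which is consistent with the two degrees of freedom reported for the chi-square statistic, but the exact setup of the original SPSS analysis is not described in the text. The toy data are invented.

```python
def average_ranks(values):
    """Rank values from 1..n, giving tied values their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        mean_rank = (start + end) / 2 + 1  # positions are 0-based
        for k in range(start, end + 1):
            ranks[order[k]] = mean_rank
        start = end + 1
    return ranks


def kendalls_w(rows):
    """Return (W, chi-square) for m rows that each rank the same n columns."""
    m, n = len(rows), len(rows[0])
    rank_sums = [0.0] * n
    tie_term = 0.0
    for row in rows:
        for j, rank in enumerate(average_ranks(row)):
            rank_sums[j] += rank
        for value in set(row):
            t = row.count(value)
            tie_term += t ** 3 - t
    mean_sum = m * (n + 1) / 2
    s = sum((r - mean_sum) ** 2 for r in rank_sums)
    denominator = m ** 2 * (n ** 3 - n) - m * tie_term
    if denominator == 0:
        raise ValueError("W is undefined when every row is completely tied")
    w = 12 * s / denominator
    return w, m * (n - 1) * w  # chi-square has n - 1 degrees of freedom


# Invented toy data: three patients whose ratings rise identically over the day
w, chi2 = kendalls_w([[1, 2, 3], [0, 1, 4], [2, 3, 4]])
print(w, chi2)  # 1.0 6.0
```

A value of W near 1 means the same ordering recurs across rows, whereas a value near 0 means there is no consistent ordering.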

Validity

Structural validity

Structural validity was analysed using data from timepoint t1_3 because of its low rate of missing values. To investigate the structural validity of the PSBS, a principal component analysis (PCA) with a cut-off criterion of 6 principal components was conducted. Although PCA is not a factor analysis, it is the most frequently used approach for data reduction in psychology [22]. Analyses were performed in accordance with the procedure suggested by Klopp [22]:

Suitability of the data for PCA

Prior to analysis, the data were checked for their adequacy for principal component analysis using the Kaiser-Meyer-Olkin measure of sampling adequacy (KMO) and Bartlett’s test of sphericity. KMO values > .50 [23] and significant Bartlett’s test results were taken as indicators of the adequacy of the data for PCA.

Number of components

The main goal of principal component analysis is to determine a component structure that is stable with respect to the method of component extraction and rotation and replicable under other conditions. In contrast to subjective assessment methods, such as scree plot analysis [24], there are some objective procedures. One criterion for estimating the number of components to be extracted is the replicability of the component structure. Therefore, the dataset was split into two random samples, and a principal component analysis was performed on each random sample with the proposed number of components as the cut-off criterion. The resulting two component loading matrices were then compared with each other by calculating Tucker’s coefficient of congruence [25, 26] as follows:

$$ {C}_{jk}=\frac{\sum_{i=1}^p{a}_{ij}\cdot {b}_{ik}}{\sqrt{\left({\sum}_{i=1}^p{a}_{ij}^2\right)\cdot \left({\sum}_{i=1}^p{b}_{ik}^2\right)}} $$

where $a_{ij}$ represents the loading of variable i on component j of the first component loading matrix, $b_{ik}$ is the loading of variable i on component k of the second component loading matrix, and p is the number of variables. The resulting coefficient C may have values between −1 and +1, which can be interpreted similarly to Pearson’s correlation coefficient [27]. Values of Tucker’s congruence coefficient > .80 are assumed to be indicators of good replicability of the component structure [26].
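The coefficient defined above can be computed directly from two columns of loadings. The following is a minimal sketch; the function name and the loadings are invented for illustration and are not taken from the study.

```python
import math


def tucker_congruence(a, b):
    """Tucker's congruence coefficient between two loading vectors.

    a and b hold the loadings of the same p variables on one component
    from each of the two split-sample solutions.
    """
    if len(a) != len(b):
        raise ValueError("both components must cover the same variables")
    numerator = sum(x * y for x, y in zip(a, b))
    denominator = math.sqrt(sum(x * x for x in a) * sum(y * y for y in b))
    if denominator == 0:
        raise ValueError("a component with all-zero loadings has no direction")
    return numerator / denominator


# Invented loadings of four variables on one component in two random halves
first_half = [0.80, 0.70, 0.10, 0.05]
second_half = [0.75, 0.72, 0.15, 0.00]
c = tucker_congruence(first_half, second_half)
print(c > 0.80)  # True: this toy component would count as replicable
```

Because the coefficient depends only on the direction of the loading vectors, two components with proportional loadings give a value of 1 even if one solution has uniformly smaller loadings.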

Interpretability and rotation of the principal components

To facilitate the interpretability of the component solution, we chose the orthogonal rotation method varimax. The aim of the varimax rotation method is to achieve a simple structure of the component solution, which means that some variables load very high on one component, while other variables load very low. Thus, the variance of the squared component loadings is maximised [28].

Significance of component loadings

After facilitation of interpretability by means of varimax rotation, the variables that are used for the interpretation of a component must be determined. In accordance with the rule proposed by Gorsuch [29], only variables with component loadings > .30 were assumed to correspond to a component. We further considered the general rule of Guadagnoli and Velicer [30]: if fewer than 10 variables have a component loading > .40, then the sample size must be greater than 300 persons.

Construct validity

Due to its comparable item structure and similar outcome measure, the construct validity of the PSBS was investigated via cross-validation with the HOPE checklist. The HOPE subscales nursing problems and social problems were excluded because there were no similar subscales in the PSBS. Spearman’s rank correlation was calculated for sum scores and subscales of the PSBS and HOPE. Because HOPE does not include a gastrointestinal and respiratory symptom complex component, no analysis concerning these PSBS subscales was performed. Consequently, further analyses were performed at the single-item level. Significant positive correlations were assumed to be indicators of good construct validity.

Discriminative validity

The discriminative validity of the PSBS was investigated with two Kruskal-Wallis tests (nonparametric analyses of variance) [31], with ECOG performance status as the independent variable and the PSBS sum score at t1_3 and t7_3 as the dependent variables.
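To make the test statistic concrete, the following minimal sketch computes the tie-corrected Kruskal-Wallis H for a set of groups. It is an illustration only and not the SPSS procedure used in the study; the function names and the toy data are invented, and every group is assumed to be non-empty. H is compared against a chi-square distribution with (number of groups − 1) degrees of freedom, which gives four degrees of freedom for the five ECOG classes.

```python
def average_ranks(values):
    """Rank values from 1..n, giving tied values their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        mean_rank = (start + end) / 2 + 1  # positions are 0-based
        for k in range(start, end + 1):
            ranks[order[k]] = mean_rank
        start = end + 1
    return ranks


def kruskal_wallis_h(groups):
    """Return (tie-corrected H, degrees of freedom) for non-empty groups."""
    pooled = [value for group in groups for value in group]
    n_total = len(pooled)
    ranks = average_ranks(pooled)
    weighted = 0.0
    position = 0
    for group in groups:
        rank_sum = sum(ranks[position:position + len(group)])
        weighted += rank_sum ** 2 / len(group)
        position += len(group)
    h = 12 / (n_total * (n_total + 1)) * weighted - 3 * (n_total + 1)
    tie_term = sum(pooled.count(v) ** 3 - pooled.count(v) for v in set(pooled))
    correction = 1 - tie_term / (n_total ** 3 - n_total)
    if correction == 0:
        raise ValueError("H is undefined when all observations are identical")
    return h / correction, len(groups) - 1


# Invented sum scores for three hypothetical performance status groups
h, df = kruskal_wallis_h([[1, 2, 3], [4, 5, 6], [7, 8, 9]])
print(round(h, 3), df)  # 7.2 2
```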

Reliability

The internal consistency of the PSBS subscales was tested by Cronbach’s alpha. In accordance with [32], values > .70 were taken as indicators of acceptable internal consistency. Additionally, the split-half reliability for the whole test was calculated using the odd-even method. Spearman-Brown coefficients [33] are reported. The test-retest reliability was evaluated by Spearman’s rank correlation of PSBS sum scores and subscales within a day (t7_1 morning and t7_3 evening) and within a week (at t1_3 and after 1 week of treatment t7_3). To assess PSBS inter-rater reliability, ratings made by different nurses within a day (t7_1, t7_2, t7_3) were compared using Kendall’s W coefficient of concordance [21].
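The two internal consistency measures can be sketched as follows. This is a minimal illustration with invented data, not the SPSS procedure used in the study: Cronbach's alpha is computed from the item variances and the variance of the sum score, and the Spearman-Brown formula steps the correlation between two test halves up to an estimate for the full-length test. The function names are hypothetical.

```python
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha; rows are patients, columns are items of one scale."""
    k = len(rows[0])
    if k < 2:
        raise ValueError("alpha needs at least two items")
    item_variances = [variance([row[j] for row in rows]) for j in range(k)]
    total_variance = variance([sum(row) for row in rows])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


def spearman_brown(half_correlation):
    """Full-length reliability estimated from the correlation of two halves."""
    return 2 * half_correlation / (1 + half_correlation)


# Invented ratings of four patients on a two-item subscale
alpha = cronbach_alpha([[0, 1], [1, 1], [2, 3], [4, 3]])
print(round(alpha, 3))                # 0.879
print(round(spearman_brown(0.5), 3))  # 0.667
```

With the criterion named in the text, a subscale would count as acceptably consistent only if its alpha exceeded .70.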

Sensitivity to change

To investigate the sensitivity of the PSBS to changes in patients’ symptom burden as a consequence of treatment interventions, the sum scores of the PSBS were evaluated with respect to significant differences before (t1_3) and after (t7_3) palliative complex treatment using the Wilcoxon test with repeated measures [34]. The level of significance was Bonferroni-adjusted to p < .01. Only patients who completed the palliative complex treatment were included in the analysis (n = 514). Patients who died within the first week and did not complete treatment were excluded from the analysis.

Results

Feasibility and acceptance of the PSBS

Analyses showed a high degree of acceptance of the PSBS implementation by the specialised palliative care nurses. The rates of missing values in the PSBS documentation were low (0.32%).

Descriptive PSBS and HOPE

The overall mean PSBS score was 12.21 (SD = 4.40; n = 784, missing = 36). Female patients had a mean PSBS score of 12.50 (SD = 4.45; n = 381). Male patients’ mean PSBS score was 11.94 (SD = 4.33, n = 403). Cancer patients had a mean PSBS score of 12.88 (SD = 4.33, n = 664), previous cancer patients 12.78 (SD = 4.29, n = 64) and non-cancer patients 13.78 (SD = 4.97, n = 56). Patients’ overall mean HOPE score was 23.80 (SD = 5.48, n = 782, missing = 38). Female patients had a mean HOPE score of 23.83 (SD = 5.99, n = 378) and male patients of 23.33 (SD = 5.98, n = 404). Cancer patients had a mean HOPE score of 23.66 (SD = 6.08, n = 658), previous cancer patients of 23.68 (SD = 5.06, n = 67) and non-cancer patients of 25.55 (SD = 5.62, n = 57).

Validity

Structural validity

Data were adequate for PCA with a Kaiser-Meyer-Olkin criterion of .61 and a significant Bartlett’s test of sphericity (χ2(55) = 1020.80; p < .0001). PCA revealed a six-component solution explaining 73.12% of the variance. Analysis of component replicability revealed Tucker coefficients of congruence greater than .80 for all of the PSBS scales, indicating good replicability of the main components. Principal components, explained variance, Tucker coefficients of congruence, corresponding items and component loadings are presented in Table 4.

Table 4 Component solution of the PSBS

Construct validity

There was a significant positive correlation of PSBS on admission with HOPE scores on admission (rs = .58; p < .001) and at discharge (rs = .54; p < .001). The psychological problems scale of HOPE correlated significantly with the psychosomatic symptom complex of the PSBS on admission (rs = .43; p < .001) and at discharge (rs = .28; p < .001). As principal component analysis did not reveal a physical symptom burden complex component for the PSBS, no correlations concerning the physical problems subscale of HOPE were calculated. Single item correlations of PSBS and HOPE checklist revealed positive significant correlations ranging between rs = .48 and rs = .79 on admission and between rs = .18 and rs = .61 (all p-values < .001) 1 week after admission. The single item correlations of the PSBS and the HOPE checklist are presented in Table 5.

Table 5 Convergent validity

Two nonparametric analyses of variance using the Kruskal-Wallis test revealed significant differences in PSBS sum scores for patients in different subgroups of ECOG on admission (χ2(4) = 121.91; p < .001) and 1 week after admission (χ2(4) = 57.68; p < .001). The mean sum scores for each ECOG group and measurement point are presented in Table 6.

Table 6 Discriminative validity

Reliability

The Cronbach’s alpha coefficients for the PSBS sum score and the PSBS subscales did not meet the criterion of acceptable internal consistency (> .70). The coefficients are presented in Table 7. The split-half reliability was investigated using the odd-even method. The results were adjusted using the Spearman-Brown formula, yielding a coefficient of .69. Spearman’s rank correlation of PSBS sum scores on admission and PSBS sum scores 1 week after admission revealed a significant, moderate positive correlation (rs = .55; p < .001). Correlations and p-values for the PSBS subscales are shown in Table 7. Analyses of inter-rater reliability revealed poor and non-significant values for all items except confusion (Kendall’s W = .01; χ2(2) = 9.97; p < .01). Pain marginally missed the level of significance (Kendall’s W = .01; χ2(2) = 5.92; p = .05). The results of the inter-rater reliability analyses therefore gave no indication of systematic adoption of prior ratings. Kendall’s W, chi-square and p-values for each item are shown in Table 8.

Table 7 Reliability coefficients
Table 8 Inter-rater reliability

Sensitivity to change

The Wilcoxon test with repeated measures showed significant differences before and after palliative complex treatment for all PSBS subscales and sum scores except sweating (z = − 0.34; p = .73). The mean PSBS subscales and sum scores before and after palliative complex treatment with corresponding z- and p-values are presented in Table 9.

Table 9 Sensitivity to change

Discussion

The aim of the present study was to report the implementation, acceptability and feasibility of a high-frequency proxy-based symptom assessment instrument in palliative care, to describe data concerning the psychometric properties of the instrument and to present a framework for the evaluation of such an approach. Systematic proxy-based assessment of symptom burden in palliative care obtained by nurses can be similar in accuracy to patient-reported outcomes and has special value in low-functioning or confused patients [9, 35].

Feasibility and acceptance

Since its implementation in 2011, the PSBS has been integrated into daily clinical routine at the SPCU at the University Medical Centre in Dusseldorf, Germany. Symptom burden was successfully documented three times a day, and further analysis showed a low rate of missing values and no indication of adoption of prior ratings. We would argue that these findings can be interpreted as two indicators of the acceptability of this instrument, but interviews with nurses who conduct their daily assessments with PSBS are needed to confirm this preliminary finding. In summary, successful implementation of PSBS and the quantitative analysis of nurses’ ratings provide some evidence for the feasibility of a high-frequency proxy-based symptom documentation approach in an SPCU. To gain a more detailed understanding of the feasibility and acceptance of the PSBS in clinical practice, further research should include qualitative assessments, e.g., by means of qualitative interviews with nursing staff.

Psychometric properties

Validity

Because PSBS as an expert-developed documentation instrument has not yet been validated, another aim was to report data concerning the psychometric properties of this instrument. PCA revealed a six-component solution, including three multiple-item subscales (psychosomatic symptom complex, gastrointestinal symptom complex and respiratory symptom complex) and three single-item scales (pain, itching, sweating). Considering the large amount of explained variance, the psychosomatic symptom complex appears to be a very relevant aspect of palliative patient symptom burden. This result is in agreement with former studies highlighting the importance of psychological symptoms in palliative care patients [36].

Several different methods are used for data reduction. Common factor analysis (CFA) and principal component analysis (PCA) are widely used multivariate techniques for this purpose [37]. According to Widaman [38], “the final word on comparisons between CFA and PCA has not yet been written” (p. 201). In the present study, we chose PCA for data reduction and evaluation of PSBS’ structural validity because nonzero PCA loadings are higher and more stable than nonzero common factor analysis loadings and are closer approximations of the true factor loadings than the loadings produced by common factor analysis [37]. An implication for further research is to further evaluate PSBS’ latent factor structure by structural equation modelling.

PSBS and HOPE sum scores showed a significant, moderate positive correlation on admission and at discharge, indicating good construct validity of the PSBS. The moderate size of the correlations implies that both instruments measure similar constructs but are not redundant, potentially because their items differ slightly. The psychosomatic subscales of the PSBS and HOPE also show significant, moderate positive correlations on admission and at discharge, which may be because the two instruments cover different aspects of mental symptom burden. While the PSBS measures alertness and weakness as important mental symptoms of palliative care patients, HOPE covers depression, which is of no less importance. The results of the single-item correlations of the PSBS and HOPE support the construct validity of the PSBS. Interestingly, the strength of the correlations decreases at the second point of measurement (discharge), which may be caused by HOPE post-mortem ratings for deceased patients. The Kruskal-Wallis test showed significant differences in PSBS sum scores between ECOG subgroups, demonstrating good discriminative validity of the PSBS regarding different intensities of symptom burden.

Reliability

Analyses of the internal consistency of the PSBS subscales yielded values below the cut-off for all subscales. Whereas the psychosomatic and gastrointestinal symptom complexes almost reached acceptable reliability, the respiratory symptom complex showed poor internal consistency. These indicators do not support the use of the proposed subscales for symptom assessment with the current instrument. It might be best to measure symptom burden at the single-item level. The PSBS sum score should not be used because it does not appear to be reliable.
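For readers less familiar with internal consistency analysis, the following minimal sketch shows how Cronbach’s alpha [32] is computed from item scores. It is an illustration only, not the authors’ analysis code: the function name `cronbach_alpha` and the example ratings are hypothetical, and the data layout (one list of ratings per item) is an assumption.

```python
from statistics import variance


def cronbach_alpha(items):
    """Cronbach's alpha for a subscale.

    items: list of item-score lists, one inner list per item, all of the
    same length (one entry per rated patient). Layout is an assumption.
    """
    k = len(items)
    if k < 2:
        raise ValueError("alpha needs at least two items")
    n = len(items[0])
    if any(len(col) != n for col in items):
        raise ValueError("all items need the same number of ratings")

    # Sum score of each patient across the k items
    totals = [sum(col[i] for col in items) for i in range(n)]
    total_var = variance(totals)
    if total_var == 0:
        raise ValueError("sum scores have zero variance")

    # alpha = k/(k-1) * (1 - sum of item variances / variance of sum score)
    item_var_sum = sum(variance(col) for col in items)
    return k / (k - 1) * (1 - item_var_sum / total_var)


# Hypothetical ratings (0 = none ... 3 = severe) for a three-item subscale
example_items = [
    [0, 1, 2, 3, 1, 2],
    [1, 1, 2, 3, 0, 2],
    [0, 2, 1, 3, 1, 1],
]
print(round(cronbach_alpha(example_items), 2))
```

Alpha approaches 1 when the items of a subscale rise and fall together and approaches 0 when they vary independently, which is why a heterogeneous subscale yields a low value.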

The split-half reliability of the PSBS also slightly missed the criterion of acceptable reliability. Analysis of test-retest reliability showed a moderate correlation between PSBS sum scores on admission and 1 week later. A correlation of .70 is an indicator of fair test-retest reliability, but this value depends strongly on the interval between the points of measurement. For a state-like symptom burden that is subject to frequent fluctuations within a single day, an interval of 1 week may have been too long to detect good test-retest reliability.
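The logic of a split-half estimate with the Spearman-Brown correction [33] can be sketched as follows. This is a hedged illustration: the odd-even split, the use of a Pearson correlation between the half scores, and all function names and data are assumptions made for the example, and the article does not state how the halves were formed.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation of two equally long score lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def spearman_brown(r_half):
    """Step the half-test correlation up to full test length."""
    return 2 * r_half / (1 + r_half)


def split_half_reliability(items):
    """items: one list of ratings per item (assumed layout).

    Items at even list positions form one half, items at odd positions
    the other (an assumed odd-even split).
    """
    n = len(items[0])
    half_a = [sum(col[i] for col in items[0::2]) for i in range(n)]
    half_b = [sum(col[i] for col in items[1::2]) for i in range(n)]
    return spearman_brown(pearson_r(half_a, half_b))


# Hypothetical four-item scale rated for five patients
example_items = [
    [0, 1, 2, 3, 1],
    [1, 1, 2, 3, 0],
    [0, 2, 1, 3, 1],
    [1, 0, 2, 2, 1],
]
print(round(split_half_reliability(example_items), 2))
```

Because each half contains only part of the items, the raw correlation between halves underestimates the reliability of the full scale; the Spearman-Brown step compensates for the halved test length.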

The results of test-retest reliability of PSBS subscores indicated a difference in stability between symptom burden subscores. The psychosomatic symptom complex and the respiratory symptom complex appeared to be more stable indicators, while the gastrointestinal symptom complex and the items pain, itching and sweating appeared to be less stable.

The inter-rater reliability of nurses’ ratings of symptom burden within a day showed poor and non-significant results for all items but confusion. Because inter-rater agreement can only be high if the rating objective remains constant, this result may be regarded as another indicator of fluctuations of symptom intensity within a day. Therefore, high-frequency documentation of symptom burden appears to be a reasonable approach. In contrast to other items, confusion appeared to be a stable symptom with high inter-rater agreement.

The low inter-rater agreement also indicates that there was no systematic adoption of prior ratings within the instrument. If this had been the case, the inter-rater agreement would have been high and significant.

Sensitivity to change

Another matter of interest was the sensitivity of the PSBS to changes in symptom burden caused by interventions, a psychometric property that is often underreported in palliative care [39]. All PSBS subscales (except sweating) and the PSBS sum score showed significant differences before and after the palliative complex treatment intervention. Given the hypothesis that interventions cause changes in symptom burden, the PSBS can be assumed to be sensitive to such changes. In this context, it is probable that the symptom of sweating was not influenced by any intervention. Scores were significantly lower for most subscales. However, there was a significant increase in the psychosomatic and itching subscales. Although these increases were significant, it is unclear whether they have clinical relevance, given that the psychosomatic symptom burden remained at a high level, itching remained at a very low level overall, and the changes were confined to the decimal places [40]. From a clinical perspective, it is not surprising to observe a tentative increase in pruritus given the difficult and complex nature of its pathophysiology and treatment, including opioid-induced pruritus (OIP), and its increase in end-stage presentations of malignancy, cholestasis and uraemia [41, 42]. Psychological assessment in palliative care is inherently complex given the high level of confusion and low functioning of patients and the limited uptake of self-reported measures [43]. Further research is needed to establish the reasons for the significant increase in our psychosomatic symptom subscale.
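A before-after comparison of this kind can be illustrated with a paired Wilcoxon signed-rank test, the test family named in the Methods. The sketch below is an assumption-laden illustration rather than the authors’ procedure: it drops zero differences, uses average ranks for ties, and relies on a normal approximation without continuity correction; the function name and the scores are hypothetical.

```python
from math import erfc, sqrt


def wilcoxon_signed_rank(before, after):
    """Return (w_plus, w_minus, z, two-sided p) for paired scores."""
    # Differences after - before; pairs without change are dropped
    diffs = [a - b for b, a in zip(before, after) if a != b]
    n = len(diffs)
    if n == 0:
        raise ValueError("no non-zero differences")

    # Rank absolute differences, giving tied values their average rank
    order = sorted(range(n), key=lambda i: abs(diffs[i]))
    ranks = [0.0] * n
    tie_term = 0
    i = 0
    while i < n:
        j = i
        while j + 1 < n and abs(diffs[order[j + 1]]) == abs(diffs[order[i]]):
            j += 1
        avg_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg_rank
        t = j - i + 1
        tie_term += t ** 3 - t
        i = j + 1

    w_plus = sum(r for r, d in zip(ranks, diffs) if d > 0)
    w_minus = sum(r for r, d in zip(ranks, diffs) if d < 0)

    # Normal approximation with tie-corrected variance
    mean = n * (n + 1) / 4
    var = n * (n + 1) * (2 * n + 1) / 24 - tie_term / 48
    z = (w_plus - mean) / sqrt(var)
    p = erfc(abs(z) / sqrt(2))  # two-sided p-value
    return w_plus, w_minus, z, p


# Hypothetical sum scores of five patients before and after treatment
before = [5, 4, 6, 7, 8]
after = [4, 2, 3, 3, 3]
w_plus, w_minus, z, p = wilcoxon_signed_rank(before, after)
print(w_plus, w_minus, round(z, 2), round(p, 3))
```

A negative z indicates that scores tended to decrease after the intervention. With samples this small an exact test would be preferable; the normal approximation is used here only to keep the sketch short.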

Lessons learned

The current study demonstrates that the implementation of a high-frequency proxy-based assessment of symptom burden in palliative patients is feasible and appears to be acceptable to nurses. According to the performed analyses, the PSBS is a feasible tool for the documentation of physical and psychological symptom burden with high sensitivity to changes in symptom burden but unsatisfactory reliability. The study further presents a framework for the post hoc validation of an existing documentation tool, to encourage other clinicians and researchers to evaluate such tools and thereby help meet the demand for valid and reliable outcome measures in palliative care. Based on the experiences gained during the study, the authors want to share the following recommendations for further endeavours.

Limitations

The present study deals with proxy-based measurements of symptom burden in palliative patients. Even though there are many advantages of this assessment approach, the rating itself is, to a great degree, dependent on the raters’ impression and extends only limited consideration to patients’ perception of their symptom burden. It should be mentioned that there could also have been a bias in nurses’ ratings because they were not blinded to the intervention of the palliative complex treatment. Due to the post hoc design and field setting of the study, it was not possible to use blinded raters.

Our evaluation of psychometric properties was based on classical test theory, and given our findings, it is possible that this tool is not reflective of an overall construct such as the Mini-Suffering State Examination [44] and the Palliative Outcome Scale (POS) [45]. Similar to the POS, the PSBS captures three factors and some independent items that do not load onto these factors, which makes this measure less ideal for the assessment of internal consistency and factor structure. Consequently, it appears that the PSBS is, in its present form, less suitable for this type of assessment.

In the current study, a post hoc psychometric analysis of an existing expert-developed documentation tool was performed. From a methodological perspective, a post hoc validation has its limitations. If possible, ad hoc theory-based test construction and validation should always be preferred. Further research is needed to enhance PSBS. For example, it would be an interesting research question to assess whether it can be adapted to other time scales than the prior 8 h.

Recommendations

Regarding the complex issues of designing and/or implementing high-frequency proxy-based symptom measurement instruments, the authors recommend integrating nursing staff into the implementation process at an early stage. This integration includes offering specific training in the use of the documentation interface in addition to the possibility of providing feedback and adapting the measurement system to foster its ease-of-use. Based on the experience of the authors, this procedure increases acceptance and compliance of the measurement approach.

From a methodological perspective, the use of an expert-developed tool caused several challenges regarding the psychometric evaluation of the documentation system. Clinical experts rarely consider theoretical aspects in the development of documentation systems or measurement instruments, resulting in different measurement levels for sub-items. In the present study, it became necessary to adjust the graduations of the item pain to ensure its comparability to other items of the PSBS. It was further necessary to exclude the item constipation from analyses because of its non-ordinal level of measurement. To avoid methodological challenges regarding the psychometric and clinical evaluation of patient data, the authors recommend ensuring that sub-items are measured on at least an ordinal verbal rating scale with comparable intervals between characteristic values, e.g., a Likert scale.

To maintain the possibility of evaluating a proxy-based measurement system with respect to its psychometric properties, it is highly recommended to add an empirically validated instrument for data collection. When evaluating such instruments for their suitability, it is important to consider a similar outcome objective and comparable item structure. From a test-theoretical perspective, it is also important to assure continuous and comparable measurement times of the second instrument to maintain the possibility of evaluating construct validity at several times of measurement.

The current study yielded evidence that symptom burden is subject to frequent fluctuations in its intensity within a day. Therefore, the authors highly recommend a high-frequency measurement approach of symptom burden data. Even though this approach leads to an additional workload for nursing staff, the experience gained within this study shows that it is feasible and accepted by nurses.

Conclusions

High-frequency proxy-based symptom burden assessment is a feasible and acceptable approach for nurse-led assessments of symptom burden in palliative care. PSBS in its present form demonstrates good structural and construct validity and high sensitivity to changes in symptom burden, but unsatisfactory reliability. This study supports the notion that PSBS might not be reflective of an overall construct and will therefore require further development and critical comparison to other already established symptom burden instruments in palliative care. Future research should focus on improving longitudinal psychosomatic symptom burden assessments.

Abbreviations

CFA: Common Factor Analysis
ECOG: Eastern Cooperative Oncology Group
ESAS: The Edmonton Symptom Assessment System
HOPE: Hospice and Palliative Care Evaluation checklist
M: Mean
OIP: Opioid-Induced Pruritus
PCA: Principal Component Analysis
POS: Palliative Outcome Scale
PSBS: Palliative Symptom Burden Score
SD: Standard Deviation
SPCU: Specialised Palliative Care Unit

References

  1. Harding R, Simon ST, Benalia H, Downing J, Daveson BA, Higginson IJ, et al. The PRISMA symposium 1: outcome tool use. Disharmony in European outcomes research for palliative and advanced disease care: too many tools in practice. J Pain Symptom Manag. 2011;42:493–500.

  2. Bausewein C, Daveson BA, Currow DC, Downing J, Deliens L, Radbruch L, Defilippi K, Lopes Ferreira P, Costantini M, Harding R, Higginson IJ. EAPC White Paper on outcome measurement in palliative care: Improving practice, attaining outcomes and delivering quality services–Recommendations from the European Association for Palliative Care (EAPC) Task Force on Outcome Measurement. Palliat Med. 2016;30(1):6–22.

  3. Kirkova J, Walsh D, Russel M, Hauser K, Lasheen W. Symptom assessment in palliative medicine: complexities and challenges. Am J Hosp Palliat Med. 2010;27:75–83.

  4. Evans CJ, Benalia H, Preston NJ, Grande G, Gysels M, Short V, et al. The selection and use of outcome measures in palliative and end-of-life care research: the MORECare international consensus workshop. J Pain Symptom Manag. 2013;46:925–37.

  5. Tang ST, McCorkle R. Appropriate time frames for data collection in quality of life research among cancer patients at the end of life. Qual Life Res. 2002;11:145–55.

  6. Hosie A, Davidson PM, Agar M, Sanderson CR, Phillips J. Delirium prevalence, incidence, and implications for screening in specialist palliative care inpatient settings: a systematic review. Palliat Med. 2013;27:486–98.

  7. Rabinowitz T, Peirson R. “Nothing is wrong, doctor”: understanding and managing denial in patients with cancer. Cancer Investig. 2006;24:68–76.

  8. van Laarhoven HW, Schilderman J, Bleijenberg G, Donders R, Vissers KC, Verhagen CA, et al. Coping, quality of life, depression, and hopelessness in cancer patients in a curative and palliative, end-of-life care setting. Cancer Nurs. 2011;34:302–14.

  9. Homsi J, Walsh D, Rivera N, Rybicki LA, Nelson KA, LeGrand SB, et al. Symptom evaluation in palliative medicine: patient report vs systematic assessment. Support Care Cancer. 2006;14:444.

  10. Knight L, Mussell M, Brandl T, Herschbach P, Marten-Mittag B, Treiber M, et al. Development and psychometric evaluation of the basic documentation for psycho-oncology, a tool for standardized assessment of cancer patients. J Psychosom Res. 2008;64:373–81.

  11. Radbruch L, Nauck F. Patientenregister als Forschungsinstrument. 2011 [cited 2016 Mar 5]; Available from: https://www.thieme-connect.com/products/ejournals/html/10.1055/s-2009-1225590.

  12. Bruera E, Kuehn N, Miller MJ, Selmser P, Macmillan K. The Edmonton Symptom Assessment System (ESAS): a simple method for the assessment of palliative care patients. J Palliat Care. 1991 [cited 2016 Mar 5]; Available from: http://psycnet.apa.org/psycinfo/1991-34179-001.

  13. Schulz C, Schlieper D, Altreuther C, Schallenburger M, Fetz K, Schmitz A. The characteristics of patients who discontinue their dying process–an observational study at a single university hospital centre. BMC Palliat Care. 2015;14:1.

  14. Schlieper D, Altreuther C, Schallenburger M, Neukirchen M, Schmitz A, Schulz C. Electronic implementation of integrated end-of-life care: a local approach. Int J Integr Care. 2017;20:17(2).

  15. The Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) Statement: Guidelines for reporting observational studies. Int J Surg. [cited 2018 Jun 9]. Available from: https://www.journal-surgery.net/article/S1743-9191(14)00212-X/fulltext.

  16. Stiel S, Pollok A, Elsner F, Lindena G, Ostgathe C, Nauck F, et al. Validation of the symptom and problem checklist of the German hospice and palliative care evaluation (HOPE). J Pain Symptom Manag. 2012;43:593–605.

  17. Oken MM, Creech RH, Tormey DC, Horton J, Davis TE, McFadden ET, et al. Toxicity and response criteria of the Eastern Cooperative Oncology Group. Am J Clin Oncol. 1982;5:649–56.

  18. Ostgathe C, Alt-Epping B, Golla H, Gaertner J, Lindena G, Radbruch L, et al. Non-cancer patients in specialized palliative care in Germany: what are the problems? Palliat Med. 2011;25:148–52.

  19. Sørensen JB, Klee M, Palshof T, Hansen HH. Performance status assessment in cancer patients. An inter-observer variability study. Br J Cancer. 1993;67:773.

  20. White N, Reid F, Harris A, Harries P, Stone P. A systematic review of predictions of survival in palliative care: how accurate are clinicians and who are the experts? PLoS One. 2016;11:e0161407.

  21. Hallgren KA. Computing inter-rater reliability for observational data: an overview and tutorial. Tutor Quant Methods Psychol. 2012;8:23.

  22. Klopp E. Explorative Faktorenanalyse. Explorative Factor Analysis [Internet]. 2010 [cited 2018 Mar 31]; Available from: http://psydok.psycharchives.de/jspui/handle/20.500.11780/3369

  23. Kaiser HF, Rice J. Little Jiffy, Mark IV. Educ Psychol Meas. 1974 [cited 2016 Mar 5]; Available from: http://psycnet.apa.org/psycinfo/1975-00097-001

  24. Cattell RB. The scree test for the number of factors. Multivar Behav Res. 1966;1:245–76.

  25. Tucker LR. A method for synthesis of factor analysis studies. Princeton, NJ: Educational Testing Service; 1951.

  26. Lorenzo-Seva U, Ten Berge JM. Tucker’s congruence coefficient as a meaningful index of factor similarity. Methodol Eur J Res Methods Behav Soc Sci. 2006;2:57.

  27. Wuensch KL. Comparing two groups’ factor structures: Pearson r and the coefficient of congruence. Available from: http://core.ecu.edu/psyc/wuenschk/MV/FA/FactorStructure-TwoGroups.docx.

  28. Bortz J, Schuster C. Statistik für Human-und Sozialwissenschaftler: Limitierte Sonderausgabe. Heidelberg: Springer-Verlag; 2011.

  29. Gorsuch RL. Factor analysis. 2nd ed. Hillsdale, NJ: Lawrence Erlbaum Associates; 1983.

  30. Guadagnoli E, Velicer WF. Relation of sample size to the stability of component patterns. Psychol Bull. 1988;103:265.

  31. Kruskal WH, Wallis WA. Use of ranks in one-criterion variance analysis. J Am Stat Assoc. 1952;47:583–621.

  32. Cronbach LJ. Coefficient alpha and the internal structure of tests. Psychometrika. 1951;16:297–334.

  33. Kelley TL. The applicability of the Spearman-Brown formula for the measurement of reliability. J Educ Psychol. 1925;16:300–3.

  34. Brunner E, Munzel U, Puri ML. Rank-score tests in factorial designs with repeated measures. J Multivar Anal. 1999;70:286–317.

  35. Strömgren AS, Groenvold M, Sorensen A, Andersen L. Symptom recognition in advanced cancer. A comparison of nursing records against patient self-rating. Acta Anaesthesiol Scand. 2001;45:1080–5.

  36. Balasooriya-Smeekens C, Walter FM, Scott S. The role of emotions in time to presentation for symptoms suggestive of cancer: a systematic literature review of quantitative studies. Psychooncology. 2015;24:1594–604.

  37. De Winter JC, Dodou D. Common factor analysis versus principal component analysis: a comparison of loadings by means of simulations. Commun Stat-Simul Comput. 2016;45:299–321.

  38. Widaman KF. Common factor analysis versus principal component analysis: differential bias in representing model parameters? Multivar Behav Res. 1993;28:263–311.

  39. Aslakson RA, Dy SM, Wilson RF, Waldfogel J, Zhang A, Isenberg SR, et al. Patient- and caregiver-reported assessment tools for palliative care: summary of the 2017 Agency for Healthcare Research and Quality technical brief. J Pain Symptom Manag. 2017;54:961–972.e16.

  40. van Rijn MHC, Bech A, Bouyer J, van den Brand JAJG. Statistical significance versus clinical relevance. Nephrol Dial Transplant. 2017;32:ii6–12.

  41. Balkaransingh P, Massey G. Opioid induced pruritus: the need for palliative care for a palliative medicine (S707). J Pain Symptom Manag. 2015;49:410.

  42. Alshammary. Review of management of pruritus in palliative care [Internet]. [cited 2018 Apr 16]. Available from: http://www.thejhs.org/article.asp?issn=2468-6360;year=2016;volume=4;issue=1;spage=17;epage=23;aulast=Alshammary.

  43. Mai SS, Gerlach C, Schmidtmann I, Vogt AR, Zeller V, Renner K-H, et al. Are Repeated Self-Reports of Psychological Variables Feasible for Patients Near the End of Life at a Palliative Care Unit? J Palliat Med. 2018 [cited 2018 Apr 16]; Available from: https://www.liebertpub.com/doi/abs/10.1089/jpm.2017.0537.

  44. Adunsky A, Zvi Aminoff B, Arad M, Bercovitch M. Mini-suffering state examination: suffering and survival of end-of-life cancer patients in a hospice setting. Am J Hosp Palliat Care. 2007;24:493–8.

  45. Rugno FC, Carlo MMR do PD. The Palliative Outcome Scale (POS) applied to clinical practice and research: an integrative review. Rev Lat Am Enfermagem [Internet]. 2016 [cited 2018 Apr 16];24. Available from: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4996092/.

Acknowledgements

We thank Christiane Liese and the staff of the Interdisciplinary Centre for Palliative Medicine of Heinrich-Heine University Dusseldorf, Germany, for supporting this research project. We thank Manuela Schatz, Institute of Nursing Science, Medical University of Graz (Austria) and Dr. George Basdanis, South London and Maudsley NHS Foundation Trust, for their editorial support. We thank Prof Dr. Stefan Troche, Professorship for Personality Psychology and Diagnostics, Department of Psychology and Psychotherapy, Witten/Herdecke University for his advisory support.

Availability of data and materials

Data and material of this study can be requested from the corresponding author.

Author information

Contributions

KF developed the design of the study, performed the data parametrisation and the statistical analyses and interpreted final results. CSQ originally developed the concept of the study, was the principal investigator and substantially contributed to designing the study and interpreting the data. TO made substantial contributions to the design of the study, analysing the data and interpreting the final results. AS and HV substantially contributed to data acquisition and interpreting the final results. All authors were involved in drafting the manuscript and revising it critically for important intellectual content; all authors gave final approval of the final version to be published. Each author is taking public responsibility and accepts accountability for those portions of the content they have been substantially involved in as described above.

Corresponding author

Correspondence to Katharina Fetz.

Ethics declarations

Authors’ information

KF is a Psychologist, freelancing Consultant for Research Methodology and Statistics, Research Associate and PhD student at Witten/Herdecke University, Germany.

HV is a resident and research fellow for Psychosomatic Medicine, Psychotherapy and Palliative Medicine at the Interdisciplinary Centre for Palliative Medicine at the University Hospital Dusseldorf, Germany.

TO is a Professor for Research Methodology and Statistics in Psychology at the Department of Psychology and Psychotherapy at Witten/Herdecke University, Germany.

AS is a Consultant in Palliative Medicine, Pain Medicine, and Anaesthesiology. Her special interest is in children and adolescent mental health services (CAMHS) and in applying animal-assisted therapy (AAT) to palliative medicine.

CSQ is a Consultant in Psychosomatic Medicine, Medical Psychotherapy, and Palliative Medicine from Germany and is a Visiting Lecturer in Palliative Care Psychiatry at the Institute for Psychiatry, Psychology and Neuroscience (IoPPN) at King’s College, London. He is a faculty member of the Global Institute of Psychosocial, Palliative and End-of-Life Care, Toronto, Canada. He is a trainee psychiatrist at South London and Maudsley NHS Foundation Trust. Additionally, he is pursuing a Doctorate of Professional Studies (DProf) in Existential-Phenomenological Psychotherapy at the New School of Psychotherapy and Counselling in London, UK.

Ethics approval and consent to participate

Ethical approval was obtained from the Ethics Committee of the Medical Faculty of Heinrich-Heine-University Dusseldorf, Germany (trial registry no. 5287; date of approval: 09 November 2015). The data were clinical routine data and were anonymised at the point of data acquisition for retrospective analysis. The ethics committee waived the need for individual participant consent. The study was conducted in accordance with the Declaration of Helsinki on Ethical Principles for Medical Research Involving Human Subjects.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

About this article

Cite this article

Fetz, K., Vogt, H., Ostermann, T. et al. Evaluation of the palliative symptom burden score (PSBS) in a specialised palliative care unit of a university medical centre - a longitudinal study. BMC Palliat Care 17, 92 (2018). https://doi.org/10.1186/s12904-018-0342-0

  • DOI: https://doi.org/10.1186/s12904-018-0342-0

Keywords