Introduction

Chemotherapy-induced peripheral neuropathy (CIPN) has always been a concern in the course of cancer therapy, as well as the main reason leading to the suspension or termination of chemotherapy [1]. Over the years, the development of intervention strategies for CIPN is not optimistic, which is not only due to the complicated underlying molecular mechanisms of different chemotherapy agents, but also limited by the lack of sensitive and targeted assessment tools to monitor the efficacy of intervention strategies [2,3,4,5].

Early assessment tools for CIPN mainly based on clinician-reported outcome (CRO), including World Health Organization (WHO) Scale [6], Eastern Cooperative Oncology Group (ECOG) Scale [7], Ajani scale [8], and National Cancer Institute-Common Terminology Criteria for Adverse Events (NCI-CTCAE) [9], which tend to underestimate the degree of neuropathy and lead to evaluation bias. Given the subjective nature of neurological symptoms, patient-reported outcome (PRO) measures are increasingly recognized as essential to comprehensively collect CIPN symptom information, and the most commonly used CIPN assessment tools in clinical practice. A series of PRO assessment tools have been produced in the last two decades, such as the Functional Assessment of Cancer Therapy/Gynecologic Oncology Group—Neurotoxicity (FACT/GOG-Ntx) subscale [10], the European Organization of Research and Treatment of Cancer (EORTC) Quality of Life Questionnaire—CIPN 20-item (QLQ-CIPN20) scale [11], the Patient Neurotoxicity Questionnaire (PNQ) [12], the Chemotherapy-Induced Peripheral Neuropathy Assessment Tool (CIPNAT) [13], the Comprehensive Assessment Scale for Chemotherapy-Induced Peripheral Neuropathy (CAS-CIPN) in Survivors of Cancer [14], and the Treatment-induced Neuropathy Assessment Scale (TNAS) [15]. PRO assessment tools have been extensively used as the primary index in CIPN-associated clinical trials, but still with some limitations in practical application. Most assessment tools attempt to be universally applicable, thus abandoning some specific neuropathic symptoms caused by individual chemotherapeutic agents, which would result in low overall-level positive response rate (O-PRR, which refers to the proportion of items with answers other than "none" or "not at all" in the overall scale). Floor effects, reflected in the low and close total score, reduces the sensitivity and anti-interference of the tools, making it susceptible to the basic disease and the subjective misjudgment of patients and difficult to reflect the change of symptoms under the intervention strategies. Therefore, developing a set of highly sensitive and targeted CIPN assessment tools could be the key of solving the problems above.

As oxaliplatin is the core agent in chemotherapy regimens such as FOLFOX, XELOX, SOX, oxaliplatin-induced peripheral neuropathy (OIPN) has become one of the most common adverse effects of the gastrointestinal cancer treatment, as well as the most noteworthy one in CIPN. OIPN can be subdivided into acute and chronic forms. About 89% of patients would experience acute OIPN, which mainly manifested as sensitivity to touching or swallowing cold items, throat discomfort, and muscle cramps, appeared within a day after the infusion of oxaliplatin, peaked in severity at day 3, and then alleviated. Chronic OIPN presents dose accumulation, mainly manifested in paresthesia symptoms including numbness and tingling at the extremities of the limbs, and a small number of proprioceptive disturbance and ataxia. Symptoms persist during treatment and can be aggravated after drug withdrawal (known as "coasting" phenomenon). The recovery is so slow that even 18 months after completing chemotherapy, 19% of patients still suffer from severe neuropathic symptoms [16]. In the early stage, acute OIPN has not received much attention as it is short-term reversible. More effort has been put into the prevention and treatment of chronic OIPN, until studies found that the condition of acute OIPN could predict the development of chronic OIPN [16,17,18]. Considering the refractory nature of chronic OIPN and the pervasiveness of acute OIPN, timely assessment of acute OIPN is obviously more valuable. However, existing CIPN assessment tools are inadequate for acute symptoms so that it is difficult to provide early warning for chronic progress. The purpose of this study is to develop an CIPN assessment tool dedicated for oxaliplatin to enhance the clinical practicability of the CIPN assessment tools, promote the development of CIPN precision assessment with more targeted and accurate content design, and provide support for the development of CIPN prevention and treatment strategies. In addition, we hope to further develop other CIPN assessment tools with this standardized model, so as to establish a systematic and accurate evaluation system.

Methods

Overall design

In order to standardize the development of the questionnaire module, the EORTC questionnaire module development guidelines were used as references [19]. The development phase was separated into three steps. The first step was literature review for sorting out OIPN-related symptoms and producing a symptom list. An expert panel, consisted of nine oncologists and three nursing experts with extensive experience in OIPN prevention and treatment, was assembled for symptom supplement. The second was item generation, that is, to translate the symptoms into plain language and generate an preliminary version of the questionnaire. The third was item improvement. Items were deleted, added or reorganized according to the content correlation analysis carried out by the expert panel and the pre-test performed among a few of patients, so as to generate a modified version of the questionnaire. In the validation phase, the field test was conducted in a Chinese population-based prospective cohort of patients, and testing data were analyzed for the reliability, validity and sensitivity of the questionnaire. The flow chart is shown in Fig. 1.

Fig. 1
figure 1

Flow chart of development and validation. CIPNIA-OS: Chemotherapy-induced Peripheral Neuropathy Integrated Assessment – Oxaliplatin Subscale; QLQ-CIPN20: European Organization of Research and Treatment of Cancer Quality of Life Questionnaire-CIPN 20-item; CTCAE: National Cancer Institute-Common Toxicity Criteria Adverse Events; ICC: Intraclass Correlation Coefficient; EFA: Exploratory factor analysis

Literature review

A keyword search was conducted in Pubmed, MEDLINE and CNKI. Keywords included "oxaliplatin" or "platinum" together with "neuropathy" or "neurotoxic", and the time frame was from 2004 to 2023. Literature with no or insufficiently precise description of OIPN-related clinical symptoms was excluded. A total of 445 literatures were searched, and 22 literatures with relatively detailed descriptions of OIPN-related symptoms were retained after screening, including 10 reviews, 8 prospective clinical studies, 3 retrospective analyses, and 1 mechanism research, as shown in Supplementary Table S1 [16, 18, 20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39].

Item generation

The expert panel put forward some suggestions according to the symptom list before drafting, which can be summarized as follows: a) medical terms need to be replaced with readily comprehensible and realistic descriptions; b) symptoms that have similar meanings or contain each other need to be merged; c) attributes of symptoms need to be more precise.

The expert panel also put forward five points of controversy. First, is it necessary to distinguish between upper and lower limbs according to the distribution of symptoms? Previous study recommended that symptoms in the upper and lower extremities should be distinguished distinctly as they are not identical [40], so it was necessary to evaluate the upper and lower limbs separately.

Second, is it necessary to distinguish between left and right limbs according to the distribution of symptoms? OIPN symptoms are mostly symmetrically distributed, and the difference between the left and right limbs, which may be caused by different habits of using the left and right limbs (for example, the right hand is more used for writing and using knives), does not affect the evaluation of the severity. In fact, patients tend to pay more attention to the more severe side, which is more valuable for the description of the severity of OIPN. So it does not need to distinguish between left and right.

Third, is it necessary to distinguish between acute and chronic symptoms? The clinical manifestations of acute and chronic OIPN are similar in many aspects, but differ only in terms of location, duration, and influence on daily life. Therefore, acute and chronic symptoms can be distinguished by adding items evaluating location, duration and influence on daily life.

Fourth, is it necessary to evaluate the attributes of each symptom (location, duration, influence on daily life)? In fact, the early symptoms of OIPN are mostly single. Even if more than two symptoms are present at the same time, attention to the most severe symptom is more valuable for evaluation.

Fifth, should objective assessment items be included, such as quantitative sensory testing (QST)? Given the objective tests had high requirements in assessing time point, skills and instrumentation or equipment, it was difficult to conduct frequent monitoring and widespread clinical application as the gold standard [41], so that they were not considered to be included.

Content correlation analysis

The evaluation of content validity mainly adopts the content validity index (CVI) based on experts' understanding of content correlation with the evaluation object as customary, which includes the item-level content validity index (I-CVI) and the scale-level content validity index (S-CVI). Items with excellent content validity should be composed of I-CVIs of 0.78 or higher [42]. However, in the actual evaluation process, a few of experts only linked the content correlation with the clinical incidence, which led to some clinically uncommon symptoms easy to be classified as low correlation items and excluded (such as sensory ataxia, although the incidence is not high, but its impact on chemotherapy adjustment may be decisive). To avoid this, the expert panel was called upon to evaluate clinical incidence and severity of symptoms that affect the chemotherapy adjustment based on their own perceptions, thereby quantifying the rationality and importance of each item and calculating the corresponding content correlation index (CCI). The expert-cognitive clinical incidence (assume the value as “i”) was classified into four levels, including remote (i = 0.25), low (i = 0.5), moderate (i = 0.75) and high (i = 1.00), the higher the incidence, the more valuable it is for evaluation. The expert-cognitive severity of symptoms affecting chemotherapy adjustment (assume the value as “s”) was classified into three levels, including mild (s = 1.00), moderate (s = 0.66) and severe (s = 0.33), the less severe the symptoms that affect chemotherapy adjustment, the more the symptoms weigh on the evaluation. Take the average of i and s as the CCI of each item, and to ensure that the assessment was accurate and valuable, we excluded items with an CCI lower than 0.5.

Pre-test

The pre-test was conducted in Jiangsu Province Hospital and Affiliated Hospital of Integrated Chinese and Western Medicine, Nanjing University of Chinese Medicine, both located in Nanjing, China. Sixteen colorectal cancer patients with different clinical and demographic characteristics (age, gender, educational background, chemotherapy cycle, dose of oxaliplatin, and neuropathic symptom) were selected from the two centers as subject representatives. Each subject was interviewed and the content was recorded. The interview questions included: a) Do you understand the symptoms or conditions described in the questionnaire? If not, which item or items make you confused? b) Are you willing to answer the questions truthfully? If not, what is your concern or scruple? c) Do you have any suggestion or supplement for the questionnaire? The item-level positive response rate (I-PRR, which refers to the proportion of subjects who gave the answers other than "none" or "not at all" for a single item) and the interview content would be analyzed for the basis of item improvement.

Field test

The field test was performed among patients from 30 centers of the TIMEPOINT study from June 2021 to July 2022. The center list was shown in Supplementary Table S2. The TIMEPOINT study is an ongoing multicenter, randomized, double-blind, placebo-controlled clinical trial for exploring the efficacy of herbal prescription granules on OIPN (ClinicalTrials.gov ID: NCT04690283) [43]. Taking the time window for symptom observation into account, we set independent eligibility criteria based on the TIMEPOINT study. The inclusion criteria were as follows: a) patients with malignant tumors using oxaliplatin-based chemotherapy regimens; b) predicted interval between chemotherapy cycles ≥ 3 weeks; c) Karnofsky performance status (KPS) scale ≥ 60; and d) patients aged between 20 and 75 years. The exclusion criteria were as follows: a) patients treated with other neurotoxic non-chemotherapeutic agents; b) patients with peripheral neuropathy induced by diabetes, uremia, nervous system malignancy, spinal degeneration, limb osteoarthropathy and other causes; c) patients with Alzheimer's disease, Parkinson's disease, Huntington's disease and other neurodegenerative diseases; d) patients with dyslexia, depression, anxiety, schizophrenia and other mental diseases; e) patients with a history of hereditary/familial neuropathy; and f) other situations of inability or unwillingness to complete the evaluations. All patients were required to sign an informed consent form before being included.

The sample size was set at 200 based on the internal consistency criteria (‘Excellent’) for sample size (number of items × 7 and ≥ 100 subjects) in the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) guidelines [44]. Considering that OIPN symptoms change gradually as chemotherapy progresses, repeated tests in the same cohort may yield different results. Therefore, we repeated the test 5 times in the same cohort of patients, with each subject completing the tested questionnaire and QLQ-CIPN20 scales and cooperating with the clinician CTCAE grading one day before cycle 2 to cycle 6 chemotherapy, so as to provide sufficient data for reliability and validity analysis and sensitivity comparison. There are two reasons to set the evaluation time one day before each cycle of chemotherapy. The first reason is to get the whole picture of OIPN over the full time period following the previous cycle of chemotherapy. Although acute OIPN usually recovers within a week, it is feasible to ask subjects to recall their condition of neuropathy within nearly a month, and it will not result in a lack of assessment of the persistent symptoms. The second reason is to enhance the operability and convenience of the assessment tool. Considering the frequent visits to the hospital during chemotherapy, it is difficult and unrealistic to require subjects to make a special trip to the hospital just to assess the degree of neuropathy. In contrast, the evaluation before each chemotherapy greatly enhances the convenience and operability of assessment. In addition, subjects were asked to complete another time of the tested questionnaire prior to infusion on the day of cycle 6 chemotherapy to provide data for the test–retest reliability analysis, and the interval from the preceding completion was ≥ 12 h.

Statistical analysis

All analyses were conducted using SPSS Statistics Software version 27 (IBM). Cronbach's α coefficient was calculated for the internal consistency, and intraclass correlation coefficient (ICC) was calculated for the stability of the questionnaire. As a new developed questionnaire, exploratory factor analysis (EFA) was more suitable for investigating the construct validity. Before conducting EFA, Kaiser–Meyer–Olkin (KMO) measures of sampling adequacy and Bartlett's test of sphericity were used to evaluate whether it was suitability for factor analysis. Principal component analysis method was used to extract common factors, and factors with eigenvalues > 1 and variance interpretation rate ≥ 5% would be retained. Factor rotation was performed using varimax rotation to find the corresponding relationship between the factors and items, and to give reasonable factor interpretations of the tested questionnaire. CTCAE was used as the gold standard for the criterion validity analysis, and the Pearson correlation coefficients between the tested questionnaire, QLQ-CIPN20 and CTCAE were calculated and compared to evaluate the accuracy. As the two questionnaires (the tested questionnaire and QLQ-CIPN20) have the same item number and the scoring algorithm, they can be homogeneously compared. Wilcoxon signed-rank sum test was used to compare the difference in results between the test questionnaire and QLQ-CIPN20 to demonstrate sensitivity.

Results

Symptom list and supplement

The list of OIPN-related symptoms reported in previous literature was listed, as shown in Supplementary Table S3. Build on previous reports, the classification of OIPN as acute and chronic is justified. Acute OIPN symptoms usually include cold-sensitive paresthesia (abnormal sensation represented by tingling and numbness) and dysesthesia (hypersensitivity or pain induced by touch) in the distal extremities or the perioral region, pharyngolaryngeal dysesthesia (causing swallowing or breathing difficulty), motor dysfunction (muscular fasciculation, tetanic spasm, and prolonged contraction) and rare visual impairment. Chronic OIPN symptoms are predominantly persistent paresthesia in the distal extremities, occasionally progressing to sensory ataxia and functional impairment (causing difficulty balancing, standing, walking and manipulating small objects such as writing, fastening buttons, and holding cups) [16, 18, 20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39]. According to the discussion the expert panel, the symptom list had been basically well-rounded and no supplement was proposed. Considering that acute OIPN mainly occurs during or shortly after chemotherapy and appears to predict the development of chronic OIPN [16,17,18], it is feasible and valuable to be evaluated.

Generation and improvement of the tested questionnaire

According to the suggestions put forward by the expert panel and the subject representatives, we formulated a 32-item questionnaire named "Chemotherapy-induced peripheral neuropathy Integrated Assessment – oxaliplatin subscale (preliminary version)" (CIPNIA-OS), as shown in Supplementary Fig. S1. The medical terms were concretized (for example, paresthesia was replaced by tingling and numbness), and the duplicate contents were merged (for example, cold allodynia, cold hyperalgesia, mechanical hyperalgesia, and touch evoked pain were combined into oversensitivity or pain by deleting triggers). To make symptom attributes more precise, we set up items evaluating the location, duration, and influence of the most severe symptom to the end, which were classified according to Total Neuropathy Score (TNS) [45], Levi’s scale [46], and NCI-CTCAE [9], respectively, inspired by the design of Patient-Reported Outcomes version of the CTCAE (PRO-CTCAE) [47].

Item improvement mainly depended on the results of CCI and pre-test I-PRR. The distributions of the expert-cognitive clinical incidence and severity of symptoms affecting chemotherapy adjustment of each item in the preliminary version were shown in Supplementary Fig. S2, based on which, CCIs of items were calculated and shown in Table 1. CCI less than 0.5 indicated a low content correlation, both in terms of the clinical incidence and the degree of impact on chemotherapy adjustment. Combining with the pre-test I-PRR, as shown in Supplementary Fig. S3, 8 items (items 5, 6, 12, 15, 16, 19, 20, 22) were deleted as low CCI (< 0.5) and I-PRR (= 0%). Items with I-PRR of less than 15% of the pre-test results were further interviewed to rule out accidental event. We found that stiffness of hands or arms, visual and auditory function impairment were usually present before chemotherapy, and facial cramps reported by only one patient occurred more than 3 weeks after chemotherapy and persisted just several minutes, which was considered to have a low correlation with OIPN. Therefore, items 11, 21, 28 and 29 were also considered for deletion. Compared to pain or burning sensation, hypersensitivity was reported more commonly and contained a broader description, so items 7 and 8 were revised. Besides, those patients with hand weakness were more likely to report an inability to lift or grasp objects such as cups, so item 25 was revised. Based on the improvements above, the modified version of the questionnaire containing 20 items was generated, as shown in Supplementary Fig. S4. In addition, we compared the modified version of CIPNIA-OS with current CIPN assessment tools, as shown in Supplementary Table S4.

Table 1 Content correlation index (CCI) of the preliminary version

Clinical and demographic characteristics of the field test subjects

A total of 207 patients were included in this study. 21 patients failed to complete 6 cycles of chemotherapy during treatment and were excluded, including 9 patients whose oxaliplatin-based chemotherapy regimen was adjusted due to disease progression, and 12 patients who voluntarily terminated chemotherapy due to adverse reactions other than peripheral neuropathy (7 had severe nausea and vomiting, 3 had severe myelosuppression, 1 had severe allergic skin reactions, and 1 had unexplained discomfort). Finally, data from 186 subjects were analyzed for the reliability, validity, and sensitivity of CIPNIA-OS.

The clinical and demographic characteristics of the subjects were shown in Table 2. The average age of the subjects was 57.7 years old, and 55.91% of them were female. The educational background was mostly from secondary to undergraduate level, accounting for 89.25%. Most subjects were treated with the XELOX regimen, accounting for 96.24%. 34.41% of the subjects received a cumulative dose of oxaliplatin more than 750 mg/m2. The average total scores of CIPNIA-OS and QLQ-CIPN20 were 5.12 (95% CI 4.91 to 5.33) and 2.93 (95% CI 2.77 to 3.09), respectively.

Table 2 Clinical and demographic characteristics of subjects

Reliability analysis

The results of the reliability analysis were shown in Table 3. Cronbach's α coefficient of CIPNIA-OS was 0.764 (> 0.7), indicating that the internal consistency was considerable. Due to 13 patients failed or refused to complete the second time of CIPNIA-OS at cycle 6 chemotherapy, 173 data sets were collected and calculated for the intraclass correlation coefficients (ICC). The ICC was 0.997, close to 1, indicating that the stability of CIPNIA-OS was good.

Table 3 Reliability statistics

Construct validity analysis

In the construct validity analysis, we did not analyze the last three items together in order to make the contents of the items in the same dimension. The KMO coefficient was 0.694 (> 0.5), indicating strong correlation between variables. Bartlett's test of sphericity indicated that each variable was independent and the correlation matrix was factorable (χ2 = 5726.739, p < 0.05). Seven common factors were extracted by principal component analysis method, explaining 64.21% of the cumulative variance, as shown in Supplementary Table S5. Factor loading of the rotated component matrix was shown in Table 4. Factor loadings ≥ 0.40 were marked to explore item commonalities, and factor explanations were provided in Table 4.

Table 4 Exploratory factor analysis

Criterion validity analysis

The Pearson correlation coefficient between CIPNIA-OS, QLQ-CIPN20 and CTCAE were calculated, as shown in Table 5. Both CIPNIA-OS and QLQ-CIPN20 were significantly correlated with CTCAE (P < 0.01). The Pearson correlation coefficient between CIPNIA-OS and CTCAE was 0.661, which was significantly higher than that between QLQ-CIPN20 and CTCAE (0.417, p < 0.01), indicating that CIPNIA-OS was closer to the evaluation results of CTCAE than QLQ-CIPN20.

Table 5 Pearson correlation

Sensitivity analysis

Wilcoxon signed-rank sum test showed that, the total score of CIPNIA-OS was generally higher than that of QLQ-CIPN20 (5.12 ± 3.28 vs 2.93 ± 2.47; p < 0.01), and the average difference between them was 2.19 (CI 95% 2.056 to 2.322), as shown in Tables 1 and 6. Corresponding comparisons were made between them along with chemotherapy cycles, as shown in Table 7 and Fig. 2a. The mean score of CIPNIA-OS at each cycle was significantly higher than that of QLQ-CIPN20 (p < 0.01), and the difference gradually expanded with the progress of chemotherapy (p < 0.05), as shown in Fig. 2b, indicating that CIPNIA-OS had higher sensitivity.

Table 6 Wilcoxon signed ranks test
Table 7 Score statistics of CIPNIA-OS and QLQ-CIPN20
Fig. 2
figure 2

a. Box-plot of the score distribution of CIPNIA-OS and QLQ-CIPN20. “ × ” represents mean score, “—” represents median score, and the box represents a score range of 25% to 75%. CIPNIA-OS: Chemotherapy-induced peripheral neuropathy Integrated Assessment – oxaliplatin subscale. QLQ-CIPN20: EORTC Quality of Life Questionnaire—Chemotherapy-induced Peripheral Neuropathy 20-item. b. Line chart of the mean score difference of CIPNIA-OS and QLQ-CIPN20 from the previous cycle

Discussion

The development of CIPN assessment tools has always sought to be universal, however, this is often the main reason for their low overall-level positive response rate (O-PRR) in practical application settings. Timely and precise intervention of OIPN depends on a sensitive and targeted evaluation system. Current commonly used assessment tools, whether CTCAE based on CRO [9], or QLQ-CIPN20 based on PRO [10], all have limitations such as high risk of subjective misjudgment, poor anti-interference and low sensitivity, which make them difficult to provide substantial guidance for prediction and intervention of chronic OIPN. To overcome these limitations, this study developed a CIPN assessment tool dedicated for oxaliplatin, named CIPNIA-OS, with the purpose of sensitively predicting the prognosis of OIPN and facilitating the development of a meticulous CIPN assessment system.

First of all, through literature search and expert supplement, OIPN-related symptoms were made into a list. We noted that the symptoms reported in the literature mostly contained or caused each other, for example, persistent paresthesia may be the cause of ataxia, and limb weakness may be the cause of functional impairment, so systematic induction and summary is necessary. Secondly, following the suggestions of the expert panel, symptoms were translated into easy-to-understand description items, so that a preliminary version questionnaire containing 32 items was developed, named CIPNIA-OS. With the purpose of accurately characterizing neuropathic symptom attributes, we were inspired by PRO-CTCAE [47], and designed assessment items for the location, duration, and influence of the most severe neuropathic symptoms by referring to the extent division of TNS [44], Levi’s scale [45], and NCI-CTCAE [9], respectively. This design can give more detailed tips and options and effectively reduce the risk of bias caused by subjective misjudgment of patients. Thirdly, the preliminary version was modified according to the content correlation analysis and pre-test results. Items with low CCI and I-PRR were deleted. Meanwhile, some items were revised based on subject representatives’ descriptions. Finally, a 20-item modified version was generated. What is noteworthy is that the CCI in this study is an innovation on the basis of the content validity index of the surface validity analysis.

Subsequently, the modified version was put into field testing. Data from 186 subjects were analyzed for the reliability, validity and sensitivity of CIPNIA-OS. Cronbach's α coefficient was 0.764 (> 0.7), indicating that the overall internal consistency reliability was considerable. ICC from the test–retest results was 0.997 (close to 1), supporting good reliability of CIPNIA-OS. Seven common factors were obtained in EFA, basically distributing according to the symptoms of upper and lower limbs (except perioral region). These factors explained 64.21% of the cumulative variance, more than 50%, indicating passable construct validity of the questionnaire. The total scores of CIPNIA-OS and QLQ-CIPN20 were further compared to demonstrate which of them was more superior. Firstly, correlation comparison was conducted using CTCAE as the gold standard, and the results showed that the Pearson correlation coefficient between CIPNIA-OS and CTCAE was 0.661, which was significantly higher than that between QLQ-CIPN20 and CTCAE (0.417, p < 0.01), indicating that CIPNIA-OS could be closer to objective evaluation results. This might be the result of CIPNIA-OS excluding items that were susceptible to underlying diseases, such as blurred vision and difficulty hearing. Secondly, Wilcoxon signed-rank sum test showed that, the total score of CIPNIA-OS was generally higher than that of QLQ-CIPN20 (5.12 ± 3.28 vs 2.93 ± 2.47; p < 0.01), and the average difference was 2.19 (CI 95% 2.056 to 2.322). This was apparently due to the high O-PRR of CIPNIA-OS, which made it easier to distinguish between mild and severe cases. Meanwhile, the mean score of CIPNIA-OS in each cycle was significantly higher than that of QLQ-CIPN20 (p < 0.01), and the difference gradually expanded with the chemotherapy cycle progressed, indicating that CIPNIA-OS had higher sensitivity.

The strengths of this study lie in the rigorous and standardized development and verification process, which were in accordance with the EORTC questionnaire module development guidelines. The questionnaire developed was original, highly targeted and sensitive. Prospective design was used in the validation phase to verify the reliability, structure, accuracy and sensitivity of the questionnaire. However, there are still some limitations. First, this assessment tool can only be used to assess the neurotoxicity of oxaliplatin, but not for other neurotoxic chemotherapeutic drugs. Second, the observation period was only limited to the chemotherapy period, and follow-up was not conducted for the post-chemotherapy phase, which may lead to the omission of some chronic OIPN symptoms. Third, all patients were hospitalized, which could potentially cause population selection bias. Fourth, although CIPNIA-OS provided some detailed features of neuropathic symptoms and improved the situation of low scoring slightly, but not ideal. Further setting up new scoring models, such as changing the weight of different items, is a possible solution. Developing specific assessment tools for different application settings is a good strategy to respond more sensitively and accurately to the actual situation. Considering that neuropathic symptoms induced by different chemotherapeutic agents have common ground, it is also promising to further develop a systematic evaluation tool with questions that can be answered selectively according to different settings.

Conclusions

This study developed an original CIPN questionnaire, named CIPNIA-OS, which was dedicated for OIPN assessment. It was a comprehensive tool that covered acute OIPN symptoms and integrated features from several validated CIPN assessment tools. The validation results supported that CIPNIA-OS had good reliability, stability, construct, criterion validity, and was more accurate and sensitive than QLQ-CIPN20 in the evaluation of OIPN. Further study should focus on the standard of score division and the corresponding intervention strategies, as well as assessment tools for other neurotoxic chemotherapeutic agents (such as taxanes, vinca alkaloids, etc.), so as to make it systematic and really universal.