Network meta-analysis of treatments for perineal extramammary paget’s disease: Focusing on performance of recurrence prevention
Introduction
Extramammary Paget’s disease (EMPD) is a rare refractory cutaneous malignancy, it predominantly affects apocrine gland-bearing skin and occurs preferably in perineal organs such as scrotum, penis and vulva [1–3]. The clinicopathological manifestations of EMPD commonly presents as red scaly lesions accompanied by itchy and painful feelings [1, 4]. Although its progression is indolent and metastasis rarely occurs, EMPD is deemed as a challenging disease due to its stubbornly high recurrence rate [5–7]. Since its first report, the therapeutic approaches of EMPD have been under constant exploration and development. Currently, the documented treatments of EMPD are numerous, diversified, and reported by different studies with outcomes varying from one to each other [4, 8]. However, facing literature in such large quantities, clinicians are struggling to find out which therapy has relatively low recurrence rate and thus is superior to others. To gain better insight into the recurrence-preventing ability of current treatments for EMPD, we here performed a network meta-analysis (NMA) based on previous comparative studies. Through our analysis, we hope to provide reference for clinical decision making of EMPD management and future relevant studies.
Literature search
We searched databases of Pubmed, Embase and Google Scholar for studies published in English between January 1980 and July 2023, by using ‘the searching strings of “((Extramammary Paget’s disease) AND (treatments)) AND (recurrence)”. We also examined reference lists from all included studies, sourced studies from known reviews on the topic, and sought input from expert contacts. The study and protocol have been registered in PROSPERO database (CRD42023415693).
Selection criteria
To be eligible for inclusion, 1) studies must be comparative research of at least two EMPD treatments, or observational studies that documented the information of at least two EMPD treatments; 2) the studies must include the basic demographic information (sample size, age, gender ratio, patient number in each treatment group, follow-up time), and details of recurrence (recurrent patients in each treatment group). 3) the studies should be reported original research, clearly described a clinical intervention, and were in the English language.
Exclusion criteria was set as: 1) studies reporting treatments of EMPD patients with metastasis or accompanied by other malignancy; 2) reviews or studies of case reports; 3) studies with eligible sample size fewer than 10 patients; 4) studies reporting EMPD in axilla and other non-perineum areas; 5) studies which have critical risk of bias assessed by ROBINS-I tool (Risk of Bias in Non-randomized Studies of Interventions) [9].
Searching work was conducted by two reviewers (of XY and RX). They also appraised the risk of bias of the results of all studies that met inclusion criteria. Agreement by consensus was reached on the risk of bias for the results of each study. Disagreements were resolved by discussion with a third reviewer (XC).
Data extraction
The following data were extracted from the 29 articles: first author, country, publication year, sample size, mean age, gender ratio, location of lesion (genitals or perianal region), follow-up time, treatment, patients in each treatment group, recurrence in each treatment group.
Statistical analysis
Bayesian random-effects approach with Markov-chain Monte Carlo simulation was carried out for the network meta-analysis. Currently, there are several platforms for conducting NMA, and we adopted R software (version 4.0.3) due to the abundance and flexibility of relevant R packages.
Here, in our Bayesian analysis, the package of gemtc [10] and dmetar [11] were used. Following the package instruction, we arranged the study data (including study labels, sample size, treatments, and recurrence in each treatment) into arm-based form. Applying these information, gemtc established the Bayesian hierarchy model (random-effects model) using a Markov Chain Monte Carlo (MCMC) simulation (simulating parameters were set as: Iterations = 100000, Thinning interval = 10, Number of chains = 4, Sample size per chain = 10000) to obtain the posterior distribution (prior parameters were set as N (0, 10, 000) and σ ∼ Unif(0, 5)) of the odds ratio for the treatments. Convergence of the MCMC was confirmed by using the gelman.plot and gelman.diag function from the coda package (see in S1 Fig). Network graph, direct/indirect evidence plot, and forest plot were generated by the gemtc package function of network, direct.evidence.plot (dmetar package) and forest, respectively. We ranked treatments by using surface under the cumulative ranking (SUCRA) score (which measure the extent of certainty that a treatment is better than another) [12]. Global inconsistency was assessed by comparing deviance information criteria (DIC) statistics between different models [13]. Considering the length of follow-up time may exert influence on recurrence outcome, and differences in gender and location of EMPD may yield different results, we also have conducted subgroup analysis based on follow-up time (≥ 60 months and ˂ 60 months), gender (male and female), and lesion locations (genitals and perianal regions). A comparison adjusted funnel plot was used to assess small study effects, where treatments were ordered by expected novelty. Studies with extreme effects on heterogeneity (outlier studies) were detected by using package NMAoutlier.
Summary measures used were odds ratio (OR) with 95% credible interval (CrI), p value < 0.05 was considered as statistically significant. The detailed R code transcript used in this research has been provided in S1 File.
Characters of included studies
In total, 1172 records were found in our initial searching, the title screening excluded 920 duplicated records, 222 articles were further screened out for unfitting inclusion criteria or with critical risk of bias (see in S1 Table), eventually there are 29 studies left for subsequent analysis (Fig 1A) [14–42]. Among the included studies, 8 therapeutic treatments for EMPD have been identified, which include wide local excision (WLE, with macroscopical surgical margin ≥ 2cm), local excision (LE, with surgical margin < 2cm), Mohs micrographic surgery (MMS), radiotherapy (RT), radical vulvectomy (RV), photodynamic therapy (PDT), imiquimod, and lasers (LS). To be noticed, some of the studies have used combined treatments (which are WLE+RT, WLE+PDT, and WLE+LS) as therapeutic measures for EMPD, and they also have been included in our analysis. Therefore, there are 11 treatment designs in total, the relationship between included studies and treatment designs is demonstrated by network graph (Fig 1B).
The 29 articles come from 7 different countries (United States, United Kingdom, China, Japan, Korea, France, and Italy) with publication year spanning from 1990 to 2023. Among them, the smallest sample size includes 10 patients, while the largest has 278 cases. Mean age varies from 60 to 77 years old, and the median follow-up time is 56.7 months. Detailed information of included studies was presented in Table 1.
Recurrence-preventing ability of different EMPD treatments
During the assessment, the WLE treatment was set as the reference, because this surgery is widely recognized as the most traditional and commonly used treating method for EMPD. Comparing with WLE, only the MMS showed significant difference in reducing EMPD recurrence, with OR of 0.18 (CrI: 0.034 to 0.87), while none of the rest treatments has statistically significant results. The detailed outcome has been demonstrated by forest plot in Fig 2A. The SUCRA ranking results also showed that MMS serves to be the best treatment dealing with EMPD recurrence, while LE has the worst performance in this regard (Fig 2B).
Outlier studies detection and NMA results after removing them
In the practices of network meta-analysis, some studies might have markedly different characteristics from others, and may exceed the degree of statistical heterogeneity that can be adequately explained by the random-effects model, and these “extreme” studies are usually referred to as the outlier studies [43]. Incorporating outlier studies often causes substantial bias that sometimes can cover up the real difference. To detect the outliers, we used the tailored package of NMAoutlier [44]. By adopting forward search algorithm, we have identified 3 major variance-producing outliers, which were demonstrated by the plot for ratio of variance (Fig 3A).
After removing the 3 outlier studies, we conducted NMA once again (also setting WLE as reference) for the remained 23 studies. As expected, the DIC score of current Beyasian model is lower, indicating inconsistence of the model is less than the previous one (Fig 3B). The forest plot in this time demonstrated that MMS (OR: 0.35, CrI: 0.11 to 0.82) and LE (OR: 13.0, CrI: 2.5 to 110) have significant difference comparing with WLE treatment in preventing EMPD recurrence (Fig 4A). SUCRA ranking result also has changed, as WLE+RT climbed to the top and followed by WLE+PDT and MMS, while LE remained in the last place (Fig 4B).
Influence of follow-up length
The variation of covariance from different studies can add heterogeneity of NMA and may impact the magnitude of effect sizes. In our research, we must consider the influence of different follow-up length upon recurrence rates. To check whether such influence exists and causes bias, we divided the outliers-removed studies into two subgroups (follow-up time ≥ 60 months and ˂ 60 months) and used the network meta-regression to conduct subgroup analysis. Results showed that the OR and CI values of the group of longer follow-up time (≥ 60 months) are universally lower than another group (˂ 60 months), presenting a “left-shifting” phenomenon on forest plot (Fig 5). Besides, while LE result is significant in both groups, MMS only has significant result in group of follow-up time ≥ 60 months. Such outcome indicates that follow-up time can influence the effect sizes. If the follow-up length is short (˂ 60 months), it tends to inflate the OR values and cause bias which can cover up the real differences.
Influence of lesion location and gender differences
Although the EMPD anatomical sites of our study have been confined to perineum, it still involves many different lesion areas such as penis, scrotum, vulva, crissum skin, etc. To investigate whether different lesion locations could lead to different recurrence outcomes, we categorized the frequently documented perineal EMPD lesion locations into genital region (including penis, scrotum and vulva) and perianal region, then conducted subgroup analysis. Similarly, to rule out the potential influence of gender differences, subgroup analysis of the data respectively from male and female patients has also been carried out (RV is not included in the gender subgroup analysis, because it is solely designed and performed for female patients). According to the forest plot (Figs 6 and 7), gender differences and lesion locations have no significant impact on the final result, as the ranges and values of OR and CI in each group showed little changes. All the plots come to the identical conclusion that MMS has the best, while LE owes the worst, recurrence data compared to WLE.
Examine the small study effects by conducting funnel plot
Comparison-adjusted funnel plots have been widely proposed to evaluate the risk of small-study effects (equal to publication bias here) in network meta-analyses [12]. Our funnel plot is depicted by funnel function of meta package. As shown in Fig 8, the distribution of comparisons is symmetric and p value of Egger’s test is 0.8608, which indicates that the small-study effects are not significant in our research.
Discussion
This study is focusing on recurrence outcomes of perineal EMPD treatments (for both male and female patients), our literature searching process has identified 11 treatment designs (WLE, LE, MMS, RT, RV, PDT, Imiquimod, LS, WLE+RT, WLE+PDT, and WLE+LS) of perineal EMPD from the 29 included studies. Among them, imiquimod is a topical cream which emerged as a promising therapy recently [3, 45], and the WLE is acknowledged as the most frequently used method for EMPD [1, 4]. Many scholars believe that the residual minimal lesions scattered around the gross tumour edge are the main cause of postoperational recurrence, so a wider resection margin (defined as > 2cm) is proposed by great majority of clinicians [4, 46]. Considering the universality of WLE in EMPD treatments, we set it as the reference in our NMA and hope to find a better alternative.
During the initial analysis of all the 26 included studies, MMS is shown in forest plot as the only treatment that has significant advantages in reducing EMPD recurrence, and it is also recommended by SUCRA rank as the best intervention in such regard. According to literature background, MMS is the surgical procedure which removes cutaneous tumour in a series of horizontal layers. It is performed by removing a thin margin of tissue circumferentially around and deep to the clinical margins of a skin tumor. The specimen is typically removed with a 45-degree bevel to facilitate tissue processing. It is then rapidly frozen and sectioned in a cryostat microtome, allowing for quick tissue processing (about 15 to 30 minutes). Sectioning the tissue in a horizontal direction allows virtually 100% of the tissue margin (peripheral and deep margins) to be examined under the microscope. The process is repeated until the tumor has negative histologic margins [47]. We believe such characters ensure the maximum removal of diseased tissues and thus give the MMS more advantages than WLE, which is relatively rougher in dealing with underlying lesions.
In the following analysis, 3 outlier studies have been detected and heterogeneity of the Beyasian model shows substantial reduction after removing them (DIC score reduced). A closer re-examination of these 3 studies has discovered confounding factors, which include blending unstated treatments into the control groups (Li, et al [24] Hata, et al [33]) and performing preoperative mapping biopsies to unknown number of patients (Hatta, et al [22]). Excluding these confounders is beneficial for bias control, and thus can bring us closer to the real facts.
NMA was conducted once again for the outlier-removed studies. As shown in forest plot, MMS still shows significant advantages in coping EMPD recurrence, while LE in this time, also has significant result which proves it can lead to higher recurrence rates than WLE. This outcome is within our expectation. The range of LE surgical margins is less than 2cm, such resection coverage gives convenience for wound repairing, but also inevitably omits many residual small lesions which are often present in macroscopically normal tissue and bring high risks of recurrence. As an early surgical method for EMPD, LE is rarely reported in recent studies, and clinicians rather adopt flap repairing to achieve the complete resection of EMPD [4, 48].
The SUCRA rank analysis of outlier-removed studies, however, shows inconsistent result with the forest plot, as WLE+RT and WLE+PDT have higher ranking places than MMS. Facing such conflicts, we think the SUCRA ranking result should be interpreted with caution because SUCRA is a mathematical process that calculates the likelihood of becoming the best treatment, it does not consider the magnitude of differences between interventions and its estimates of different treatments often overlap. Besides, only a few included studies focus on the treatments of WLE+PDT and WLE+RT, which associate with indirect comparisons plus low evidence certainty, and this also gives the SUCRA a flaw basis.
In the final parts, we checked the influence of follow-up length, gender difference, lesion locations and small study effect. Usually, different follow-up times tend to yield different recurrence rates, the subgroup analysis has showed that such case exits in our NMA. Comparing with subgroup of longer follow-up length (≥ 60 months), OR values in the shorter follow-up group (< 60 months) have been universally inflated, and the statistical result of MMS becomes no longer significant, which indicates that inadequate follow-up time may conceal the real differences. For the examination of small study effect, funnel plot with Egger’s test is adopted. Results demonstrated that no obvious small study effect was found.
There are some limitations in our NMA. First, we did not conduct subgroup analysis for different pathologic subtypes of EMPD. According to depth of tumour invasion, EMPD is commonly classified into two types which are intraepithelial (tumour cells are confined to the epithelium) and invasive (tumour cells penetrate the dermis) Paget’s diseases. Different extent of infiltration is probably associated with different risks of recurrence [49]. Unfortunately, we have no adequate data source to verify such hypothesis, due to most of the included studies have not documented detailed information of pathologic subtypes. Secondly, the evidence certainty and study samples of some treatments (such as RT, LS, and PDT) are generally insufficient, which affects the reliability of the corresponding results, we hope this situation could be improved with the increase of relevant studies in future. Finally, it is necessary noting that none of the included studies in our NMA is randomized controlled trial and confidence intervals of estimates are generally wide, these may also weaken the evidence basis and compromise the reliability.
In summary, based on included studies our NMA work reveals that, among the listed candidate treatments, MMS has the best performance on reducing perineal EMPD recurrence, while LE exhibits the worst capability in such regard. Recurrence-preventing abilities of other treatments such as RT, LS, WLE and PDT have no significant difference between each other. More relevant comparative studies with high evidence level and detailed pathologic information are required for verifying and furthering above conclusions.
Authors:
Xiaobin Yuan, Ruizhi Xue, and Xiaoming Cao.