BMJ Evidence-Based Medicine | 2019

41\u2005Validation of crowdsourcing for citation screening in systematic reviews

 
 
 
 
 
 
 
 
 
 
 
 
 

Abstract


Objectives Systematic reviews (SRs) are often cited as the highest level of evidence available as they involve the identification and synthesis of published studies on a topic. Unfortunately, it is increasingly challenging for small teams to complete SR procedures in a reasonable time period, given the exponential rise in the volume of primary literature. Crowdsourcing has been postulated as a potential solution. The feasibility objective of this study was to determine whether an online crowd would be willing to perform and complete abstract and full text screening. The validation objective was to assess the quality of the crowd’s work, including retention of eligible citations (sensitivity) and work performed for the investigative team, defined as the percentage of citations excluded by the crowd. Method We performed a prospective study evaluating the feasibility and validity of crowdsourcing essential components of an SR, including abstract screening, document retrieval, and full text assessment. Using the CrowdScreenSR citation screening software, 2323 articles from 6 SRs were available to an online crowd. Citations excluded by less than or equal to 75% of the crowd were moved forward for full text assessment. For the validation component, performance of the crowd was compared with citation review through the accepted, gold standard, trained expert approach. Results Of 312 potential crowd members, 117 (37.5%) commenced abstract screening and 71 (22.8%) completed the minimum requirement of 50 citation assessments. The majority of participants were students (192/312, 61.5%). The crowd screened 16,988 abstracts (median: 8 per citation; IQR 7-8), and all citations achieved the minimum of 4 assessments after a median of 42 days (IQR 26-67). Crowd members retrieved 83.5% (774/927) of the articles that progressed to the full text phase. A total of 7604 full text assessments were completed (median: 7 per citation; IQR 3-11). Citations from all but 1 review achieved the minimum of 4 assessments after a median of 36 days (IQR 24-70). When complete crowd member agreement at both levels was required for exclusion, sensitivity was 100% (95%CI 97.9-100) and work performed was 68.3% (95%CI 66.4-70.1). Using the predefined alternative 75% exclusion threshold, sensitivity remained 100% and work performed increased to 72.9% (95%CI 71.0-74.6; P<.001). Conclusions Crowdsourcing of citation screening for SRs is feasible and has reasonable sensitivity and specificity. By expediting the screening process, crowdsourcing could permit the investigative team to focus on more complex SR tasks. This requires a user-friendly online platform that allows research teams to crowdsource their reviews. Future directions should focus on assessing the application of this methodology to real life projects and determine its potential for rapid completion of systematic reviews.

Volume 24
Pages A26 - A26
DOI 10.1136/BMJEBM-2019-EBMLIVE.49
Language English
Journal BMJ Evidence-Based Medicine

Full Text