Publication | Open Access
De-duplication of database search results for systematic reviews in EndNote
1.7K
Citations
5
References
2016
Year
EngineeringSystematic Literature StudyFull-text DatabaseData DeduplicationData PublishingMultiple DatabasesText MiningDatabase Search ResultsDuplicate RecordsInformation RetrievalData ScienceData IntegrationBiostatisticsQuality ReviewPublic HealthBiomedical Text MiningData ManagementSearch TechnologyHealth InformaticsInformation ProfessionalsRecord LinkageSoftware ReviewClinical Database
When conducting exhaustive searches for systematic reviews, information professionals search multiple databases with overlapping content [1][2][3][4].They typically remove duplicate records to reduce the reviewers' workload associated with screening titles and abstracts; sometimes the reviewers remove the duplicates.Several articles have been published recently on de-duplication methods.In the authors' opinion, these methods are either very time consuming [5] or impractical, as they require uploading large files to an online platform [6,7].A recent overview article compared existing software programs but found that none was truly satisfactory [8].Unique identifiers for journal articles are digital object identifiers (DOIs) and PubMed IDs (PMIDs).However, these identifiers are not present in every database.When they are present, they often cannot be exported easily.Thus, they cannot be relied upon to identify duplicates.An alternative involves using pagination, because the often large page numbers in scientific journals, in combination with other fields, can serve as a type of unique identifier.However, this is complicated by variations in the way page numbers are stored.Most biomedical databases use a long format (e.g.,
| Year | Citations | |
|---|---|---|
Page 1
Page 1