Deja vu: a database of highly similar citations in the scientific literature

dc.contributorVirginia Techen
dc.contributor.authorErrami, Mouniren
dc.contributor.authorSun, Zhaohuien
dc.contributor.authorLong, Tara C.en
dc.contributor.authorGeorge, Angela C.en
dc.contributor.authorGarner, Harold R.en
dc.description.abstractIn the scientific research community, plagiarism and covert multiple publications of the same data are considered unacceptable because they undermine the public confidence in the scientific integrity. Yet, little has been done to help authors and editors to identify highly similar citations, which sometimes may represent cases of unethical duplication. For this reason, we have made available De´ ja` vu, a publicly available database of highly similar Medline citations identified by the text similarity search engine eTBLAST. Following manual verification, highly similar citation pairs are classified into various categories ranging from duplicates with different authors to sanctioned duplicates. De´ ja` vu records also contain user-provided commentary and supporting information to substantiate each document’s categorization. De´ ja` vu and eTBLAST are available to authors, editors, reviewers, ethicists and sociologists to study, intercept, annotate and deter questionable publication practices. These tools are part of a sustained effort to enhance the quality of Medline as ‘the’ biomedical corpus. The De´ ja` vu database is freely accessible at The tool eTBLAST is also freely available at
dc.description.sponsorshipP.O'B. Montgomery Distinguished Chair (to H.G.); the Hudson Foundation (to H.G.); National Institute of Health/National Library of Medicine grant (R01 LM009758-01 to H.R.G.). Funding for open access charge: P.O'B. Montgomery Distinguished Chair.en
dc.identifier.citationErrami M, Sun Z, Long TC, George AC,en
dc.identifier.citationGarner HR.en
dc.identifier.citationDeja vu: a database of highly similar citations in the scientific literature. Nucleic Acids Res. 2009;37:D921–4.en
dc.publisherOxford University Pressen
dc.rightsIn Copyrighten
dc.subjectDatabases ,Bibliographicen
dc.subjectDuplicate Publication as Topicen
dc.subjectUser-Computer Interfaceen
dc.titleDeja vu: a database of highly similar citations in the scientific literatureen
dc.title.serialNucleic Acids Researchen
dc.typeArticle - Refereeden
Original bundle
Now showing 1 - 1 of 1
Thumbnail Image
3.21 MB
Adobe Portable Document Format