Many works are published on a preprint server such as the arXiv before publication in a peer-reviewed journal. Since the arXiv can be updated during the peer-review process, it is common to find practically identical copies of journal articles on the arXiv. When writing an article, it is recommended to include both the journal reference and the preprint link.
Is there a tool that can go through a BibTeX file, look for entries
- with an arXiv preprint identifier (e.g. arXiv:333.1234v1) but no journal reference, or
- for a journal article without an eprint identifier,
and try to fill in the missing information?
This could be implemented in two ways:
- Go through two .bib files, one with preprints and one with journal articles, find matching pairs (by comparing authors and title), and write a third .bib file with the merged entries.
- Take a single .bib file and use online resources (such as Google Scholar, arXiv, ...) to fill in any missing fields.
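To illustrate what I mean by the first approach, here is a minimal sketch in Python. It assumes the two .bib files have already been parsed into lists of plain dictionaries with lowercase field names (for example by a BibTeX parsing library); the function names, the 0.9 similarity threshold, and the list of copied fields are my own choices for illustration, not taken from any existing tool.

```python
import re
from difflib import SequenceMatcher


def normalize_title(title):
    """Lowercase a title and strip TeX commands, braces, and punctuation."""
    title = re.sub(r"\\[a-zA-Z]+", " ", title)        # drop commands such as \emph
    title = title.replace("{", "").replace("}", "")   # drop case-protecting braces
    title = re.sub(r"[^a-z0-9 ]", " ", title.lower())
    return " ".join(title.split())


def surnames(author_field):
    """Return the set of lowercase surnames from a BibTeX author field."""
    names = set()
    for author in re.split(r"\s+and\s+", author_field.strip()):
        if not author.strip():
            continue
        if "," in author:              # "Last, First"
            last = author.split(",")[0]
        else:                          # "First Last"
            last = author.split()[-1]
        names.add(last.strip("{} ").lower())
    return names


def is_match(preprint, article, threshold=0.9):
    """Heuristic: identical surname sets and nearly identical titles."""
    a = surnames(preprint.get("author", ""))
    b = surnames(article.get("author", ""))
    ratio = SequenceMatcher(
        None,
        normalize_title(preprint.get("title", "")),
        normalize_title(article.get("title", "")),
    ).ratio()
    return bool(a) and a == b and ratio >= threshold


def merge_entries(preprints, articles, threshold=0.9):
    """Copy eprint-related fields from the first matching preprint into each article."""
    merged = []
    for article in articles:
        entry = dict(article)          # leave the input untouched
        for preprint in preprints:
            if is_match(preprint, article, threshold):
                for field in ("eprint", "archiveprefix", "primaryclass"):
                    if field in preprint and field not in entry:
                        entry[field] = preprint[field]
                break
        merged.append(entry)
    return merged
```

This is only a heuristic: titles that changed substantially during peer review, or accented names written differently in the two files, would be missed, so the output would still need a manual check.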
While reference managers can often look up information about journal articles (e.g. via DOI) and sometimes query the arXiv, I'm not aware of any semi-automatic method to match the preprint and journal records with each other.
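For the second approach, one possible building block is the public arXiv API, which returns an Atom feed whose entries may carry `journal_ref` and `doi` elements when the authors have supplied them. The sketch below is a rough illustration under that assumption; the function names are hypothetical, and since these fields are optional, many preprints will return nothing for them.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode
from urllib.request import urlopen

ATOM = "{http://www.w3.org/2005/Atom}"
ARXIV = "{http://arxiv.org/schemas/atom}"


def parse_arxiv_feed(xml_data):
    """Extract title, journal reference, and DOI (None if absent) from an Atom feed."""
    results = []
    for entry in ET.fromstring(xml_data).iter(ATOM + "entry"):
        results.append({
            # Titles may be wrapped over several lines; collapse the whitespace.
            "title": " ".join((entry.findtext(ATOM + "title") or "").split()),
            "journal_ref": entry.findtext(ARXIV + "journal_ref"),
            "doi": entry.findtext(ARXIV + "doi"),
        })
    return results


def fetch_arxiv_metadata(arxiv_id):
    """Query the arXiv API for one identifier (requires network access)."""
    url = "http://export.arxiv.org/api/query?" + urlencode({"id_list": arxiv_id})
    with urlopen(url, timeout=30) as response:
        return parse_arxiv_feed(response.read())
```

A tool along these lines could call `fetch_arxiv_metadata` for every entry that has an eprint field but no journal field, and then offer the returned journal reference or DOI to the user for confirmation rather than writing it blindly.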