Can Common Crawl Reliably Track Persistent Identifier (PID) Use Over Time
- Submitting institution
-
University of Edinburgh
- Unit of assessment
- 11 - Computer Science and Informatics
- Output identifier
- 59252172
- Type
- E - Conference contribution
- DOI
-
10.1145/3184558.3191636
- Title of conference / published proceedings
- Companion Proceedings of the The Web Conference 2018
- First page
- 1749
- Volume
- -
- Issue
- -
- ISSN
- -
- Open access status
- -
- Month of publication
- April
- Year of publication
- 2018
- URL
-
-
- Supplementary information
-
-
- Request cross-referral to
- -
- Output has been delayed by COVID-19
- No
- COVID-19 affected output statement
- -
- Forensic science
- No
- Criminology
- No
- Interdisciplinary
- No
- Number of additional authors
-
1
- Research group(s)
-
D - Language, Interaction and Robotics
- Citation count
- -
- Proposed double-weighted
- No
- Reserve for an output with double weighting
- No
- Additional information
- First-ever large-scale study of the growth in use on the Web of the now-preferred Digital Object Identifier (DOI) for scholarly articles. By comparing the use of the persistent form (e.g. https://doi.org/10.1145/3184558.3191636) with the end-point version actually used to reach a paper (e.g. http://delivery.acm.org/10.1145/3200000/...) we have tested the risk posed to the overall project of moving to persistent identifier by 'leakage' of the non-persistent end-point form: about 10% of the DOIs showed some leakage between April 2014 and April 2017. This finding is important for motivating the mechanisms necessary to facilitate bookmarking and citing of the persistent form.
- Author contribution statement
- -
- Non-English
- No
- English abstract
- -