Okay, the Pilot is over ending with 17 papers, 16 of which have CiTO annotations (and so far 4 J.Cheminform. papers after the pilot), but my interest in the Citation Typing Ontology continues and we just need more adoption.
Datasets as source of annotations
So, here's a quick Wikidata update. I have been using Wikidata as infrastructure to collect and share CiTO annotations (see also the below "Scholia patch" posts). Some time ago I recovered my CiteULike CiTO annotations and made this available on Zenodo (doi:10.5281/zenodo.7368209).
And while thinking about datasets with CiTO annotations, I found two other datasets. One was from an article in Portuguese and one from an article by Peroni et al. with this data file. That data file is actually a zip, but inside the zip file is a CSV file with three interesting columns: cited_doi, citing_doi, and intext_citation.intent. There are many more columns and I can highly recommend browsing them. But these are the three I need to add data to Wikidata. The third column is free text, but using the CiTO for labels, making it relatively easy to convert to citation intentions from Wikidata (PS, thanks to Fvtvr3r for adding more!).
So, I had a cleaned file and started writing a Groovy Bioclipse script using Bacting. It basically does a few things: extract all DOIs, check which ones are in Wikidata, analyze the intext_citation.intent column content, and then generate QuickStatements (see this gist). Out of the 600 lines from the input, it creates some 200 new CiTO-annotated citations in Wikidata between some 150 article pairs:
Some more history
- 2021: BioHackathon Europe 2021 #1: CiTO annotations in BioHackrXiv
- 2021: Markdown template for the Journal of Cheminformatics with CiTO support
- 2020: CiTO updates #3: third paper in the collection and updated Scholia patch
- 2020: CiTO updates #2: annotation migration to Wikidata and first Scholia patch
- 2020: CiTO updates #1: first research paper in the Journal of Cheminformatics with CiTO annotation published
- July 2020: New Editorial: "Adoption of the Citation Typing Ontology by the Journal of Cheminformatics"
- 2015: "What You're Doing Is Rather Desperate"
- 2012: CiTO / CiteULike: publishing innovation
- 2010: CiteULike CiTO Use Case #1: Wordles
- September 2010: A list of things I miss in CiteULike
No comments:
Post a Comment