Open access publisher BioMed Central has announced that BMC series biology journals have joined BMC Research Notes in encouraging all authors of original research-based articles to publish supporting data permanently online. The authors have also been urged to include an 'Availability of supporting data' section in their articles. The series includes journals such as BMC Bioinformatics, BMC Systems Biology, BMC Ecology and BMC Genomics.
Different journals and publishers have different approaches to data sharing. A recent feasibility study by JISC and the Research Information Network reportedly has the potential to produce a comprehensive Journal Research Data Policy Bank (JoRD) cataloguing these different policies. Meanwhile, non-comprehensively, some journals such as the BMJ require a data sharing statement from all authors. Others – including all BioMed Central journals as a minimum requirement – imply data sharing on request as a condition of submission or publication. BioMed, while viewing data sharing statements are as a step in the right direction, also expects to see evidence of data sharing in practice.
The 'Availability of supporting data' section aims to make it easier to identify articles with supporting data available, and enhance the literature. Readers can easily search for all published articles which include this section. Datasets available permanently online should also be cited in an article's reference list, as well as being linked from the ‘Availability of supporting data’ section.
This policy change on the BMC series biology journals adds another 20 journals to those BioMed Central journals either encouraging or requiring all authors to permanently link their article to supporting data, or include their supporting data as additional files with their article. To show further evidence of the benefits of data sharing to scientific progress, reuse of published datasets is stated to be undoubtedly important.
In July 2012, data collected on reuse of datasets deposited in Gene Expression Omnibus (GEO) and discussed on Heather Piwowar’s blog suggests many public datasets are reused, not just an elite few. Hopefully the introduction of the 'Availability of supporting data' section on more journals should make it easier to find articles with supporting datasets available – both elite and non-elite – for reuse.