There are many possible approaches to distributing data and code. Plenty of resources are available for posting data and code to the internet, and they are very easy to access. It is more difficult to find exemplars of data and code that have been distributed easily and effectively. In this and following notes I review some of the resources that describe procedures for doing this, and present exemplars.
A paper that describes the rOpenSci project’s approach is http://dx.doi.org/10.5334/jors.bu:
Boettiger, C., Chamberlain, S., Hart, E., & Ram, K. (2015). Building Software, Building Community: Lessons from the rOpenSci Project. Journal of Open Research Software, 3(1).
This paper focuses on how the community development and capacity building parts of the project were conducted. The diagram shown below is introduced as an example of the way rOpenSci recommends that a data analysis workflow be constructed.
The focus on publishing data to a public repository so early in the project (prior to final analysis and manuscript) seems premature to me. But then, I do feel that I am somewhat more concerned than the rOpenSci team about vexatious activity by climate skeptics.