Spontaneous Publication of Raw Research Data

Jean-Claude Bradley

Despite the lack of extrinsic motivating factors, such as formal publication recognition or financial gain, some researchers have been posting the raw data of their experiments by any means available, often with considerable work and care. (1)

Virtually every researcher has been doing this in a different way, reflecting not only differences between fields but also how the information is represented within each individual's mental framework. (2)

This is a clear signal that information wants to be free, or at least that is how it is behaving.

Going forward, I would expect an increase in data sharing, irrespective of any change in the scientific research reward structure. I do not expect a single standard representational system to emerge quickly. It seems more likely that the data will be posted first, then later duplicated and converted to adhere to the conventions of various databases as they arise.

Redundant data has survival value.

As a recent example, Frank Gibson writes (3):

Several months ago, about 3, I made a public commitment to make the data I have generated during my PhD open and available online. Well, I have not ignored this, and in the interim I have been investigating various ways I can do this. Not only do I want to make it available, but I want to structure it in a standard form, namely the gelML format. In addition, I was involved in developing the specification and therefore I have somewhat of an obligation to use it. As it is an XML transfer format, I needed to be able to make changes and revise it, like developing code, so in that sense recording the data on a wiki or blog would not be appropriate. For this reason I have chosen to create a Google Code project for gel electrophoresis data and do everything in Subversion. You can browse the Subversion repository or check it out anonymously. The gelML file that will eventually (as it's still very much a work in progress) contain the data is here. As I am doing this, I thought I might as well publish my lab book while I was at it. This will be done using LaTeX, and the PDF that gets generated can be found here.
