Belatedly, here’s a video of Brian introducing the CLARION project at the JISC programme meeting in July: –
The advert has finally come out. We’re looking for an experienced Java programmer, with skills in as many of the following as possible: XML, OO design, SPARQL, RDF, RESTful web development, Clojure (or Scheme), Triplestores. This is an exciting opportunity to apply interesting technologies to a challenging and worthwhile application: enabling the publication of Open Data to support science.
Closing date for applications is 24 Aug. See the university jobs website or next week’s Cambridge Evening News for details on how to apply.
Brian and I are attending the JISC programme meeting at Leicester Uni. We’ll be presenting CLARION very (like, 30 seconds) briefly tomorrow morning, but if anyone wants to chat about the project this evening then come and grab us!
CLARION is starting the process for recruiting a developer. We need someone who is smart, with experience with open software techniques and tools; good knowledge of XML; ideally someone who has experience within a scientific environment. And experience of RDF or CML would be the icing on the cake! Work is based in Cambridge. Do you know of anyone suitable…? 😉
CLARION project – Cambridge Chemistry Department
The data challenge: Chemistry laboratories produce many types of information and data – raw data, processed data, observations, chemical structures, reaction schemes, experimental write-ups, conclusions, graphs, images, crystallographic, spectroscopy data, papers, references, and so on. It is challenging to store this variety of information such that it is accessible and usable by a variety of users. The challenges include:
- Storing data in formats that allow its use by specialist data processing tools
- Using data formats that are suitable for publication and long-term preservation
- Allowing certain data to be used by people outside the department
- Motivating researchers to open their data
- Enhancing the meaning and context of the data to improve its usability
- Making the data searchable and easily navigable
- Ensuring that the system has minimal support overheads, yet continually evolves as required to meet changes in the IT environment.
Using an ELN: The Cambridge Chemistry Department has a basic repository which stores crystallographic data. Project CLARION (Cambridge Laboratory Repository In/Organic Notebooks) will create an enhanced repository that captures core types of chemistry data and ensures their access and preservation. The Chemistry Department is implementing a commercial Electronic Laboratory Notebook (ELN) system; CLARION will work closely with the ELN team to create a system for ingesting chemistry data directly into the repository with minimum effort by the researcher.
Enhancing and expanding data usage: CLARION will provide functionality to enable scientists to make selected data available as Open Data for use by people external to the department. The project will use techniques for adding semantic definition to chemical data, including RDF (Resource Description Framework) and CML (Chemical Markup Language). Much of these techniques will be extensible to other disciplines. CLARION will address general issues such as ownership of data, and it will publicise its results to the chemistry and repositories communities. Effort will be put into developing a sustainable business model for operating the repository that can be adopted by the department after project completion.
Timelines: The project runs for two years from April 2009. The initial pilot deployment of the ELN is scheduled for late 2009, and we hope to be publishing open data from it in early 2010.
Project blog: https://clarionproject.wordpress.com/
Twitter: CLARIONproject http://twitter.com/CLARIONproject
Contact: “Brian Brooks” <email@example.com>
We’re happy to publicly announce the CLARION project, funded by the JISC to enhance the existing data repository at the Chemistry Department of the University of Cambridge, especially by integrating it with an Electronic Lab Notebook system.
You can read a little more about the project, tweet us @clarionproject or refer to us as #clarionproject.