tag:blogger.com,1999:blog-37468076518484934102024-03-19T11:35:46.195+00:00RepositoryManThe Blog of a repository administrator and web scientist. Leslie Carr is a researcher and lecturer who runs a research repository for the School of Electronics and Computer Science in the University of Southampton in the UK. This blog is to record the day to day activities of a repository manager.Leslie Carrhttp://www.blogger.com/profile/16951479417243623642noreply@blogger.comBlogger138125tag:blogger.com,1999:blog-3746807651848493410.post-85452473327343125432015-04-17T12:10:00.002+01:002015-04-17T12:10:46.077+01:00EPrints for EPSRC Data Management<div class="MsoNormal" style="font-size: 11pt; margin: 0cm 0cm 0.0001pt;">
<div style="font-family: Calibri, sans-serif;">
The following simple Research Data Management advice has just been set around my institution for staff publishing papers to satisfy the new EPSRC data mandate. Although each institution will provision research data differently, it was great to see all the work that has been done over the last few years distilled into a simple set of instructions that even professors can understand!</div>
<br />
<ol>
<li style="font-family: Calibri, sans-serif;"><span style="font-size: 11pt;">Write the paper</span></li>
<li style="font-family: Calibri, sans-serif;"><span style="font-size: 11pt;">Login to EPrints</span></li>
<li style="font-family: Calibri, sans-serif;"><span style="font-size: 11pt;">Go in to manage deposits</span></li>
<li style="font-family: Calibri, sans-serif;"><span style="font-size: 11pt;">Click on the Add New Data Set button</span></li>
<li style="font-family: Calibri, sans-serif;"><span style="font-size: 11pt;">Upload an Excel spreadsheet with the data in from the paper</span></li>
<li style="font-family: Calibri, sans-serif;"><span style="font-size: 11pt;">Fill in as many of the questions as you can, making sure you describe what the data corresponds to in the paper (e.g. Fig 1 etc…)</span></li>
<li style="font-family: Calibri, sans-serif;"><span style="font-size: 11pt;">You can link it to the grant that funded it (these should be in the system already)</span></li>
<li style="font-family: Calibri, sans-serif;"><span style="font-size: 11pt;">In the options for the upload I made the data “visible to registered users only” and embargoed it until the end of the year with “publication pending” as the reason.</span></li>
<li><span style="font-size: 11pt;"><span style="font-family: Calibri, sans-serif;">Email </span><span style="font-family: Courier New, Courier, monospace;">researchdatamanager@yourinstitution.ac.uk</span><span style="font-family: Calibri, sans-serif;"> to get a DOI - the repository team will check what you’ve entered at the same time.</span></span></li>
<li style="font-family: Calibri, sans-serif;"><span style="font-size: 11pt;">Write the following in the acknowledgements of the paper, "The data for this paper can be found at doi:10.the/DOI/you.received.above"</span></li>
<li style="font-family: Calibri, sans-serif;"><span style="font-size: 11pt;">Submit paper</span></li>
<li style="font-family: Calibri, sans-serif;"><span style="font-size: 11pt;">When the paper is accepted, make visible to all, remove embargoes, and link it to a copy of the paper that has been uploaded onto the system.</span></li>
</ol>
<br />
<div style="font-family: Calibri, sans-serif;">
Southampton's repository has an extended set of metadata fields to describe datasets that are part of the ReCollect EPrints Bazaar plugin that was <span style="font-size: 11pt;">developed by the UK Data Archive and the University of Essex, as part of the JISC MRD Research Data @Essex project.</span></div>
<div>
<span style="font-size: 11pt;"><br /></span></div>
</div>
Leslie Carrhttp://www.blogger.com/profile/16951479417243623642noreply@blogger.com0tag:blogger.com,1999:blog-3746807651848493410.post-77171854013512156522013-01-31T03:42:00.001+00:002013-02-02T13:15:32.160+00:00The Basics of Scholarly Communications in the UKIn the decade since the Budapest Open Access Initiative declared a new public good, there have been many expositions of the advantage and inevitability of Open Access and its consequences for new modes of scientific enquiry. Tony Hey (who has just claim to 'first cause' of UK open access in his position of Head of Electronics and Computer Science at the University of Southampton) has recently started a series of blog posts <a href="http://tonyhey.net/2012/12/19/a-journey-to-open-access/" target="_blank">A Journey to Open Access</a> that gives a very accessible introduction to the topic. Stevan Harnad (who was given a chair in ECS by the same Tony Hey) also blogs extensively at <a href="http://openaccess.eprints.org/" target="_blank">Open Access Archivangelism.</a><br />
<br />
In my lesser role of championing repositories and developing the capabilities of the EPrints platform, I have had the privilege of working with library and information professionals to try to explain the principles of Open Access to a broad range of academics and researchers, and I have been struck by the almost total lack of understanding of the UK scholarly communication infrastructure shown by my research colleagues.<br />
<br />
To help those who have been too busy writing papers to appreciate how those papers appear and now find themselves über-confused and offended by the Finch regime, I offer the following diagram as an introduction to Everything You Need To Know on the topic. Forget the dissemination of papers and the transfer of knowledge that form the scholarly publishing cycle, this is all about influence and power.<br />
<table cellpadding="0" cellspacing="0" class="tr-caption-container" style="margin-left: auto; margin-right: auto; text-align: left;"><tbody>
<tr><td style="text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhiXE5mFaPEdVjk5zPNcHyBYg2mbtyAUC-AOJI-heBGDSxmxYIlwxE0oSuxQCq1ZIlRXQZbVzVycf4qMFhxzUEoWPQ71lH6PFEu-wbmd5eBBGIzDmxWYVogIdjAw0FMKY6bXvdNd4dOQzc/s1600/UK+Scholarly+Comms.png" imageanchor="1" style="margin-left: auto; margin-right: auto;"><img border="0" height="300" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhiXE5mFaPEdVjk5zPNcHyBYg2mbtyAUC-AOJI-heBGDSxmxYIlwxE0oSuxQCq1ZIlRXQZbVzVycf4qMFhxzUEoWPQ71lH6PFEu-wbmd5eBBGIzDmxWYVogIdjAw0FMKY6bXvdNd4dOQzc/s400/UK+Scholarly+Comms.png" width="400" /></a></td></tr>
<tr><td class="tr-caption" style="text-align: center;"><div style="text-align: left;">
<span style="font-size: small; text-align: -webkit-auto;">Publishing companies have pushed governments towards Gold Open Access (more money for publishers) and pulled universities away from Green Open Access (no-cost parallel dissemination). Researchers themselves have sided with publishing companies and learned societies (who act like sub-branches of publishing companies) to try to maintain the stability of the publishing industry, irrespective of the health of the university sector on which it depends!</span></div>
<div style="text-align: left;">
<span style="font-size: small; text-align: -webkit-auto;"><br /></span></div>
<div style="text-align: left;">
<span style="font-size: small; text-align: -webkit-auto;">Consequently, we now have a government proposal (the Finch report) to pay publishers twice! Once to make UK research open access whilst still retaining subscription access to the non-UK material. It's a kind of Westminster Open Access Initiative stating that an </span><span style="text-align: center;"><span style="font-size: small;">old tradition of scholarly publishing and a new technology of the Web have converged to make possible an unprecedented injection of public cash <i>for publishers</i>. </span></span></div>
<div style="text-align: left;">
<span style="text-align: center;"><span style="font-size: small;"><br /></span></span></div>
<div style="text-align: left;">
<span style="font-size: small;">The only reasonable way forward is for researchers to take the initiative, and to show the kind of academic leadership that Professors Hey and Harnad demonstrated a decade ago - to start being proactive in their own scholarly communications. The easiest way to do that is to start using the existing repository infrastructure provided by their universities and supported by their libraries. </span><br />
<span style="font-size: small;"><br /></span>
<span style="font-size: small;">Researchers already hold all the cards, they don't need to be held to ransom in this Finchian standoff. They are the producers and consumers and quality control agents that create every aspect of the literature, they are also the community that defines its own criteria for professional advancement and assessment. Everything they think that they depend on the publishing industry for, they can actually achieve for themselves.</span></div>
<div>
<span style="font-size: small; text-align: -webkit-auto;"><br /></span></div>
</td></tr>
</tbody></table>
Leslie Carrhttp://www.blogger.com/profile/16951479417243623642noreply@blogger.com5tag:blogger.com,1999:blog-3746807651848493410.post-80266324031848608832012-11-29T11:04:00.003+00:002012-11-29T11:04:52.806+00:00Repository Twitter Training<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEi6a0KHt2nTNEoE86Fzrx1_p3j4GsG9O7Whv08qWEe68HoJHV4SId6S8NfB94cnmBLz_p4-4Q0im4mL8FSg_13Ni67QtZhH7y_LDfMdYP9HDZ1kPU7NqWeat371F6GHkua-y67IBTOR9Is/s1600/eprintstweetstream.png" imageanchor="1" style="clear: right; float: right; margin-bottom: 1em; margin-left: 1em;"><img border="0" height="320" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEi6a0KHt2nTNEoE86Fzrx1_p3j4GsG9O7Whv08qWEe68HoJHV4SId6S8NfB94cnmBLz_p4-4Q0im4mL8FSg_13Ni67QtZhH7y_LDfMdYP9HDZ1kPU7NqWeat371F6GHkua-y67IBTOR9Is/s320/eprintstweetstream.png" width="155" /></a></div>
In a previous post I reported on using <a href="http://repositoryman.blogspot.co.uk/2011/10/using-eprints-repositories-to-collect.html" target="_blank">EPrints to gather data from Twitter</a> in order to support researchers in the social sciences, particularly those looking for evidence of social processes or for the impact of the Web on society. The work was also reported at OR2012 in Edinburgh in a paper <i><a href="http://eprints.soton.ac.uk/342973/" target="_blank">Microblogging Macrochallenges for Repositories</a></i> that described the work involved in adapting EPrints to support this task.<br />
<br />
Having got some more experience from running a pilot service at Southampton, we would like to invite anyone from the repository community who is interested in this work to join in a training session at the University on Tuesday 11th December from 1-3pm (buffet lunch included).<br />
<br />
The first hour will focus on using the service: how to harvest twitter streams, how to monitor the harvesting process, how use the repository tools to analyse the collection of tweets, how to export the data to other visualisation and analysis services and how to deposit the analysed data in an institutional repository.<br />
<br />
The second hour will discuss the management of the service itself: how to install twitter-harvesting functionality using the EPrints Bazaar, how manage the functionality, how to integrate it with your institutions other repository services and consideration for the licensing and ethical restrictions on gathering and using Twitter data.<br />
<br />
If you are interested in attending or finding out more information, please email me, <a href="mailto:lac@ecs.soton.ac.uk?subject=EPrints%20Twitter%20Training">lac@ecs.soton.ac.uk</a>.<br />
<br />
<br />Leslie Carrhttp://www.blogger.com/profile/16951479417243623642noreply@blogger.com1tag:blogger.com,1999:blog-3746807651848493410.post-64067705658421234972012-11-12T14:16:00.002+00:002012-11-12T14:16:53.391+00:00Repositories, Theses and Graduation CeremoniesI was attending my son's graduation ceremony at Bournemouth University last week. While waiting for his turn, the title of a graduating student's PhD thesis was read out. It caught my attention (it was about TV production on Dr Who) and so I slipped out my iPhone, googled the student's surname, a word from the title and the name of the university and found the thesis available in the Bournemouth Institutional Repository (first result). I was able to download and start skimreading the PDF before the student had returned to his seat .<br />
<br />
It's difficult to express what a genuinely exciting experience this was - it felt like I had arrived in the future!
This is a repository use case that I had never thought of, and everything just worked.<br />
<br />
Congratulations to Bournemouth's repository team on the hard work they have put in to making the experience join up.
Also, congrats to <a href="http://eprints.bournemouth.ac.uk/20444/" target="_blank">Andrew Ireland on a really interesting thesis!</a><br />
<br />
PS Universities really should consider letting graduation audiences see some of the really impressive work that their students have done. Perhaps an onstage projection of a poster from their final dissertation while they walk across the stage? Leslie Carrhttp://www.blogger.com/profile/16951479417243623642noreply@blogger.com0tag:blogger.com,1999:blog-3746807651848493410.post-52607769998806859152012-07-20T09:03:00.002+01:002012-07-22T11:05:57.841+01:00Changing Lightbulbs<div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;">
Some more reflections on the road(s) to Open Access...</div>
<div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;">
<br /></div>
<div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;">
Q: How many publishers does it take to change a lightbulb?</div>
A: The lightbulb doesn't need changing because everyone has bought torches.<br />
<br />
<div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;">
Q: How many funders does it take to change a lightbulb?</div>
A: One to run a community lightbulb changing programme, and another to bulk purchase torches.<br />
<br />
<div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;">
Q: How many librarians does it take to change a lightbulb?</div>
<div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;">
A: About 0.25FTE, but the lightbulb has to have a CC-BY license.</div>Leslie Carrhttp://www.blogger.com/profile/16951479417243623642noreply@blogger.com0tag:blogger.com,1999:blog-3746807651848493410.post-34099883911684223402012-07-19T11:01:00.004+01:002012-07-19T11:01:58.864+01:00Open Access Joke. Spoiler: not funny at all<div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;">
Q: How many Finch committee members does it take to change a lightbulb?</div>
<div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;">
A: The lightbulb doesn't need to be changed, it just needs a large injection of public funds to transition it to a more illuminating condition.</div>
<br />
One of the Finch committee members <a href="http://www.timeshighereducation.co.uk/story.asp?storycode=420628" target="_blank">has gone public on the tricky balancing act </a>that the committee tried to maintain. In his words "Green was unacceptable to funders unless learned societies and publishers were willing to allow it". In my words, the committee was structured so that publishers' interests trumped all other considerations.<br />
<br />
<br />Leslie Carrhttp://www.blogger.com/profile/16951479417243623642noreply@blogger.com0tag:blogger.com,1999:blog-3746807651848493410.post-64109357642424875212012-07-18T08:28:00.001+01:002012-07-18T08:37:30.942+01:00Gold Finch and Green Open AccessThe UK's Finch Recommendations on Open Access, much of which look suspiciously like a blank cheque that the research sector has to write to one of its support industries, has stirred a lot of debate. Still, the government has supported it, and RCUK has been careful to publicly support it even while <a href="http://openaccess.eprints.org/index.php?/archives/913-RCUK-EC-Didnt-Follow-FinchWillets,-They-Rejected-it,-Promptly-and-Prominently.html" target="_blank">ensuring that it doesn't interfere too much with its current policy of open access mandates</a>. But while I'm frustrated at the Finch recommendations and relieved that they haven't stopped the funding councils support for the UK's rich open access repositories infrastructure, I do think there might be some positive outcomes for OA.<br />
<br />
Let's not lose sight of the fact that the Open Access proposition is very simple, but quite radical:<br />
<ul>
<li>Universities are disruptive communities - they create new knowledge and transfer it to society through teaching, training and all kinds of impact mechanisms.</li>
<li>The Web is a disruptive technology - it drastically reduces the difficulty of sharing knowledge between multiple parties, across the world.</li>
<div>
</div>
<li>Open Access is a disruptive idea - it rebuilds universities' research communications on the Web's more efficient communications platform.</li>
</ul>
<div>
The context in which Open Access operates is less simple. Scholarly communication is a complex network of stakeholders whose principle output is "The Scientific Literature"and whose major outcome is "The Progression of Scientific and Scholarly Knowledge". But each stakeholder participant in this network is driven by other outputs and outcomes: individual researchers have careers to develop and families to feed; universities have reputation to develop and sustainability to ensure; publishing companies have profits to increase and shareholders to benefit; research funders have governments to impress; governments have lobbyists and voters to satisfy and industries to benefit. The meshing of these diverse motivations into a stable network of 'players' that produce such a lasting and valuable resource is tribute to the decades of investment into the bigger picture of scientific progress by all parties. The astonishing thing about scientific publishing is not that it has been done well, but that it has been done at all.</div>
<div>
<br /></div>
<div>
The Open Access idea is particularly welcomed by those who see the stresses in the network threatening its viability or choking its productivity. On the other hand, where Open Access practice is actually adopted, it is by those researchers who see it as an effective route to getting their job done regardless of the "complex network of stakeholders". In other words, open access flourishes in disruptive communities who adopt new practices to improve their own capabilities, regardless of the consequences. Disruptive technologies aren't disruptive just because they exist, but because they are adopted, used and gradually mainstreamed. The network works around this disruption - new players emerge, new practices are fashioned, new relationships are formed, new contracts are negotiated - and an improved network results that is better fit to the current conditions.</div>
<div>
<br /></div>
<div>
<div>
<div>
<div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;">
Willett's strong words directed to publishers at the recent Publishers' Association indicate that the government really has adopted the Open Access ideal and is not taking many prisoners along the way:</div>
</div>
</div>
<blockquote class="tr_bq">
<div style="margin-bottom: 0px; margin-left: 0px; margin-right: 0px; margin-top: 0px;">
<i>Provided we all recognise that open access is on its way, we can then work together to ensure that the valuable functions you carry out continue to be properly funded</i> </div>
</blockquote>
<div>
</div>
The role of the Finch recommendations is to coerce the current research publishing players into accepting that Open Access is a reality that they must adopt by offering them a lifeline that allows them a chance of transitioning to the realities of a new Open Access publishing network.<br />
<br />
Many of us think that this is pointless because we believe that the new network needs leaner, more efficient participants rather than the same old players. But the effect of the Finch lifeline may be a radical restructuring of the network, as Chris Keene (EPrints repository manager at Sussex) has pointed out in discussions on the UKCoRR mailing list. Payment of the APC (article processing charge) changes the relationship between publishers and researchers.<br />
<br /></div>
<div>
<div>
So although Finch's proposal may seem retrograde, superfluous and overly generous to the publishing industry, it does lead publishers by the nose to a much more exposed position. Now they have to deal with every author of every research paper and justify their costs on a much greater scale. Previously cost negotiations have been handled once per year per institution, and then with the library as an intermediary. Now they have to deal with angry and cash-strapped researchers on a daily basis - those that lived by the market will probably die by the market in a thousand hand-to-hand combats.</div>
</div>
<div>
<br /></div>
<div>
In the meantime, quite unlauded by Dame Finch, the UK has a robust infrastructure that actually delivers Open Access through an excellent network of institutional repositories together with training and advocacy programmes from each University library, all underpinned by a decade of technology R&D, policy development and professional practice funded by JISC. Finch doesn't predict a smooth transition to publisher-led Open Access, and the research community's response seems to back her predictions up. But the RCUK response shows what the UK is actually really good at - pragmatism - and likely means an increased role for repositories and the emergence of a more balanced and thoroughly hybrid environment as the network of stakeholders all seek to come to a new equilibrium.<br />
<br />
<br />
<br /></div>Leslie Carrhttp://www.blogger.com/profile/16951479417243623642noreply@blogger.com1tag:blogger.com,1999:blog-3746807651848493410.post-26919417068720415902012-04-03T15:37:00.000+01:002012-04-03T15:37:07.756+01:00Soton Labs: Embedded Repository ExperimentationWe are just in the second stage of the transition for the ECS repository - all the data has been copied across to the main Southampton Institutional Repository, all the ECS repository URLs now redirect there as well, and we are in the middle of data reconciliation and de-duplication. This is very exciting, because the university finally has a single OA research service, with all stakeholders pulling in the same direction and providing a unified view of the university's research output for business, research, education and administration purposes. Huge thanks to Wendy White, Simon de Montfalcon and the rest of the library team, as well as Tim Miles-Board, Tim Brody and the rest of the EPrints Services team for making the whole venture run so smoothly!<br />
<br />
Even more exciting for us is the fact that we now about to set up a new programme of repository activity called "Soton Labs". Inspired by the idea of Google Labs, it is an institutional space for experimentation and innovation around research information systems, and EPrints will form its backbone. Driven by the needs of the research staff, it will be informed by a whole range experience and ideas (many gathered from research council and JISC projects) that can be offered to staff on the famous "permanent beta" experimental basis until they are ripe for integration into the main (business critical) repository. Unlike the ECS repository which was focused on a single department's needs, Soton Labs will have a broader brief, to deliver cutting edge services and to facilitate new improved practice for early adopters throughout the whole institution.<br />
<br />
I've got a shortlist of tasks that we hope to address in the coming months:<br />
<br />
<ul>
<li>live collection of research data</li>
<li>simple metadata schemas for research data archiving</li>
<li>collections of documentation around research proposals (bids, reviews, responses)</li>
<li>research projects</li>
<li>linked data.</li>
</ul>
<br />
So you can see that rather than reducing the repository activity in Southampton by halving the number of installations, we're stepping up the pace of repository development.Leslie Carrhttp://www.blogger.com/profile/16951479417243623642noreply@blogger.com0tag:blogger.com,1999:blog-3746807651848493410.post-37518733165892866682012-03-13T18:56:00.002+00:002012-03-13T18:57:47.170+00:00Lunch Talking at SPARC 2012In the lunch break at SPARC 2012 today our table was discussing the negotiation of author rights for repository deposits. In lamenting how authors tend to be backed into a corner by the publisher's last-minute demands to sign the copyright transfer form (or else forfeit their publication opportunity), a delicious and subversive idea arose. I present it for you here, without any claim of endorsement by SPARC or my lunchtime companions.<br />
<div>
<br /></div>
<div>
<blockquote class="tr_bq">
<i>PLOS NULL: the high profile, high impact journal that publishes articles that have been peer reviewed, accepted and corrected for publication by third party journals whose lawyers have then refused to agree the author's pro-repository copyright transfer amendment.</i></blockquote>
</div>Leslie Carrhttp://www.blogger.com/profile/16951479417243623642noreply@blogger.com2tag:blogger.com,1999:blog-3746807651848493410.post-11440301480439152182012-03-12T14:01:00.004+00:002012-03-12T14:02:31.919+00:00Value Transactions and The Publishing Business ModelI'm at the <a href="http://www.arl.org/sparc/meetings/oa12/" target="_blank">SPARC2012 Open Access</a> conference, and all this talk about Open Access is reminding me that the issue of scholarly publishing is actually very straightforward.<br />
<br />
Publishing companies have a very simple business model - they take authors' articles, add value and charge for that value. You can see this process illustrated in the diagram below, with the various stages in publishing an article broken out between the different parties, and each transaction explicitly labelled with its typical financial charges and legal agreements.<br />
<br />
<div class="separator" style="clear: both; text-align: center;">
<a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiPjvXJGT7o8_YFSNayoHEO7MuSk8oywM4jYi-A1_NlgGoMZl1bNUlQ9UGNcIQuTj9O9xbL91JmzEizDqVbozke8A0G-wVSrmVuNyU183avl2oiMyUO3F-Ddn9n-IzOxuOVeUIBtklADWM/s1600/publishing.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"><img border="0" height="171" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiPjvXJGT7o8_YFSNayoHEO7MuSk8oywM4jYi-A1_NlgGoMZl1bNUlQ9UGNcIQuTj9O9xbL91JmzEizDqVbozke8A0G-wVSrmVuNyU183avl2oiMyUO3F-Ddn9n-IzOxuOVeUIBtklADWM/s400/publishing.png" width="400" /></a></div>
<br />
A decade on from the original Budapest Open Access Initiative and here we are in Kansas City just about to start discussing more of the nuances and implications of this obvious publishing model.<br />
<span id="goog_647134530"></span><span id="goog_647134531"></span>Leslie Carrhttp://www.blogger.com/profile/16951479417243623642noreply@blogger.com1tag:blogger.com,1999:blog-3746807651848493410.post-34672756901254609832012-01-05T12:08:00.002+00:002012-01-05T12:13:30.520+00:00Mendeley Open Access UpdateIn the last six months since I analysed <a href="http://repositoryman.blogspot.com/2011/06/mendeley-download-vs-upload-growth.html">Mendeley's contribution to Computer Science OA in June 2011</a>, they appear to have increased their membership of that community by 37% and the ratio of full text documents to community members has increased from 0.66 to 0.71. The number of OA documents has increased by 47% to 11,757 and the number of OA active users (i.e. users who have made at least one document public through Mendeley's servers) has risen by 46% to 2,441 but still represents only 15% of the total membership of that community.<br />
<br />
Congratulations to Mendeley - their service is obviously rising in popularity and hence in significance to the community. OA analysts will note that the increase in open access documents comes from increased membership, rather than a change in behaviour of the community.<br />
<br />
<br />
<br />Leslie Carrhttp://www.blogger.com/profile/16951479417243623642noreply@blogger.com3tag:blogger.com,1999:blog-3746807651848493410.post-81142833838640277552011-10-26T02:21:00.001+01:002012-01-04T12:45:20.034+00:00Rethinking the Open Access AgendaI used to be a perfectly good computer scientist, but now I've been ruined by sociologists. Or at least that is what Professor Catherine Pope (the Marxist feminist health scientist who co-directs the <a href="http://dtc.webscience.ecs.soton.ac.uk/">Web Science Doctoral Training Centre</a> with me) says. I am now as likely to quote Bruno Latour as Donald Knuth, and when I examine "the web" instead of a linked graph of HTML nodes I increasingly see a complex network of human activity loosely synchronised by a common need for HTTP interactions.<br />
<br />
All of which serves as a kind of explanation of why I have come to think that we need to revisit the Budapest Open Access Initiative's obsession with information technology:<br />
<blockquote>
An old tradition and <b>a new technology </b>have converged to make possible an unprecedented public good. The old tradition is the willingness of scientists and scholars to publish the fruits of their research in scholarly journals without payment, for the sake of inquiry and knowledge. <b>The new technology is the internet.</b> The public good they make possible is the world-wide electronic distribution of the peer-reviewed journal literature and <b><i>completely free and unrestricted access</i></b> to it by all scientists, scholars, teachers, students, and other curious minds. <b><i>Removing access barriers</i></b><b><i> to this literature</i></b> will accelerate research, enrich education, share the learning of the rich with the poor and the poor with the rich, make this literature as useful as it can be, and lay the foundation for uniting humanity in a common intellectual conversation and quest for knowledge. <i>(see <a href="http://www.soros.org/openaccess/read">http://www.soros.org/openaccess/read</a>)</i></blockquote>
BOAI promises that the "new technology" of the Internet (actually the Web) will transform our relationship to knowledge. But that was also one of the promises of the electric telegraph a century ago<br />
<blockquote>
From the telegraph's earliest days, accounts of it had predicted "great social benefits": diffused knowledge, collective amity, even the prevention of crimes. (<i>Telegraphic realism: Victorian fiction and other information systems</i> by Richard Menke.)</blockquote>
There has been much good and effective work to support OA from both technical and policy perspectives - Southampton's part includes the development of the <a href="http://www.eprints.org/">EPrints repository platform</a> as well as the <a href="http://roar.eprints.org/">ROAR OA monitoring service</a> - but critics still point to a disappointing amount of fruit from our efforts. Repositories multiply and green open access (self-deposited) material increases; knowledge about (and support for) OA has spread through academic management, funders and politicians, but it has not yet become a mainstream activity of researchers themselves. And now, a decade into the Open Access agenda, we are grasping the opportunity to replay all our missteps and mistakes in the pursuit of Open Data.<br />
<br />
I am beginning to wonder whether by defining open access as a phenomenon of scholarly communication, we mistakenly created from the outset an alien and unimportant concept for the scientists and scholars who long ago outsourced the publication process to a support industry. As a consequence, OA has been best understood by (or most discussed by) the practitioners of scholarly and scientific communication - librarians and publishers - rather than by the practitioners of scholarship and science.<br />
<br />
We have seen that the challenge of the Web can't be neatly limited to dissemination practices. In calling for researchers open the outputs of their research, we inevitably argue with researchers to reconsider the relationship that they have with their own work, their immediate colleagues, their academic communities, their institutions, funders and their public. It turns out that we haven't been able to divorce the output of research from the conduct and the context of research activity. Let's move on from there.<br />
<br />
In a recent paper <a href="http://www.jcheminf.com/content/3/1/36">Openness as infrastructure</a>, John Wilbanks discussed the three missing components of an open infrastructure for science: the infrastructure to collaborate scientifically and produce data, the technical infrastructure to classify data and the legal infrastructure to share data - extending the technical infrastructure with a legal framework. I think that we need to go further and refocus our efforts and our rhetoric about "Open Access to Scientific Information" towards "Open Activity by Scientists" supported by three kinds of infrastructure:<br />
<ol>
<li>Human Engagement</li>
<li>Methodological Analysis and</li>
<li>Social Trust.</li>
</ol>
The aim of open access to scientific outputs and outcomes will not occur until scientific practitioners see the benefit of the scientific commons, not as an anonymous dumping ground for information that can be accessed by all and sundry, but as a field of engagement that offers richer possibilities for their research and their professional activities. To realise that, scientists need more than email and Skype to work together, more than Google to aggregate their efforts and more than a copyright disclaimer to negotiate and mediate the trust relationships that make the <i>openness</i> that OA promises a safe and attractive, and hence realistic, proposition.<br />
<br />
What I'm saying isn't new - there has been lots of effort and discussion about improving the benefits of repository technology to the end user/researcher, and about lowering the barriers of use. <a href="http://www.jisc.ac.uk/whatwedo/programmes/inf11/jiscdepo.aspx">JISC have funded a number of projects in its Deposit programme</a>, trying various strategies to increase user engagement with OA. As well as continuing to pursue this approach, we also need to step back from obsessing about the technology of information delivery, think bigger thoughts about scientific people and scientific practice and tell a bigger and more relevant story.Leslie Carrhttp://www.blogger.com/profile/16951479417243623642noreply@blogger.com4tag:blogger.com,1999:blog-3746807651848493410.post-65732589742775887352011-10-09T15:05:00.003+01:002011-10-09T15:28:33.359+01:00Using EPrints Repositories to Collect Twitter DataA number of our Web Science students are doing work analysing people's use of Twitter, and the tools available for them to do so are rather limited since Twitter changed the terms of their service so that the functionality of TwapperKeeper and similar sites has been reduced. There are personal tools like NodeXL (a plugin for Microsoft Excel running under Windows) that do provide simple data capture from social networks, but a study will require long-term data collection over many months that is independent of reboots and power outages.<br />
<br />
They say that to a man with a hammer, the solution to every problem looks like a nail. And so perhaps it its unsurprising that I see a role for EPrints in helping students and researchers to gather, as well as curate and preserve, their research data. Especially when the data gathering requires a managed, long-term process that results in a large dataset.<br />
<br />
<table cellpadding="0" cellspacing="0" class="tr-caption-container" style="float: left; text-align: right;"><tbody>
<tr><td style="text-align: center;"><a href="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEi6a0KHt2nTNEoE86Fzrx1_p3j4GsG9O7Whv08qWEe68HoJHV4SId6S8NfB94cnmBLz_p4-4Q0im4mL8FSg_13Ni67QtZhH7y_LDfMdYP9HDZ1kPU7NqWeat371F6GHkua-y67IBTOR9Is/s1600/eprintstweetstream.png" imageanchor="1" style="margin-left: auto; margin-right: auto;"><img border="0" height="320" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEi6a0KHt2nTNEoE86Fzrx1_p3j4GsG9O7Whv08qWEe68HoJHV4SId6S8NfB94cnmBLz_p4-4Q0im4mL8FSg_13Ni67QtZhH7y_LDfMdYP9HDZ1kPU7NqWeat371F6GHkua-y67IBTOR9Is/s320/eprintstweetstream.png" width="155" /></a></td></tr>
<tr><td class="tr-caption" style="text-align: center;"><b>EPrints Twitter Dataset,<br />
Rendered in HTML</b></td></tr>
</tbody></table>In collecting large, ephemeral data sets (tweets, Facebook updates, Youtube uploads, Flickr photos, postings on email forums, comments on web pages) a repository has a choice between:<br />
<br />
(1) simply collecting the raw data, uninterpreted and requiring the user to analyse the material with their own programs in their own environments<br />
<br />
(2) partially interpreting the results and providing some added value for the user by offering intelligent searches, analyses and visualisations to help the researchers get a feel for the data.<br />
<br />
We experimented with both approaches. The first sounds simple and more appropriate (don't make the repository get in the way!), but in the end the job of handling, storing and providing a usable interface to the collection of temporal data means that some interpretation of the data is inevitable.<br />
<br />
So instead of just constantly appending a stream of structured data objects (tweets, emails, whatever) to an external storage object (a file, database or cloud bucket) we ingest each object into an internal eprints dataset with appropriate schema. There is a tweet dataset for individual tweets, and a timeline data set for collections of tweets - in theory multiple timeline datasets will refer to the same objects in the tweet dataset. These datasets can be manipulated by the normal EPrints API and managed by the normal EPrints repository tools: you can search, export and render tweets in the same way that you can for eprints, documents, projects and users.<br />
<br />
EPrints collects Twitter data by regular calls to the Twitter API, using the search parameters given by the user. The figure on the left shows the results of a data collection (on the hashtag "drwho") resulting in a single twitter timeline that is rendered as HTML for the Manage Records page. In this rendering, the timeline of tweets is shown as normal on the left of the window, with lists of top tweeters, top mentions, top hashtags and top links together with a histogram of tweet frequency on the right. These simple additions serve to give an overview of the data to the researcher - not to try to take the place of their bespoke data analysis software, but simply to help understand some of the major features of the data <i>as it is being collected</i>. The data can be exported in various formats (JSON, XML, HTML and CSV) for subsequent processing and analysis. The results of this analysis can themselves be ingested into EPrints for preservation and dissemination, along with the eventual research papers that describe the activity.<br />
<br />
All this functionality will soon be released as an EPrints Bazaar package; as of the time of writing we are about to release it for testing by our graduate students. The infrastructure that we have created will then be adapted for other Web temporal data capture sources as mentioned above (Flickr, YouTube, etc).<br />
<div class="separator" style="clear: both; text-align: center;"></div>Leslie Carrhttp://www.blogger.com/profile/16951479417243623642noreply@blogger.com1tag:blogger.com,1999:blog-3746807651848493410.post-278829672750624592011-06-26T17:44:00.001+01:002011-06-26T17:44:44.927+01:00Mendeley: Measuring OA ratesHaving talked about Mendeley's OA deposit rates <a href="http://repositoryman.blogspot.com/2011/06/mendeley-download-vs-upload-growth.html">in my last blog post</a>, I thought it worthwhile to check how representative my chosen discipline (Computer Science) was. Rather than download the entire community for each other discipline, I have performed a quick and dirty sample of some of the available literature in each discipline using the search function. Each Mendeley search result offers the option of saving the PDF (if available) to your library, so it is a simple matter to <span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;">wget</span> some search results and <span class="Apple-style-span" style="font-family: 'Courier New', Courier, monospace;">grep</span> for PDFs.<br />
<br />
The table below shows the results of this procedure for 11 disciplines (two illustrative keywords each). The "available PDFs" column records the number of PDFs <i>offered on the first page of the search results </i>(each page contains 200 results); the total number of results shows the relative coverage of the topic in Mendeley.<br />
<br />
Computer Science appears to be in the 5-10% range of OA (18 or 11 PDFs out of a page of 200 results) which does seem to be just about average. Social Science, Medicine, Health Science, Economics and the Humanities appear to have fewer PDFs and Maths and Physics appear to have rather more.<br />
<br />
<table border="0" cellpadding="0" cellspacing="0" class="MsoNormalTable" style="border-collapse: collapse; margin-left: 4.65pt; width: 320px;"><tbody>
<tr style="height: 15pt;"><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><b><span style="color: black; font-family: Calibri;">Search term<o:p></o:p></span></b></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><b><span style="color: black; font-family: Calibri;">Discipline<o:p></o:p></span></b></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><b><span style="color: black; font-family: Calibri;">Available PDFs<o:p></o:p></span></b></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><b><span style="color: black; font-family: Calibri;">Total Results<o:p></o:p></span></b></div></td></tr>
<tr style="height: 15pt;"><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span class="GramE"><span style="color: black; font-family: Calibri;">chromatography</span></span><span style="color: black; font-family: Calibri;"><o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span class="SpellE"><span style="color: black; font-family: Calibri;">Chem</span></span><span style="color: black; font-family: Calibri;"><o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">10<o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">14260<o:p></o:p></span></div></td></tr>
<tr style="height: 15pt;"><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span class="GramE"><span style="color: black; font-family: Calibri;">crystallography</span></span><span style="color: black; font-family: Calibri;"><o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span class="SpellE"><span style="color: black; font-family: Calibri;">Chem</span></span><span style="color: black; font-family: Calibri;"><o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">27<o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">4921<o:p></o:p></span></div></td></tr>
<tr style="height: 15pt;"><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span style="color: black; font-family: Calibri;">JAVA<o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span style="color: black; font-family: Calibri;">CS<o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">18<o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">848<o:p></o:p></span></div></td></tr>
<tr style="height: 15pt;"><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span class="GramE"><span style="color: black; font-family: Calibri;">software</span></span><span style="color: black; font-family: Calibri;"><o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span style="color: black; font-family: Calibri;">CS<o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">11<o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">15185<o:p></o:p></span></div></td></tr>
<tr style="height: 15pt;"><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span class="GramE"><span style="color: black; font-family: Calibri;">geology</span></span><span style="color: black; font-family: Calibri;"><o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span style="color: black; font-family: Calibri;">Earth<o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">36<o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">4180<o:p></o:p></span></div></td></tr>
<tr style="height: 15pt;"><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span class="GramE"><span style="color: black; font-family: Calibri;">hydrodynamic</span></span><span style="color: black; font-family: Calibri;"><o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span style="color: black; font-family: Calibri;">Earth<o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">40<o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">2853<o:p></o:p></span></div></td></tr>
<tr style="height: 15pt;"><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span class="GramE"><span style="color: black; font-family: Calibri;">econometrics</span></span><span style="color: black; font-family: Calibri;"><o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span class="SpellE"><span style="color: black; font-family: Calibri;">Economics</span></span><span style="color: black; font-family: Calibri;"><o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">13<o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">565<o:p></o:p></span></div></td></tr>
<tr style="height: 15pt;"><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span class="GramE"><span style="color: black; font-family: Calibri;">microeconomics</span></span><span style="color: black; font-family: Calibri;"><o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span class="SpellE"><span style="color: black; font-family: Calibri;">Economics</span></span><span style="color: black; font-family: Calibri;"><o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">5<o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">88<o:p></o:p></span></div></td></tr>
<tr style="height: 15pt;"><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span class="GramE"><span style="color: black; font-family: Calibri;">biodiversity</span></span><span style="color: black; font-family: Calibri;"><o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span class="SpellE"><span style="color: black; font-family: Calibri;">Env</span></span><span style="color: black; font-family: Calibri;"><o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">14<o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">4668<o:p></o:p></span></div></td></tr>
<tr style="height: 15pt;"><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span class="GramE"><span style="color: black; font-family: Calibri;">climate</span></span><span style="color: black; font-family: Calibri;"><o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span class="SpellE"><span style="color: black; font-family: Calibri;">Env</span></span><span style="color: black; font-family: Calibri;"><o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">14<o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">13003<o:p></o:p></span></div></td></tr>
<tr style="height: 15pt;"><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span class="GramE"><span style="color: black; font-family: Calibri;">nursing</span></span><span style="color: black; font-family: Calibri;"><o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span style="color: black; font-family: Calibri;">Health<o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">6<o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">10723<o:p></o:p></span></div></td></tr>
<tr style="height: 15pt;"><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span class="GramE"><span style="color: black; font-family: Calibri;">palliative</span></span><span style="color: black; font-family: Calibri;"><o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span style="color: black; font-family: Calibri;">Health<o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">6<o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">1978<o:p></o:p></span></div></td></tr>
<tr style="height: 15pt;"><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span class="GramE"><span style="color: black; font-family: Calibri;">archaeology</span></span><span style="color: black; font-family: Calibri;"><o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span style="color: black; font-family: Calibri;">Hum<o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">6<o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">1730<o:p></o:p></span></div></td></tr>
<tr style="height: 15pt;"><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span style="color: black; font-family: Calibri;">Foucault<o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span style="color: black; font-family: Calibri;">Hum<o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">11<o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">248<o:p></o:p></span></div></td></tr>
<tr style="height: 15pt;"><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span class="GramE"><span style="color: black; font-family: Calibri;">algebra</span></span><span style="color: black; font-family: Calibri;"><o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span style="color: black; font-family: Calibri;">Math<o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">101<o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">4424<o:p></o:p></span></div></td></tr>
<tr style="height: 15pt;"><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span class="SpellE"><span class="GramE"><span style="color: black; font-family: Calibri;">cohomology</span></span></span><span style="color: black; font-family: Calibri;"><o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span style="color: black; font-family: Calibri;">Math<o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">171<o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">525<o:p></o:p></span></div></td></tr>
<tr style="height: 15pt;"><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span class="GramE"><span style="color: black; font-family: Calibri;">cancer</span></span><span style="color: black; font-family: Calibri;"><o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span style="color: black; font-family: Calibri;">Med<o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">11<o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">52315<o:p></o:p></span></div></td></tr>
<tr style="height: 15pt;"><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span class="GramE"><span style="color: black; font-family: Calibri;">pharmacology</span></span><span style="color: black; font-family: Calibri;"><o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span style="color: black; font-family: Calibri;">Med<o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">4<o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">62285<o:p></o:p></span></div></td></tr>
<tr style="height: 15pt;"><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span class="GramE"><span style="color: black; font-family: Calibri;">quasar</span></span><span style="color: black; font-family: Calibri;"><o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span class="SpellE"><span style="color: black; font-family: Calibri;">Phys</span></span><span style="color: black; font-family: Calibri;"><o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">127<o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">556<o:p></o:p></span></div></td></tr>
<tr style="height: 15pt;"><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span class="GramE"><span style="color: black; font-family: Calibri;">telescope</span></span><span style="color: black; font-family: Calibri;"><o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span class="SpellE"><span style="color: black; font-family: Calibri;">Phys</span></span><span style="color: black; font-family: Calibri;"><o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">101<o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">2347<o:p></o:p></span></div></td></tr>
<tr style="height: 15pt;"><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span class="GramE"><span style="color: black; font-family: Calibri;">cognition</span></span><span style="color: black; font-family: Calibri;"><o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span class="SpellE"><span style="color: black; font-family: Calibri;">Psy</span></span><span style="color: black; font-family: Calibri;"><o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">11<o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">18805<o:p></o:p></span></div></td></tr>
<tr style="height: 15pt;"><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span class="GramE"><span style="color: black; font-family: Calibri;">schizophrenia</span></span><span style="color: black; font-family: Calibri;"><o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span class="SpellE"><span style="color: black; font-family: Calibri;">Psy</span></span><span style="color: black; font-family: Calibri;"><o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">17<o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">4055<o:p></o:p></span></div></td></tr>
<tr style="height: 15pt;"><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span class="GramE"><span style="color: black; font-family: Calibri;">criminology</span></span><span style="color: black; font-family: Calibri;"><o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span class="SpellE"><span style="color: black; font-family: Calibri;">SocSci</span></span><span style="color: black; font-family: Calibri;"><o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">2<o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">154<o:p></o:p></span></div></td></tr>
<tr style="height: 15pt;"><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span class="GramE"><span style="color: black; font-family: Calibri;">sociology</span></span><span style="color: black; font-family: Calibri;"><o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 95pt;" valign="bottom" width="95"><div class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm;"><span class="SpellE"><span style="color: black; font-family: Calibri;">SocSci</span></span><span style="color: black; font-family: Calibri;"><o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">2<o:p></o:p></span></div></td><td nowrap="" style="height: 15pt; padding-bottom: 0cm; padding-left: 5.4pt; padding-right: 5.4pt; padding-top: 0cm; width: 65pt;" valign="bottom" width="65"><div align="right" class="MsoNormal" style="font-family: Cambria; font-size: 12pt; margin-bottom: 0.0001pt; margin-left: 0cm; margin-right: 0cm; margin-top: 0cm; text-align: right;"><span style="color: black; font-family: Calibri;">2005</span></div></td></tr>
</tbody></table>Leslie Carrhttp://www.blogger.com/profile/16951479417243623642noreply@blogger.com2tag:blogger.com,1999:blog-3746807651848493410.post-79576233242865416152011-06-26T13:31:00.000+01:002012-01-04T12:09:29.695+00:00Mendeley: Download vs Upload GrowthThere was a lot of talk about Mendeley at <a href="http://indico.cern.ch/event/oai7">OAI7 in Geneva</a>, especially the news that in the first quarter of 2011 the number of articles downloaded for free jumped from 300,000 to 800,000. That's really good news, confirming Mendeley as a successful service in the Open Access domain. Having done an analysis of Mendeley's impact on Open Access (see <a href="http://repositoryman.blogspot.com/2010/08/comparing-social-sharing-of.html">Comparing Social Sharing of Bibliographic Information with Institutional Repositories</a>) just under a year ago, I thought I'd repeat the analysis to see the extent of the impact of their growth on deposits as well as downloads.<br />
<br />
Results: the number of members of the Computer Science discipline appears to be 2.2x larger than last August (increased to 74736 from 34230.) Of these, only 12102 appear in the Computer Science directory listing, whose contents are now filtered by Mendeley according to their "profile completion"; the gross number was kindly provided for me by Steve Dennis at Mendeley. This filtering takes care of the long tail of accounts that have never been used. Of the filtered users, 1676 are "OA active", having publicly shared at least one PDF document (up 21% on last August). The total number of PDFs shared by this group is 8014, up 16% on last August with 4.8 PDFs being shared per "active OA user" (down from 5.0 last August).<br />
<br />
So a big increase in user numbers results in a small increase in publicly shared PDFs, confirming (I think) that Mendeley are not preaching to the choir, and are mainly attracting users who are not already "OA active". Users of Mendeley have clearly transitioned from "scholarly knowledge collectors" to "scholarly knowledge sharers". The challenge still remains how to change their behaviour from "scholarly asset maintainers" to "scholarly asset sharers".Leslie Carrhttp://www.blogger.com/profile/16951479417243623642noreply@blogger.com0tag:blogger.com,1999:blog-3746807651848493410.post-10977682331582555232011-04-27T14:00:00.000+01:002011-04-27T14:00:28.859+01:00Experimenting With Repository UI DesignI'm always on the lookout for engaging UI paradigms to inspire repository design, and I recently noticed that Blogger has made some new "<a href="http://www.google.com/support/blogger/bin/answer.py?hl=en&answer=1229061">dynamic views</a>" available. It provides a variety of smart presentation styles aren't a million miles away from the ones emerging on smartphone apps, combining highly visual and animated layouts.<br />
<br />
So I've imported some repository contents into Blogger to get some hands on experience, and I'd be interested in any feedback on whether this looks useful or compelling.<br />
<br />
<ul><li>The new blog is called <a href="http://mikeolection.blogspot.com/">Mike O'Lection</a> - it's a little DSpace repository joke. .</li>
<li>New views</li>
<ul><li>Sidebar: <a href="http://mikeolection.blogspot.com/view/sidebar">http://mikeolection.blogspot.com/view/sidebar</a></li>
<li>Timeslide: <a href="http://mikeolection.blogspot.com/view/timeslide">http://mikeolection.blogspot.com/view/timeslide</a></li>
<li>Mosaic: <a href="http://mikeolection.blogspot.com/view/mosaic">http://mikeolection.blogspot.com/view/mosaic</a> (very Tumblr)</li>
<li>Snapshot: <a href="http://mikeolection.blogspot.com/view/snapshot">http://mikeolection.blogspot.com/view/snapshot</a></li>
<li>Flipcard: <a href="http://mikeolection.blogspot.com/view/flipcard">http://mikeolection.blogspot.com/view/flipcard</a></li>
</ul><li>Original repository pages: <a href="http://eprints.ecs.soton.ac.uk/17386/">http://eprints.ecs.soton.ac.uk/17386/</a>, <a href="http://eprints.ecs.soton.ac.uk/21289/">http://eprints.ecs.soton.ac.uk/21289/</a>, <a href="http://eprints.ecs.soton.ac.uk/21622/">http://eprints.ecs.soton.ac.uk/21622/</a>, <a href="http://eprints.ecs.soton.ac.uk/21030/">http://eprints.ecs.soton.ac.uk/21030/</a></li>
</ul><br />
These views suit various different types of material, but the constant theme that is emerging is that a good visual is pretty much <i>de rigeur</i> for any resource. This means that relying on the thumbnail image of an article's first page is not going to be a good strategy (hint: they all look the same.) I can forsee the need to extract figures and artwork from the PDFs and Office Documents uploaded to a repository.<br />
<br />
(Over the next few days I hope to put some more examples on the blog to help get a better feel for how this will work. But I think I might make a bulk Blogger exporter for EPrints because manual cut and pasting is only enjoyable for a few minutes!)Leslie Carrhttp://www.blogger.com/profile/16951479417243623642noreply@blogger.com1tag:blogger.com,1999:blog-3746807651848493410.post-23773374912990613182011-04-26T11:13:00.004+01:002011-04-27T13:25:40.939+01:00Mobile Use of RepositoriesWhile looking at the impact of mobile devices on the development of the Web I found useful information in this March 2011 press release from web analytics company StatCounter, <a href="http://gs.statcounter.com/press/android-overtakes-blackberry-for-first-time">charting the rise of Android</a>.<br />
<blockquote>StatCounter data also pinpoints the rise and rise of mobile devices to access the Internet. The use of mobile to access the Internet compared to desktop has more than doubled worldwide from 1.72% a year ago to 4.45% today. The same trend is evident in the US with mobile Internet usage more than doubling over the past year from 2.59% to 6.32%.</blockquote>I thought I'd see whether this behavior applies equally to repositories and so I had a poke around in the usage states for eprints.ecs.soton.ac.uk and this is what I found:<br />
<ul><li>53,285 PDF downloads from 27 March 2011 (4am) - 3rd Apr 2011 (4am).</li>
<li>Of these 33,304 are attributed to crawlers and 19,981 to real browsers.</li>
<li>Only 0.93% of the browser downloads occur on mobile devices (70% iOS, 22% Android, 7% Blackberry and 1% Symbian)</li>
</ul>The use of mobiles that we are seeing for accessing research outputs in repositories is less than 1/4 of the general use of mobile Internet. An obvious reason for that is the unpalatable mixture of PDF pages and small devices, but popular applications like Mekentoshj's Papers and Mendeley for iPhone seem to indicate that an attractive mobile experience should be possible.<br />
That implies that there's another exciting opportunity for repository developers to up their game!Leslie Carrhttp://www.blogger.com/profile/16951479417243623642noreply@blogger.com1tag:blogger.com,1999:blog-3746807651848493410.post-34682206244413043532011-04-14T01:47:00.003+01:002011-04-14T02:07:05.992+01:00Faculty of 1000 Posters - Still Looking for a Silver BulletThe <a href="http://posters.f1000.com/">F1000 Open Access Poster Repository</a> was brought to my attention by a recent Tweet. I love repositories with posters in - they're copyright-lite and very visually attractive - and I've <a href="http://repositoryman.blogspot.com/2007/08/creative-uses-of-repository.html">long advocated for more use to be made of these kinds of scholarly communication</a>. With some success, I have pushed hard for the poster artwork to be made available online in all the conferences I have been involved in organising.<div><br /></div><div>The Faculty of 1000 has a special relationship with some Biomedical conferences, inviting authors to upload their posters to the open access F1000 site. Perhaps this is an effective new way of gaining open access to specific kinds of early-report research material?</div><div><br /></div><div>The F1000 posters site contains 909 posters. 649 of those are derived from 28 invited conferences (an admirable average of 23 posters per conference), and the remaining 260 posters are uploaded on an <i>ad hoc</i> basis from authors attending 148 other conferences (an average of 1.7 posters per conference).</div><div><br /></div><div>While it is clear that the invitation approach is much more effective than the <i>laissez faire</i> approach, the huge size of biomedical conferences (often displaying several thousand posters over the course of four days) means that the overall success rate of this OA strategy is only 4.2% (a figure I reached by counting the total number of posters at a sample of 7 of the 28 invited conferences).</div><div><br /></div><div>So, still no silver OA bullet!</div><div><br /></div><div><br /></div>Leslie Carrhttp://www.blogger.com/profile/16951479417243623642noreply@blogger.com1tag:blogger.com,1999:blog-3746807651848493410.post-75154989877299141272011-03-21T15:40:00.004+00:002011-03-21T18:24:29.563+00:00I Won't Review Green OA, It's Spam - I DO NOT LIKE IT Sam-I-Am<a href="http://www.timeshighereducation.co.uk/story.asp?sectioncode=26&storycode=415480">According to the Times Higher</a>, Michael Mabe (chief executive of the International Association of Scientific, Medical and Technical Publishers and a visiting professor in information science at University College London) fears that repositories are essentially "electronic buckets" with no quality control. He also expressed doubts that the academy would be able to successfully introduce peer review to such repositories, partly because it would be difficult to attract reviewers who had no "brand allegiance" to the repositories.<div><br /></div><div>Let's think about this....</div><div><br /></div><div>Q: Who are the authors of papers?</div><div>A: Researchers.</div><div><br /></div><div>Q: Who put papers in repositories?</div><div>A: The authors.</div><div><br /></div><div>Q: Who review papers?</div><div>A: The authors of other papers.</div><div><br /></div><div>Q: Where do they get papers to review?</div><div>A: From a URL provided by the journal editorial board.</div><div><br /></div><div>Q: Who are the editorial board?</div><div>A: Authors of other papers.</div><div><br /></div><div>Q: Just remind me what the publishers do?</div><div>A: Their most important job is to organise the processes that get the peer review accomplished by the other authors (see above).</div><div><br /></div><div>Q: Where does the brand value of a journal come from?</div><div>A: It's a bit complicated, but mainly from the prestige of the authors on the editorial board and the prestige of the papers that the authors write. There is a default brand that comes from the publishing company that owns the journal, but of course <i>that</i> comes recursively from the brand value of all the journals that it owns. </div><div><br /></div><div>Q: "Electronic buckets" don't sound very valuable, do they?</div><div>A: No they certainly don't - I mean, imagine the kind of material that normally ends up in a bucket! Who would want to peer-review that? But hang on - who stores stuff in buckets anyway? That's a bit of a problematic metaphor for a storage system! Try replacing "buckets" with "library shelves" and the statement becomes more accurate. What kind of material do you find on library shelves? Things that people might want to read. Things that people might want to review.</div><div><br /></div><div>Q: But how would authors know what to review in a repository without the publishing company's branding?</div><div>A: I suppose an editorial board would send them a URL.</div><div><br /></div>Leslie Carrhttp://www.blogger.com/profile/16951479417243623642noreply@blogger.com3tag:blogger.com,1999:blog-3746807651848493410.post-1444027726030671322011-03-11T18:04:00.005+00:002011-03-11T18:15:02.240+00:00You Can't Trust Everything You Read on the WebHouston, we have a problem. It turns out that trusting repositories as authoritative sources of research information is all very well and good, except when the repository is an authoritative source of demonstration (fake) documents. Sebastien Francois (one of the EPrints team at Southampton) has just reported that Google Scholar is indexing the fake documents that we make available in demoprints.eprints.org.<div><br /></div><div>So when your weaker students start citing</div><div><span class="Apple-style-span" style=" -webkit-border-horizontal-spacing: 2px; -webkit-border-vertical-spacing: 2px; font-family:Arial, sans-serif;"><span class="person_name"><span class="Apple-style-span" style="font-size:small;"></span></span></span></div><blockquote><div><span class="Apple-style-span" style=" -webkit-border-horizontal-spacing: 2px; -webkit-border-vertical-spacing: 2px; font-family:Arial, sans-serif;"><span class="person_name"><span class="Apple-style-span" style="font-size:small;">Freiwald, W.</span></span><span class="Apple-style-span" style="font-size:small;"> and </span><span class="person_name"><span class="Apple-style-span" style="font-size:small;">Bonardi, X.</span></span><span class="Apple-style-span" style="font-size:small;"> and </span><span class="person_name"><span class="Apple-style-span" style="font-size:small;">Leir, X.</span></span><span class="Apple-style-span" style="font-size:small;"> (1998) </span><em><span class="Apple-style-span" style="font-size:small;">Hellbenders in the Wild.</span></em><span class="Apple-style-span" style="font-size:small;"> Better Farming, 1 (4). pp. 91-134.</span></span></div><div></div></blockquote><div><span class="Apple-style-span" style=" -webkit-border-horizontal-spacing: 2px; -webkit-border-vertical-spacing: 2px; font-family:Arial, sans-serif;"><span class="Apple-style-span" style="font-size:small;"></span></span>you know that it's just a teensy misunderstanding, OK? But if anyone needs their citation count artificially boosting, I have a repository available to monetize.</div>Leslie Carrhttp://www.blogger.com/profile/16951479417243623642noreply@blogger.com0tag:blogger.com,1999:blog-3746807651848493410.post-9819164247498981592011-03-07T09:20:00.005+00:002011-03-07T09:45:14.367+00:00Google, Content Farms and RepositoriesIn recent news, <a href="http://www.telegraph.co.uk/technology/google/8347975/Google-changes-search-engine-to-favour-quality-content.html">Google has altered its ranking algorithms</a> to favour sites with original material rather than so-called content farms that simply redistribute material found on other sites. Although <a href="http://scobleizer.com/2011/02/26/thank-you-google/">users report satisfaction with improved results</a>, this action has caused quite a furore with some <a href="http://www.smartplanet.com/technology/blog/thinking-tech/quality-sites-also-falling-victim-to-new-googles-spam-killing-search-engine/6403/">genuine sites losing significant business</a> as well.<div><br /></div><div>I have been worried about how this would affect repositories, after all we technically fit into the definition of content farms: sites that exist to redistribute material that is published elsewhere. Bearing in mind that Google delivers the vast majority of our visitors to us, if the changes were to impact on our rankings, we might suffer quite badly. Now that there's been a couple of weeks for the changes to migrate around the planet, our usage stats point to business as usual.</div><div><br /></div><div>First of all, downloads over the last quarter - no dramatic tailoffs in the last week.</div><div><img style="cursor:pointer; cursor:hand;width: 400px; height: 240px;" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiSBn8MWwLmH2t7QRJDfkkGrzm7zFJ422BipaOX90hJnjCdzdiqWwhoi5VhOjGjaim2sTDBJsNPoXS1NcQ9vzcNsGPX3T7ThIcCG3jZClysjcfRxgqw-3WDMBMoWFGdfoVPl2SXcnTPbRk/s400/ecs_QbSyYuUyVsR974NWUGpkyg.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5581269927653298850" /></div><div><br />And a comparison with last year (apologies the different vertical scale) shows year-on-year stability.</div><div><img style="cursor:pointer; cursor:hand;width: 400px; height: 240px;" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEhGxmRcdZ_nmA5GQDniG_u959khAJ8T0X3pIBAno98D_J8q_3DwPsUKP5OYse0ZSjOsWV0GdAFaqDoo7dMysjU_FbXed9h1VcjnzzG84BXm0heUyWQ_HsZ6QGa0OtPHPFR2igDqJW-A46s/s400/ecs__wGpEDopeWTPAsif8_E_yw.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5581271324514395378" /></div><div><br /></div><div>So good news there: our repositories haven't been classed as valueless redistribution agents. That would have been a bit of a blow to our morale!</div><div><br /></div>Leslie Carrhttp://www.blogger.com/profile/16951479417243623642noreply@blogger.com1tag:blogger.com,1999:blog-3746807651848493410.post-72623256776183264942011-03-06T08:12:00.012+00:002011-03-06T10:54:40.781+00:00The Missing Sixth Star of Open Linked Data?<div style="margin-left:1em; float: right; vertical-align:text-bottom"><img style="cursor:pointer; cursor:hand; height: 300px;" src="https://blogger.googleusercontent.com/img/b/R29vZ2xl/AVvXsEiRrSA89H9LRSLgZAIzPdfWsBOhaukhQjvvqG5HKVt5O5Ml6aARpm2ngFl_ihEbYauV6ChXjOW7XmzA4QavLejolAVALYrKDyxGz1yKtu8SyfJAzChUstEagb7RUN30W0fYkG5kKDZHl0c/s400/OpenWebStars.jpg" border="0" alt="" id="BLOGGER_PHOTO_ID_5580915251028553234" /></div><div>In my previous posting I proposed the idea of the <a href="http://repositoryman.blogspot.com/2011/03/five-stars-of-open-access-aka-linked.html">5 stars of open access</a>. There is of course one feature that the original "taxonomy" misses out completely - repositories! Not just "my favourite repository platform", but the idea of persistent, curated storage. Consequently, my proposal for open access doesn't mention repositories - a bit of an oversight!</div><div><br /></div><div>At the moment, the entry level to the 5 stars is simply "put it on the web, with an open license". Perhaps we should change this to "put it in a repository with an open license"; perhaps we could designate a "zeroth star" for "just put it on the Web". However, the Linked Data Research Lab at DERI already propose a <a href="http://lab.linkeddata.deri.ie/2010/lod-badges/">no-star level</a>, which involves material being put on the web <i>without</i> an explicit license.</div><div><br /></div><div>You can get away with putting material on the Web without any concern about their future safety - but not for long, especially if you want to build services on top of that material.<br /><br />Services like CKAN (Comprehensive Knowledge Archive Network, http://ckan.net/) are registries of open knowledge packages currently favoured by the open data community. This registry is built on a simple content management environment, and by November 2010 was already returning HTTP 400- and 500-class error codes for 9% of its listed data source URLs.</div><div><br />A more extreme example is seen in the UK, where police forces recently started to release data about crime reports. But "whenever a new set of data is uploaded, the previous set will be removed from public view, making comparisons impossible unless outside developers actively store it" (<a href="http://www.guardian.co.uk/technology/2011/feb/02/uk-crime-maps-developers-unhappy">see The Guardian for more details</a>).</div><div><br /></div><div>Repositories have an opportunity to provide management, persistence and curation services to the open data community and its international collections of linked data. Whether our OA platforms are chosen (DSpace? EPrints? Fedora? Zentity?) is not the issue - it is the philosophy and practices of repository that are vital to the Open Data community, because the data is important and long-lived.<br /></div><div><br /></div><div>On the other hand, I have argued that reuse (and in this case retention) are the enemy of access. "Just putting it up on the Web" is an easier injunction than "deposit it in a repository" (especially if you haven't got a repository installed) and hence more likely to succeed. So we shouldn't put repositories on the Linked Data on-ramp (step/star 1), but if not there, then where should they go?</div><div><br /></div><div>I would argue that by step 3 (using open formats) or 4 (adding value with identifiers and semantic web tech) the data provider is being asked to make a more substantial investment, and to boost the value of their data holdings. <i>This</i> seems to be an appropriate point to add in extra features, especially when they will help secure the results of that investment. So the 5 stars of Linked Data would mention repositories in Level 4, but the five stars of Open Access could do so in Level 1 because they are already an accepted part of OA processes.</div><br /><div>I'm not sure I'm comfortable with mixing the levels - it makes for confusion. Wouldn't it be much better to have one set of processes that apply to all forms of openness - the basic principles of the Web? In my previous post I pointed out that you can add 5* links to 2* PDFs and spreadsheets, so I think possibly that the solution lies in the fact that the 5 stars are not sequential stages, but 5 more-or-less independent principles that each make openness more valuable and useful: licensing, machine readability, open standards, entity identification, interlinking. To which we could add "sustainability", making (see diagram above) is a constellation of linked data properties. </div><div><br /></div><div><br /></div>Leslie Carrhttp://www.blogger.com/profile/16951479417243623642noreply@blogger.com3tag:blogger.com,1999:blog-3746807651848493410.post-17882563218201294522011-03-04T10:55:00.007+00:002011-03-04T13:25:44.367+00:00The Five Stars of Open Access (aka Linked Documents)<div>Yesterday I was having a discussion about Scholarly Communications, Open Access, Web 2 and the Semantic Web with some colleagues in our newly formed "Web and Internet Science Research Group" at Southampton. As we were comparing and contrasting more than a decade's experience of open access/open data/OER/Open Government Data, we made the following observation: <b>reuse is the enemy of access</b>.</div><div><br /></div><div>There have been efforts to replace PDF with HTML as a scholarly format to make data mining more easy, and movements to establish highly structured Learning Objects rich in pedagogic metadata to facilitate interoperability of e-learning material. (I have been involved in both of these!) But both have been ignored by the community - they are too hard, they fly in the face of current practice, they involve users learning new skills or making more effort. Some would argue that similar comments could be made about preservation and open access, or even just repositories and open access.</div><div><br /></div><div>Although "reuse is the enemy of access" is quite a bold statement it's really just a reformulation of the old saw "the best is the enemy of the good". Attempts to do something with the material we have available are always more complex than just looking at the material we have available. Adding services, however valuable and desirable, are more problematic than "just making material available". In the repository community we've worked hard to help users get something for nothing (or something for as little effort as possible), and I'm proud that people recognise that philosophy in EPrints. But it's still a tension - you have to present Open Access as a bandwagon that's easy to climb on!</div><div><br /></div><div>So I'm particularly impressed with Tim Berners-Lee's <a href="http://www.w3.org/DesignIssues/LinkedData.html">Five Stars of Linked Data</a> as a means of declaring an easy onramp to the world of Linked Data, while at the same time setting out a clear means of evaluating and improving contributions and the processes required to support them. It allows the community to have their cake and eat it; to claim maximum participation (a bigger community is a more successful community) and appropriate differentiation (better value is a better agenda).</div><div><br /></div><div>I think this approach would have served the Open Access communities (OA/OER/Open Data) very well (why didn't we think of it?) But it could yet do so, and so in the spirit of reuse I offer some early thoughts on the Five Stars of Open Access.</div><div><span class="Apple-style-span" style="font-size:medium;"><span class="Apple-style-span" style=" -webkit-border-horizontal-spacing: 2px; -webkit-border-vertical-spacing: 2px; font-family:serif;"></span></span></div><blockquote><div>★ Available on the web (whatever format), but with an open licence</div><div>★★ Available as machine-readable editable data (e.g. Word instead of PDF page description)</div><div>★★★ as above plus non-proprietary format (e.g. HTML5 instead of Word)</div><div>★★★★ All the above plus, use open standards from W3C (RDF and microformats) to identify things, so that people can understand your stuff</div><div>★★★★★ All the above, plus: link your data to other people’s data to provide context <i>i.e.</i> link citations to DOIs and other entities to appropriate URIs (e.g. project names, author names, research groups, funders etc).</div></blockquote><div>These are directly taken from Tim's document, with some subtle variations, and are intended for discussion. For a start, it shows that we haven't even got very far into 1-star territory, as we mainly fudge the licensing issue. (This comes from the fact that unlike data, our documents are often re-owned by third parties.) Pressing on, the second star is available for editable source documents rather than page images and this is also a minority activity. In our repository, there are 7271 PDFs vs 820 Office/HTML/XML documents. So a long way to go there. The third star seems even more remote (376 documents). And as for the fourth star's embedded metadata?</div><div>But the fifth star: this seems to be so valuable. If we could just get there - properly linked documents, no chasing down references, the ability to easily generate citation databases, easy lookup of the social network of authors. Sigh. What's not to like? And you can even add 5* facilities to PDF, so perhaps we will find some short cuts!</div><br /><div>If we develop these five stars, it will help us to function as positive Open Access evangelists, while also promoting the future benefits that we would like to work towards. No mixed messages. No confusion.</div>Leslie Carrhttp://www.blogger.com/profile/16951479417243623642noreply@blogger.com2tag:blogger.com,1999:blog-3746807651848493410.post-12730151555130927742011-02-27T22:21:00.005+00:002011-02-28T08:01:12.073+00:00Open Access - Who Calls the Shots Now?<a href="http://repositoryman.blogspot.com/2008/01/journey-of-thousand-miles-begins-with.html">Three years ago on this blog</a> (doesn't time fly!) I contrasted the efforts that librarians and academics could make in furthering Open Access. My argument (such as it was) focused on the relationships between the two communities, noting that when it came to research, librarians could only advise and assist but that academics could lead and command. Or at least in theory! In particular I backed the idea that change would come from senior managers in the academic world and from research funders. In the intervening time we have indeed seen a big increase in OA leadership <a href="http://www.eprints.org/openaccess/policysignup/">in the form of mandates being adopted</a>, but I wonder if the pace of change is not about to put even researchers in the back seat.<div><br /></div><div>The Web was developed at CERN, in Switzerland, and took over the world in more than a geographic sense. It emerged from its home in a highly-funded, very collaborative, international research laboratory and carried the culture and design assumptions of its birthplace (open information exchange, minimal concern over intellectual property control, no requirements for individuals to monetize knowledge production) and stamped them on the rest of society, regardless of society's estimation of its own needs (for more, see the presentation<i> </i><i><a href="http://eprints.ecs.soton.ac.uk/21605/">The Information Big Bang & Its Fundamental Constants</a>). </i>One manifestation of the clash between the Web and "how society has historically operated" was the <a href="http://www.soros.org/openaccess/read.shtml">Budapest Open Access Initiative</a> some ten years after the initial development of the Web.</div><div><br /></div><div>The Web's culture of open information exchange has more recently had a very visible effect in the area of Open Government Data. A simple re-statement of the objectives of the Semantic Web as <a href="http://www.w3.org/DesignIssues/LinkedData.html">The Five Stars of Linked Data</a> has powered a tremendous focus of activity in national and local government when allied with political agendas of Transparency and Accountability. Portals like data.gov.uk and data.gov provide access to "the raw data driving government forward" which can be used to "help society, or investigate how effective policy changes have been over time". In the UK, the Treasury's COINS database of public spending is one of 5,600 public datasets that have been made available as part of the initiative. In the US, the Open Government Directive requires each department to publish high value data sets and states that "it is important that policies evolve to realize the potential of technology for open government." Both US and UK government see the opening up of public data as the driver for political improvement, innovation and economic growth, with the <a href="http://pdcengagement.cabinetoffice.gov.uk/pdc/">Public Data Corporation</a> as the focus of British development of an entire social and economic Open Data ecosystem. </div><div><br /></div><div>Having watched Open Access lobbyists engage in political processes in the UK and US (with a handful of Senators, Congressmen and MPs sometimes for OA and sometimes against) it is rather a shock to see the President and the Prime Minister suddenly mandating a completely revolutionary set of national policies based on the technological affordances of the Web, and in the teeth of plenty of advisors' entrenched opposition. And rather a shock to realise that offices even more elevated than a vice chancellor are enthusiastically joining the world of open resources and open policies.</div><div><br /></div><div>But data and publications are different things, and publications are privately owned by private publishing companies rather than stockpiled by the government. However, the decade of Open Access debate has shown that progress in OA (and OER and open data) is impeded more by individual and institutional inertia than corporate opposition. When the highest offices of government are confidently pushing forward a programme of open participation, will academics have the luxury of treading water?</div><div><br /></div><div>How will our governments sudden enthusiasm for open data affect Open Access? Perhaps not at all. Perhaps Universities are too insulated from the administrative whims and shocks of Washington and Whitehall to be affected. (How many researchers have even heard of data.gov?) Even so, governments will indirectly cause a shakeup in the administration of public research funding, and the infrastructure needed for universities to adequately respond to the requirements of open funders will cause them to become more open themselves.</div><div><br /></div><div>The public climate that informs the private OA debates and decisions in University boardrooms will change; pro-OA researchers and librarians will no longer be arguing from such a defensive position, not appearing as idealistic hippies. Even in the absence of direct government mandates, pro-OA decisions will be easier to support and less contentious to implement. The values of the research communities will change as public values and expectations change - when even governments become more accountable through open data, research communities that insist that their data and their research is their private property, for the sole benefit of the furtherance of their own careers, will soon appear old-fashioned and untenable.</div><div><br /></div><div>So watch this space. It may be that Cameron and Obama will indirectly achieve what Harnad and Suber have been toiling for. I wonder what I'll have to say in another three years' time?</div>Leslie Carrhttp://www.blogger.com/profile/16951479417243623642noreply@blogger.com0tag:blogger.com,1999:blog-3746807651848493410.post-73305761462365335002011-02-27T12:55:00.006+00:002011-02-27T20:48:16.111+00:00Rehabilitating The Third Star of Linked DataThe mantra of open data is: put your data on the web / with an open license / in a structured, reusable format / that is open / using open identifiers / that are linked with other data. <div><br /></div><div>The third step/star in this process is commonly explained as using CSV rather than Excel, (because the former is an open format, but the latter is a closed proprietary standard). You'll see this position stated at <a href="http://www.w3.org/DesignIssues/LinkedData.html">Linked Data Design at the W3C</a> and sites all around the world are copying it.</div><div><br /></div><div>We really need to think a bit harder about this: Excel's native format is an open standard, and although an XML encoding of a the complete semantics of a spreadsheet is hardly a straightforward thing to deal with, it is simple enough to extract data from. In particular, I don't see that it is significantly more difficult than dealing with CSV!</div><div><br /></div><div>Once you've unzipped the Office Open XML data, you can iterate around the contents of the spreadsheet, or extract individual cells with ease. And without any .NET coding or impenetrable Microsoft APIs. Here's a simple example that lists the addresses and contents of all the cells in a spreadsheet.</div><blockquote><xsl:template match='/'><br /> <xsl:for-each select="/worksheet/sheetData/row/c"><br /> <xsl:value-of select="@r"/> = <xsl:value-of select="v"/><br /> </xsl:for-each><br /></xsl:template><br /><br /></blockquote><div>Of course it's simplified: i've missed off the namespaces, and strings are actually stored in a lookaside table and there are multiple sheets in a single document, but even so I'd rather wrangle XML than wrestle with CSV quotes any day.</div>Leslie Carrhttp://www.blogger.com/profile/16951479417243623642noreply@blogger.com3