Journal subscriptions are negotiated, but article processing charges are fixed prices

Scientific publishing is moving to open access. This means that it is moving from a subscription (reader-pays) model to an author-pays model (normally funded through article processing charges, or APCs).

This will have a couple of impacts:

1. Institutions which publish more and read less lose to institutions which read more and publish less. Thus, research universities will probably lose out to the pharmaceutical industry (which, I am fairly sure, proportionally reads a lot of papers but does not publish as much).

This is not a big deal, but I thought I would mention it. Some people seem to be very angry at pharmaceutical companies all the time for not paying their fair share, but it’s a tough business, and part of the point of publicly funded research is to enable downstream users (like pharmaceutical companies) to flourish. Moving to APCs seems to be another move in support of pharma (with smaller biotech upstarts probably being the biggest beneficiaries). Teaching-focused universities will also benefit.

2. More importantly, though, the move to APCs instead of subscriptions is also a move from a product that is sold at a variable price to one that is bought at a fixed price.

Variable pricing (or price discrimination) is a natural feature of many markets that look like subscriptions, where fixed costs matter more than marginal costs (in this case, the extra cost of allowing access to the journal once it is produced is basically zero).

Plane tickets and hotel rooms are less extreme cases where the price will fluctuate to attempt to charge more to those willing to pay more (typically, the wealthier; but sometimes the people willing to pay more and those counting their pennies are the same people, just in different situations: sometimes I really want a specific flight, other times I don’t even care exactly where I am going).

So, in the subscription model, some institutions will pay more. Maybe it’s not fair, but hedge funds such as Harvard will get bilked, while poorer institutions can negotiate down. In the APC model, everyone will pay roughly the same (there may be some discounts for bulk purchasing, but not the variation you have today; it may even be the large institutions that negotiate down, while the smaller players pay full price).
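As a toy illustration of why this matters (entirely made-up numbers, not data about any actual publisher): suppose a journal must recover fixed costs of 110,000 USD from two institutions, one willing to pay 100,000 and one willing to pay 20,000.

```python
# Toy model, made-up numbers: fixed-cost recovery under negotiated
# (discriminatory) pricing vs. a single fixed price.
willingness_to_pay = {"rich_university": 100_000, "poor_university": 20_000}
fixed_cost = 110_000  # cost of producing the journal; marginal cost is ~zero

# Negotiated prices: charge each institution close to what it can bear.
negotiated_revenue = sum(willingness_to_pay.values())  # 120,000: costs covered

# A single fixed price: any price above 20,000 loses the poorer institution.
best_uniform_revenue = max(
    price * sum(1 for wtp in willingness_to_pay.values() if wtp >= price)
    for price in willingness_to_pay.values()
)  # 100,000: fixed costs are no longer covered

print(negotiated_revenue, best_uniform_revenue)
```

Under negotiated pricing, both institutions get access and the journal breaks even; under a single price, either the poorer institution is priced out or revenue collapses.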

Many publishers have policies to favor publications from very poor countries, charging them lower APCs. Naturally, this is a good policy, but it is not fine-grained. Universities in Bulgaria (an EU member state with a GDP per capita of 7,350 USD) are treated as being as wealthy as private American research universities: they will be expected to pay the same for their PLoS Biology papers.

Against Science Communication

Science communication, science outreach, is widely seen as a good thing, perhaps even a necessary one (Scientists: do outreach or your science dies). Let me disagree: for the most part, cutting-edge science should not be communicated to the public through mass media, and doing so harms both the public and science.

At the end of the post, I suggest some alternatives to the current state of affairs.

What is wrong with science communication?

For all the complaints that the public has lost faith in science, I often find, anecdotally, that members of the public have more faith in scientific results than most scientists do. Most scientists know to take any single paper with a grain of salt. The public does not always realize this: they don’t have the time (or the training) to dig into the paper and are left with a clickbait headline, which they must either take at face value or use as grounds to reject “scientific results” altogether.

Most of the time, science communication has to simplify the underlying science so much as to be basically meaningless. Most science doesn’t make any sense even to scientists working in adjacent fields: we publish a new method which is important for predicting the structure of a protein, which is important because other studies have shown it is important in a pathway, which other studies have shown is active in response to… and so on. At the very bottom, we have things that the public cares about (which is why we get funded), but the relationships are not trivial and we should stop pretending otherwise.

When I thumb through the latest issue of Nature, only a minority of titles are meaningful to me. Sometimes, I genuinely have no idea what they are talking about (One-pot growth of two-dimensional lateral heterostructures via sequential edge-epitaxy); other times, I get the message (Structure of the glucagon receptor in complex with a glucagon analogue) but I don’t have the context in that field to understand why it is important. To pretend that we can explain this to a member of the public in 500 words (or 90 seconds on radio/TV) is idiotic. Instead, we explain a butchered version.

Science outreach harms the public. Look at this (admittedly, not so recent) story: The pill is linked to depression – and doctors can no longer ignore it (The Guardian, one of my favorite newspapers). The study was potentially interesting, but in no way conclusive: it is observational, with obvious confounders (the population of women who start taking the pill at a young age is not the same as the population that starts later). However, reading any of the panicked articles (the Guardian again, the BBC, Quartz: this was not just the Oprah Winfrey Show) may lead some women to stop taking the pill for no good reason, which is not a positive for them. (To be fair, some news outlets did publish a skeptical view of it, e.g., The Spectator and Jezebel.)

Science outreach discredits science. The public now has what amounts to a healthy skepticism of any nutritional result, given the myriad “X food gives you cancer” or “Chocolate makes you thin” stories. If you think that there are no spillovers from publicizing shaky results into other fields of science (say, climate science), then you are being naïve.

Valuing science outreach encourages bad science. If anything, the scientific process is already tipped too far towards chasing the jazzy new finding instead of solid research. It’s the worst papers that make the news; let’s not reward that. The TED-talk-giving, book-writing star of pop psychology turned out to be a charlatan. Let’s not reward the same silliness in other fields.

What is the alternative?

The alternative is very simple: instead of communicating iffy, cutting-edge stuff, communicate settled science. Avoid almost all recent papers and focus on established results and bodies of literature. Books that summarize a body of work can do a good job.

But this is not news! Frankly, who cares? Journalists do, but I think the public is happy to read non-news stuff. In fact, most people find that type of reporting more interesting than the flashy study of the week.

Or use opportunities to tie it to current events. For example, when major prizes are given out (like the Nobel Prize, but also the tiers below it), publish your in-depth reporting on the topics. Since the topics are often predictable, you can prepare your reporting carefully and publish it when ready. You don’t even need the topic/person to actually win: in the weeks prior to Nobel Prize week, you can easily put up a few articles on “the contenders and their achievements”.

Finally, let me remark that, for the members of the public who do wish to be informed of cutting-edge science and are willing (perhaps even eager) to put in the mental work of understanding it, there are excellent resources out there. For example, I regularly listen to very good science podcasts on the microbe.tv website. They do a good job of going through the cutting-edge stuff in an informed way: they take 30 minutes to go through a paper, not 30 seconds. They won’t ever be mainstream, but that’s the way it should be.

Nonspecific Citations

The point of this post is to introduce the term nonspecific citation.

A common problem with antibodies is non-specific binding: while an antibody may be targeted to grab on to protein P, it actually also grabs X, Y, and Z, which are somewhat similar to P.

A nonspecific citation is when a paper gets cited as an example of a broad class rather than for its specific ideas.

§

In the case of a review article, it might make some sense: if you are just mentioning the field, you might as well cite my review of bioimage informatics. Even here, though, you could cite other reviews.

In the case of research articles, though, it is often more of a throwaway citation: you need to mention work in the area of cell image analysis and I have a fairly recent paper on this, so you cite that. Or you cite somebody else’s work, because the specific contents of the citation do not really matter.

§

There are intermediate levels of specificity.

Maybe you are writing a paper that somehow touts the general usefulness of local features and you write a sentence such as:

Local features are useful (or SURF has been used) in many contexts, such as A [paper], B [paper], and cell image analysis [my work].

Then there is the completely nonspecific:

Recent work in bioimage informatics […, my work, …] …

§

This will, of course, lead to a citation Matthew effect: if you are highly cited, you are likely to get even more citations by attracting more nonspecific citations.
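As a toy simulation of that rich-get-richer dynamic (a preferential-attachment assumption of mine, not data): if each new nonspecific citation picks a paper with probability proportional to its current citation count, an early lead compounds.

```python
# Toy preferential-attachment model of the citation Matthew effect.
# Assumption: each nonspecific citation picks a paper with probability
# proportional to its current citation count.
import random

random.seed(0)
citations = [10, 1]  # two otherwise-equivalent papers; one has a head start

for _ in range(1000):  # allocate 1000 new nonspecific citations
    winner = random.choices(range(len(citations)), weights=citations)[0]
    citations[winner] += 1

print(citations)  # the ~10:1 head start persists, now at a much larger scale
```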

§

Whenever there is a paper I really want to read that is not open access and to which my institution does not have access, I generally get it by asking the author for a preprint. It has never been a problem and has even led to some good follow-up discussions.

On the other hand, if I am not looking for a specific paper but just for something in a general area to get acquainted with it, then whether a paper is open access or not may make me choose to read (and then subsequently cite) one paper instead of another. In fact, just seeing an IEEE link will be enough for me to not even click through to the abstract.

Maybe if open access does have an advantage in getting citations, it’ll mostly come in the form of nonspecific ones.

IMPACT or How I Learned to Start Worrying and Fear Altmetrics

Altmetrics is the idea that scientific publications should be judged (perhaps primarily) on the impact they have in the general media, including social media. This is an alternative to looking at either citations or journal impact factors.

§

People who know me outside of science know that antibiotic overuse is a pet peeve of mine [1]. We just published a paper touching on this very subject. It also touched on antibiotic use in agriculture. Both of these can be sold as hot subjects, and it’d certainly be possible to try to get some attention on social media with a few bolder statements: antibiotic use in factory farming causes antibiotic-resistant infections!

Oh, the altmetrics would go through the roof, but we don’t have the data to support anything like that claim. Our data and analysis are congruent with the idea that antibiotic overuse by humans and farm animals leads to increased resistance, which may lead to more antibiotic-resistant infections, but we must acknowledge that there are a large number of confounders and no proof of direct causality. Broadly speaking, people in countries that like to give their animals antibiotics also take a bunch themselves, so we cannot disentangle farm-to-fork transmission from human antibiotic (over)use. Furthermore, the presence of antibiotic resistance genes is not sufficient to infer the presence of clinically relevant antibiotic-resistant pathogens (this may be a limitation of current methods of analysis, naturally, but a limitation it is). The paper, naturally, has more details on these questions.

We wrote it as good scientists, presenting our data and conclusions and acknowledging limitations. We hope to get scientific recognition for this: most directly in the form of citations, naturally, but also more general recognition (“those people in the Bork lab did a really good job, both on their own data and in reviewing other work”).

If our incentives were to stir up controversy on social networks, they would point away from this towards a more polemical stance (and whilst that might, in some sense, draw more engagement with scientific results, it would, in a more fundamental sense, move the discourse away from an evidence-based direction [2]).

When writing blog posts, I put in short, pithy sentences for Twitter; it’d be dangerous if I did the same when writing a journal paper.

§

Metrics don’t just measure; they also shape behaviour. You need to solve for the equilibrium.

You need to ask: would it be a good thing if people started, on the margin, to optimize for your metric? In the case of scientists and altmetrics, the answer may be NO.

§

An unrelated criticism of altmetrics is that they would be outright gamed, and that the scientific world has nowhere close to the capacity to fight spam the way Google et al. do. The linked article is also notable for using the word meretricious in its title.

Also, do read the rejoinder.

[1] I’m the sort of guy who, when a person complains that their doctor didn’t give them antibiotics for the flu, is liable to praise the doctor instead of expressing empathy.
[2] In fact, public diffusion of speculative scientific results can lead to mistrust of science, as these speculative results will tend to contradict one another, leading to dismissal of science in general.

Friday Links

1. On Schekman’s pledge not to publish in high-profile journals. I almost called this a balanced view, but then realized that I have probably used that phrase to refer to Derek Lowe’s work at least twice in the past. The man is smart and balanced, what can I say?

2. An interesting meeting report (closed access, sorry). Two highlights:

While discussing mutations that predispose to cancer, Nazneen Rahman (Institute of Cancer Research, UK) rightly reminded us that people make big decisions and have parts of their anatomy removed based on their genotype.

[…]

Jeanne Lawrence (University of Massachusetts Medical School, USA) convincingly showed that her lab was able to silence one entire copy of chromosome 21 in stem cells in vitro. Trisomy 21 or Down’s syndrome is caused by an extra copy of chromosome 21. […] Lawrence and colleagues inserted XIST (human X-inactivation gene) into chromosome 21 in stem cells with trisomy 21. They then showed using eight different methods that a single copy of the chromosome had indeed been silenced.

3. A good explanation of Bitcoin, the protocol

4. Interesting article about wine & technology in The Economist (which is one of the few mainstream magazines whose science coverage is worth reading [1]).

[1] Actually, I think it’s the only one that can be consistently trusted, but I enjoy anything by Ed Yong wherever he publishes and have been reading some excellent articles by Carl Zimmer in The Atlantic.

Why Science is a Third World Economy

Because people are cheap and things are expensive.

§

To a large extent, it is easier to get money to pay for people (salaries [1]) than to pay for things. Other times, people show up who are willing to work without being paid (they are self-funded), but then you need to get them materials to work with, and for that you need to actually spend some money. And sometimes you actually have money, but it can only pay for things of type X, not of type Y, which is what you wanted.

So, it often feels very much like the third world: a lot of people standing around a few physical resources, and replacement of capital by labour.

§

A while back, I read a review comparing several technologies for the same measurement task [2]. There were two high-quality methods in terms of output. One was very automated but required you to buy a kit (~$400); the other was artisanal.

The authors wrote that the first one was good because it was very fast, but expensive; the other one took a long time, but was cheap. They didn’t even price in the cost of labour! They didn’t even ask how many hours of graduate student time you can get for $400.
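A back-of-envelope version of the missing comparison (every number below is my assumption, not from the review):

```python
# Back-of-envelope labour-vs-kit comparison; every number is an assumption.
kit_cost = 400.0             # the automated method's kit
stipend_per_year = 30_000.0  # assumed all-in cost of a graduate student
hours_per_year = 2_000.0     # assumed working hours per year

cost_per_hour = stipend_per_year / hours_per_year  # 15 USD/hour
breakeven_hours = kit_cost / cost_per_hour         # ~27 hours

print(f"At {cost_per_hour:.0f} USD/hour, the kit pays for itself once the "
      f"artisanal method costs {breakeven_hours:.0f} extra hours of labour.")
```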

Not pricing in labour makes some sense, of course, in the publicly funded bureaucratic world where money is not fungible: you often cannot reallocate money from stipends to materials.

§

And then there is that expensive piece of equipment that is not really used: there was a specific half-a-million grant to buy it, but enthusiasm petered out, and the person who was going to use it had gotten a different job by the time the thing was delivered, so nobody really cared to pick it up.

Yep, that’s a third world thing too.

[1] Or stipends, which are exactly like a salary except for tax purposes.
[2] I could probably find it now if I looked, but I don’t actually want to lose track of the main point.

People do read your thesis

Last week, @proflikesubstance wrote that you should Publish papers. Your thesis means nothing:

Get the papers out. Don’t focus on an arcane document that will gather dust for the next 50 years until the departmental office needs space and throws the old ones out.

§

I didn’t even get the bound copies of my thesis. It’s a PDF sitting on a department server (it’s open access!).

Thus, this image of a thesis as a big block of paper gathering dust in a hard-to-reach library section is out of date. It is a document, widely available, sometimes even read. During my graduate studies, I did read a few theses by others.

Because things are now electronic, people will read your thesis. Probably not PIs (who will read the executive summary, i.e., the papers), but graduate students will (and, incredible though it may seem, graduate students are people too). The median paper is probably read by very few people too, by the way.

§

My thesis is a staple thesis: short intro, review, paper, paper, paper, paper [yet unpublished], short “conclusion”, software paper as appendix.

Have your cake and eat it too.

I did, however, enjoy being able to write without any page limits and put in more details than made it into the final version of the papers. I also put in some side comments that I thought were cool but on the periphery of what the chapter was about. In some ways, I had the opportunity to write down the type of thought I would now cut out of the paper and put on the blog.

§

This is not to say that you shouldn’t publish in peer-reviewed outlets. You should (as I said, my thesis is a staple thesis; almost all of the content is available in the peer-reviewed literature too). But we shouldn’t use bad or out-of-date arguments, like “nobody reads your thesis”, for it.

You should publish because your résumé looks better with publications than with just a thesis. Theses are like assholes: everybody has one (at least, everyone applying to the sort of academic jobs we are discussing).

So, write your thesis and publish.

Universities Merging: Improved Rank

The university where I got both my BS and MS will soon cease to exist as a separate entity: the University of Lisbon and the Technical University of Lisbon are merging into a single institution.

The main reason (as I understand it from people at those institutions) is that there are currently still too many institutional barriers to the cross-disciplinary work that many people are doing across the two institutions, and that it makes no sense to have two half-universities. A hundred years ago, the faculties were well defined; nowadays, these divisions make less sense (you can easily imagine a 1950s administrator asking why an engineer would ever need to meet with a medical specialist; today, those interactions are crucial).

One interesting side effect, however, is that the merged institution will shoot up in several international rankings of research institutions. Many rankings do not correct for institution size, so if they measure research output, it will now be much larger: all those papers, all those citations, all those research awards that used to get split into two pots will now go into the same bin.

Nothing will have changed but the accounting. Still, a combined institution will climb many steps in the ranking ladder.

How Long Does PLoS Take to Review a Paper? All PLoS Journals Now

Due to popular demand (at least two people asked; surely that’s demand), here is a generalization of Monday’s work to include a few more PLoS journals. (This was mostly because it was easy to generalize my scripts to process any PLoS journal.)
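The real scripts are on GitHub (linked at the end); as a sketch of the shape of the generalization, with toy stand-in data rather than the actual measurements:

```python
# Sketch of the generalization: the same summary computed for any journal.
# The acceptance times below are toy stand-ins, not the real measurements.
acceptance_days = {
    "plosbiology": [140, 160, 155],
    "plosmedicine": [200, 215, 230],
    "plosone": [90, 110, 150],
}

for journal, days in sorted(acceptance_days.items()):
    months = sum(days) / len(days) / 30.4  # average month length in days
    print(f"{journal}: mean acceptance time {months:.1f} months")
```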

Here are the figures for all PLoS journals. PLoS One, at the end, is the same figure I posted on Monday.

[Figures: acceptance-time distributions for PLoS Pathogens, PLoS Neglected Tropical Diseases, PLoS Medicine, PLoS Genetics, PLoS Computational Biology, PLoS Biology, and PLoS One.]

For all PLoS journals except PLoS Medicine, the average acceptance time seems to be about 5 months; PLoS Medicine takes about 7 months.

PLoS Medicine is the journal that takes the longest to review; PLoS One is the fastest (although some of its papers may have been reviewed at another PLoS journal first, speeding up the process).

§

Scripts are on GitHub.

How Long Does PLoS One Take to Accept a Paper?

How long do papers take to review?

Too long.

No, seriously, how long? I did a little measurement.

I downloaded the 360 most recent papers from PLoS One (as of Friday). They are all annotated with submission and acceptance dates, so it was easy to just compute the differences.

The plot below is a histogram (one bin per day) in grey, with a kernel density estimate as a solid line.

[Figure: histogram of acceptance times, with kernel density estimate.]
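A minimal sketch of the computation (the dates below are stand-ins; the real download-and-parse code is in the scripts linked at the end):

```python
# Sketch: acceptance times from (submission, acceptance) date pairs,
# plotted as a one-bin-per-day histogram with a kernel density estimate.
from datetime import date

import matplotlib.pyplot as plt
from scipy import stats

paper_dates = [  # stand-ins for the 360 downloaded papers
    (date(2012, 11, 20), date(2013, 3, 1)),
    (date(2013, 1, 4), date(2013, 4, 12)),
    (date(2013, 2, 1), date(2013, 5, 30)),
]
days = [(accepted - submitted).days for submitted, accepted in paper_dates]

plt.hist(days, bins=range(0, max(days) + 2), color="0.8")  # one bin per day
kde = stats.gaussian_kde(days)
xs = range(0, max(days) + 2)
plt.plot(xs, [kde(x)[0] * len(days) for x in xs], "k-")  # scaled to counts
plt.xlabel("days from submission to acceptance")
plt.savefig("acceptance-times.png")
```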

§

The result is that it takes about 3 to 4 months to get a paper accepted, but with substantial variance.

§

Looking at the figure, I had to ask who the poor people were who published the paper that was longest in revision.

Alternative Sigma Factor Over-Expression Enables Heterologous Expression of a Type II Polyketide Biosynthetic Pathway in Escherichia coli by David Cole Stevens, Kyle R. Conway, Nelson Pearce, Luis Roberto Villegas-Peñaranda, Anthony G. Garza, and Christopher N. Boddy. DOI: 10.1371/journal.pone.0064858

Submitted on 29 March 2011 and accepted on 22 April 2013, this paper was 755 days in revision.
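The arithmetic is easy to check:

```python
# Verify the 755-day submission-to-acceptance interval.
from datetime import date

print((date(2013, 4, 22) - date(2011, 3, 29)).days)  # 755
```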

The fastest acceptance took only 19 days. However, this being PLoS One, it is possible that the paper had been reviewed at another PLoS journal, rejected with positive reviews on significance grounds, and had those reviews transferred to PLoS One. After this, acceptance followed without a new round of peer review.

§

This is a gimmick. There is perhaps a paper to be written where this is extended to see how areas of research, keywords, &c affect acceptance time. If I had more free time, I might write that paper.

The code for the above is available on GitHub.

Update: Followup with all PLoS journals.