i’m a chordata! urochordata!

July 20, 2010

“Privatizing” the Reviewer Commons?

Filed under: paper review,professional musings,publishing,rantish — Tags: — jebyrnes @ 1:05 pm

This post was chosen as an Editor's Selection for ResearchBlogging.orgLet’s face it. The current journal system is slowly breaking down – in Ecology if not in other disciplines as well. The number of submissions is going up exponentially. At the same time, journals are finding it harder and harder to find reviewers. Statistics such as editors contacting 10 reviewers to find 3 are not uncommon. People don’t respond, they take a long time to review, or just take a long time and THEN don’t respond leading to a need for still more reviewers to be found (this has held up 2 of my pubs for 3+ extra months). The consequences are inevitable. I’ve heard (and experienced) more and more stories of people submitting to journals for which their work is perfectly suited, only to have them rejected without review for trivial, if any, reason. (I know the plural of anecdote is not data – see refs in the article below for a more rigorous discussion).

Even if an article is reviewed, once rejected, it begins the revision cycle afresh at a new journal, starting the entire reviewer finding-and-binding process over again, yielding considerable redundancy of effort. This is slowing the pace of science, and the pace of our careers – a huge cost for young scientists.

How do we solve the tragedy of the reviewing commons?

Jeremy Fox and Owen Petchy lay out an intriguing suggestion (or see here for free pdf) and couple it with a petition. If you’re convinced by their article, go sign it now.

In essence, they want to “privatize” the reviewer commons. They propose the creation of a public online Pubcred bank. To submit a paper, one pays three credits. For every review, they receive one credit. This maintains a minimum 3:1 submit:review ratio which we should all be maintaining. Along with this, they propose that reviews are passed from journal to journal if a paper is rejected. They authors cannot hide from comments, hoping to roll the dice and get past critical reviewers. This lessens the workload for everyone and boosts science.

There are of course a million details to be worked out – what about new authors (they propose an allowable overdraft), multi-authored papers (split the cost), bad reviews (no credits for you!), etc.? Fox and Petchy lay out a delightfully thoughtful and detailed response to all of these (although I’m sure more will crop up – nothing is perfect).

I think a Pubcred system is absolutely essential to the forward progress of modern science, and I whole-heartedly support this proposal (and signed the petition). At the same time, I think there is a second problem worth thinking about that is related to the proliferation of articles.

Namely, the review and re-review cycle. We all start by submitting to the highest impact journal that we think will take our articles. This can lead to a cycle of review and re-review that takes time and energy from reviewers, and can be gamed by authors who do not revise before resubmitting (who among us has not seen this happen?).

For this reason, at a minimum, the sharing of rejection reviews from journal-to-journal and authors being forced to respond is *ESSENTIAL* to the Pubcred system working. On the other hand, Pubcreds are going to require a large co-ordinating effort between journals – many of whom are published by different organizations. If we are going to go to this trouble already, one wonders if a system where authors submit articles to a common reviewing pool, and journals select articles after review and revision (asking for any additional revisions as needed) as proposed by Stefano Allesina might be even more efficient.

Then again, let’s come back to the real world. Such a system would require a sea-change in the world of academic publishing, and I don’t think we’re there yet. The Pubcred bank will require its own journal compliance hurdles in the first place, and a need for multiple publishers to agree and co-ordinate their actions. No small feat. Given its technical simplicity and huge benefits to journals, this task will hopefully be minor. Implementing Pubcreds gets us a good part of the way there, and begins to tackle what is rapidly becoming a large problem lurking in the background. It won’t solve everything (or maybe it will!), but it should certainly staunch the current tide of problems.

So please, read the article, and if you agree, go sign the petition already!

Update: For more thoughtful discussion see this post at Jabberwocky Ecology and a thoughtful response by Fox and Petchey.

Fox, J., & Petchey, O. (2010). Pubcreds: Fixing the Peer Review Process by “Privatizing” the Reviewer Commons Bulletin of the Ecological Society of America, 91 (3), 325-333 DOI: 10.1890/0012-9623-91.3.325

Stefano Allesina (2009). Accelerating the pace of discovery by changing the peer review algorithm arXiv.org arXiv: 0911.0344v1

June 29, 2010

In the Grass, On the Reef – Vlogging Research

Filed under: neat!,shout out — jebyrnes @ 8:53 am

So, you’re an ecology power-couple starting a new life. You’ve got some elegant and incredibly ambitious experiments designed, funded, and ready to go. You’re about to revolutionize the study of coastal ecosystems…but where?

Where is in the Gulf of Mexico.

And right before your work really takes flight, the Deepwater Horizon happens.

(add to that that one of you is taking some course from some silly guy on this thing called structural equation modeling – why would you ever do that? I mean, come on, really.)

So, what do you do? Blog it.

Dr. Randall Hughes and Dr. David Kimbro are two of the finest ecologists I know (and former colleagues out at BML). Their blog/vlog, in collaboration with WFSU-TV, should be a fascinating exploration of how ecologists conduct research, as well as tracking research efforts on the effects of oil on marshes, seagrass, and oyster beds in real time.

So go check out In the Grass, On the Reef.

June 28, 2010

How a Scientist Sees the World?

Filed under: Uncategorized — jebyrnes @ 8:17 am

I wish I could say this is wrong, but…From an excellent series on science over at Abstruse Goose.

OK, they did get one thing wrong. There should be a few more arrows coming out of that rabbit. And the birds, too. What, you think I’m kidding? No indeed!

June 25, 2010

Tunicate Whiskey

Filed under: ascidians,tunicate culinary adventures — jebyrnes @ 10:30 am

From this issue of the Ascidian News comes word of something….well….awesome.

Tunicate Whiskey.

To wit, hollow out one Halocynthia roretzi. Fill tunic with Soju (korean whiskey). Drink.

For some reason, I think this would work really well with Laphroaig or other Islay malts. Or, perhaps, give that nice Islay character to something more generic – spice up your Johnny walker with a little Styela.

On second thought, a big Styela is about the size of a shot glass…

(for other great pictures, see here, and, if you’re interested, here’s another nice ascidian gastronomy tip.)

June 17, 2010

Do Not Log-Transform Count Data, Bitches!

Filed under: R,paper review,rantish,statistics — jebyrnes @ 3:28 pm

ResearchBlogging.org OK, so, the title of this article is actually Do not log-transform count data, but, as @ascidacea mentioned, you just can’t resist adding the “bitches” to the end.

Onwards.

If you’re like me, when you learned experimental stats, you were taught to worship at the throne of the Normal Distribution. Always check your data and make sure it is normally distributed! Or, make sure that whatever lines you fit to it have normally distributed error around them! Normal! Normal normal normal!

And if you violate normality – say, you have count data with no negative values, and a normal linear regression would create situations where negative values are possible (e.g., what does it mean if you predict negative kelp! ah, the old dreaded nega-kelp), then no worries. Just log transform your data. Or square root. Or log(x+1). Or SOMETHING to linearize it before fitting a line and ensure the sacrament of normality is preserved.

This has led to decades of thoughtless transformation of count data without any real thought as to the consequences by in-the-field ecologists.

But statistics has had a better answer for decades – generalized linear models (glm for R nerds, gzlm for SAS goombas who use proc genmod. What? I’m biased!) whereby one specifies a nonlinear function with a corresponding non-normal error distribution. The canonical book on this was first published ’round 1983. Sure, one has to think more about the particular model and error distribution they specify, but, if you’re not thinking about these things in the first place, why are you doing science?

“But, hey!” you might say, “Glms and transformed count data should produce the same results, no?”

From first principles, Jensen’s inequality says no – consider the consequences for error of the transformation approach of log(y) = ax+b+error versus the glm approach y=e^(ax+b)+error. More importantly, the error distributions from generalized linear models may often be far far faaar more appropriate to the data you have at hand. For example, count data is discrete, and hence, a normal distribution will never be quite right. Better to use a poisson or a negative binomial.

But, “Sheesh!”, one might say, “Come on! How different can these models be? I mean, I’m going to get roughly the same answer, right?”

O’Hara and Kotze’s paper takes this question and runs with it. They simulate count data from negative binomial distributions and look at the results from generalized linear models with negative binomial or quasi-poisson error terms (see here for the difference) versus a slew of transformations.

Intriguingly, they find that glms (with either distribution) always perform well, while each transformation performs poorly at some or all values.

Estimated root mean-squared error from six different models. Curves from the quasi-poisson model are the same as the negative binomial. Note that the glm lines (black solid) all hang out around 0 as opposed to the transformed fits.

More intriguingly to me are the results regarding bias. Bias is the deviation between a fit parameter and its true value. Bascially, it’s a measure of how wrong your answer is. Again, here they find almost no bias in the glms, but bias all over the charts for transformed fits.

Estimated mean biases from six different models, applied to data simulated from a negative binomial distribution. A low bias means that the method will, on average, return the 'true' value. Note that the bias for transformed fits is all over the place. But with a glm, bias is always minimal.

They sum it up nicely

For count data, our results suggest that transformations perform poorly. An additional problem with regression of transformed variables is that it can lead to impossible predictions, such as negative numbers of individuals. Instead statistical procedures designed to deal with counts should be used, i.e. methods for fitting Poisson or negative binomial models to data. The development of statistical and computational methods over the last 40 years has made it easier to fit these sorts of models, and the procedures for doing this are available in any serious statistics package.

Or, more succinctly, “Do not log-transform count data, bitches!”

“But how?!” I’m sure some of you are saying. Well, after checking into some of the relevant literature, it’s quite straightforward.

Given the ease of implementing glms in languages like R (one uses the glm function, checks diagnostics of residuals to ensure compliance with model assumptions, then can use Likliehood ratio testing akin to anova with, well, the Anova function) this is something easily within the grasp of the everyday ecologist. Heck, you can even do posthocs with multcomp, although if you want to correct your p-values (and there are reasons to believe you shouldn’t), you need to carefully consider the correction type.

For example, consider this data from survivorship on the Titanic (what, it’s in the multcomp documentation!) – although, granted, it’s looking at proportion survivorship, but, still, you’ll see how the code works:

library(multcomp)
### set up all pair-wise comparisons for count data
data(Titanic)
mod <- glm(Survived ~ Class, data = as.data.frame(Titanic), weights = Freq, family = binomial)

### specify all pair-wise comparisons among levels of variable "Class"
### Note, Tukey means the type of contrast matrix. See ?contrMat
glht.mod <- glht(mod, mcp(Class = "Tukey"))

###summaryize information
###applying the false discovery rate adjustment
###you know, if that's how you roll
summary(glht.mod, test=adjusted("fdr"))

There are then a variety of ways to plot or otherwise view glht output.

So, that’s the nerdy details. In sum, though, the next time you see someone doing analyses with count data using simple linear regression or ANOVA with a log, sqrt, arcsine sqrt, or any other transformation, jump on them like a live grenade. Then, once the confusion has worn off, give them a copy of this paper. They’ll thank you, once they’re finished swearing.

O’Hara, R., & Kotze, D. (2010). Do not log-transform count data Methods in Ecology and Evolution, 1 (2), 118-122 DOI: 10.1111/j.2041-210X.2010.00021.x

April 1, 2010

My Dissertation in Under 7 Minutes

ResearchBlogging.org
I recently attended the DISCCRS symposium for recent PhDs of a wide variety of disciplines whose work (past or present) deals with climate change. The week-long meeting was phenomenal, seeding me with thoughts, ideas, and basically making me feel quite good about the work I’m doing (if also very pessimistic about how society is dealing with Climate Change). Perhaps one of the most interesting exercises of the whole thing was something we had to do as a sort of getting-to-know-you. We had to present our dissertation in 7 minutes.

That’s right. 6 years of blood, sweat, and tears in 7 minutes. Oh, and for a totally non-specialist audience.

I thought this was an amazing challenge. Granted, I only ended up really presenting on 3 of my chapters (see papers below). But I really liked the results.

Apologies for the sound quality – the acoustics of the room were pretty bad. Props to The Urban Matador for some sound editing. And malaprops to me for not annunciating or projecting as much as I usually do. Bad scientist. Bad! Bad!

Also, please note that this is with a particular emphasis on how our work relates to climate change.



Byrnes, J., Reynolds, P., & Stachowicz, J. (2007). Invasions and Extinctions Reshape Coastal Marine Food Webs PLoS ONE, 2 (3) DOI: 10.1371/journal.pone.0000295

Byrnes, J., Stachowicz, J., Hultgren, K., Randall Hughes, A., Olyarnik, S., & Thornber, C. (2005). Predator diversity strengthens trophic cascades in kelp forests by modifying herbivore behaviour Ecology Letters, 9 (1), 61-71 DOI: 10.1111/j.1461-0248.2005.00842.x

Byrnes, J., & Stachowicz, J. (2009). The consequences of consumer diversity loss: different answers from different experimental designs Ecology, 90 (10), 2879-2888 DOI: 10.1890/08-1073.1

March 31, 2010

Our Future: Hot n’ Tasty?

Filed under: paper review — Tags: — jebyrnes @ 6:55 am

Climate change. It’s going to wreak no small amount of havoc on mother nature (and if you’re reading this but think all of this climate change stuff is poppycock, please visit Skeptical Science and then come back). How good of a guide is our intuition for what will happen?

This is a great question when it comes to predator-prey relationships and the food chain. One may well think that, heck, from first principles we know that adding heat to a system speeds things up. So, you know, things should continue just as they are – just faster, and maybe with some range-limits and annual timing moved around a bit.

This is quite a sensible proposition. It’s also wrong.

Biomass of phytoplankton and zooplankton under different temperature regimes. Images from Wikipedia.

Recent work from Mary O’Connor and colleagues in PLoS Biology as well as Oliver Beveridge and colleagues in the Journal of Animal Ecology points out some intriguing and non-intuitive effects of heating up food webs – in two completely different systems no less.

O’Connor et al. examine the relationship between phytoplankton and their predators under nutrient enriched conditions. While the little green guys grow more quickly (per capita primary production goes up with increasing temperature), this doesn’t matter to voracious copepods. Their metabolism is sped up even faster, leading to more and more copepods and less and less phytoplankton at higher temperatures.

But what about layering a little complexity on top of that? Beveridge et al. look at a three-level food chain, with bacteria at the bottom, a protozoan consumer, and a ciliate top predator. They cross a manipulation of number of links in the food chain against an increase in temperature and let things run for 6 weeks. While cranking up the heat leads bacteria activity to pick up markedly (and even moreso when their consumer is around), temperature crossed with food chain length does some funny things.

Density of different organisms under conditions of differing food chain length and temperature. Note, if you like the knitted bacteria, you can find the patterns at loxosceles - http://loxosceles.org/crafty/bacterium.html.

Depending on the number of links in the food chain, the relationship between temperature and bacterial density is positive, negative, or U-shaped. In contrast, their consumer, Colpidium, increases where it’s warm when there are no predators, but markedly declines with temperature when it’s predator is around. It’s predator, Didinium, increases in density only at intermediate temperatures. Again, shifts in metabolism and predation rates throughout the food chain appears to be key.

Together, the two studies suggest a dynamic interplay between metabolic activity, rates of predation, and population dynamics. Different levels of a food chain can be affected in very different ways. The simple faster-herbivore-kill-kill scenario is sadly discredited. Rather, we need an understanding of how warming will affect different types of organisms’ rates of growth, death, and predation. Only then can we determine climate change’s dynamic impact on food webs.

O’Connor, M., Piehler, M., Leech, D., Anton, A., & Bruno, J. (2009). Warming and Resource Availability Shift Food Web Structure and Metabolism PLoS Biology, 7 (8) DOI: 10.1371/journal.pbio.1000178

Beveridge, O., Humphries, S., & Petchey, O. (2010). The interacting effects of temperature and food chain length on trophic abundance and ecosystem function Journal of Animal Ecology, 79 (3), 693-700 DOI: 10.1111/j.1365-2656.2010.01662.x

March 5, 2010

The Map of Science

Filed under: neat!,science! — Tags: , , — jebyrnes @ 9:38 am

Why does it take so long for awesome cutting-edge statistical to make their way over to ecology? There are a myriad of techniques out there that have been around for 20, 30, 40, or more years that could help so many ecologists from banging their head into a wall over and over and over and…well, you get the point. But, it takes quite a while for them to percolate over to us. This is not for lack of user-friendly tools, often. Rather it has to do with the connectivity of disiciplines.

For example, I was having a lovely conversation with Jim Grace the other day about using Structural Equation Modeling for predictive purposes, and we ended up chatting a little about history. SEM as it is done currently – using maximum likelihood approaches to fit a model to a covariance or correlation matrix – really dates to the late 1960s and early 1970s. Before then, scientists in a number of disciplines used a wide variety of approaches to examine path models (a là Sewall Wright’s Path Analysis), or perform Factor Analysis, or approach other multivariate models that often included latent variables. These techniques were fairly heterogeneous, even though they attempted to do roughly similar things.

It took Karl Jöreskog‘s wonderful papers outlining his LISREL technique and software using maximum likelihood to really bring the whole enterprise together into modern SEM.

And yet, despite the fact that this seminal work was published in the 70s, there are Ecological papers well into 90s that use piecewise regression models to fit path analyses. Why?

The answer can be summed up by this beautiful diagram detailing the connectivity of science in 2004 from the ever-interesting eigenfactor.org (and hat-tip to Jim for pointing it out to me).

Orange circles represent fields, with larger, darker circles indicating larger field size as measured by Eigenfactor score™. Blue arrows represent citation flow between fields. An arrow from field A to field B indicates citation traffic from A to B, with larger, darker arrows indicating higher citation volume. Image from eigenfactor.org.

Basically, these methods were developed for economics, and saw their first heavy use there and and sociology, political science, education, and psychology. In terms of connectivity, Ecology & Evolution sites on the other side of a doughnut hole of communication (with the occasional exception of psychology). Historically, the fields where the newest techniques are being developed are rarely examined by ecologists, and it is to our loss. Fortunately, I think this is a historical trend. With the rise of search engines, message-boards, and copious mailing lists, I do wonder if a connectivity graph from 2004-2010 would be much tighter.

Connectivity can only be a boon for science. With environmental issues beginning to impinge on every endeavor, it has become more important than ever to survey the breadth of what is out there.

So, hey, sign-up for alerts for a journal that you think will have no relevance to you. Who knows what might drop into your inbox.

February 15, 2010

Viva la Neo-Fisherian Liberation Front!

Filed under: statistics — Tags: , , — jebyrnes @ 1:41 pm

p≤0.05

This post was chosen as an Editor's Selection for ResearchBlogging.org Significant p-values. For so many scientists using statistics, this is your lord. Your master. Heck, it has its own facebook group filed under religious affiliations (ok, so, maybe I created that.) And it is a concept to whose slavish devotion we may have sacrificed a good bit of forward progress in science over the past half century. Time to blow up the cathedral! Or so says Stuart Hurlbert and Celia Lombardi in a recent fascinating review.

But first, for the uninitiated, what does it mean? Let’s say you’re running an experiment. You want to see whether fertilizer affects the growth rate of plants. You get a bunch of random plots, seed them, and add fertilizer to half of them. You then compare the mean growth rates of the two groups of plots. But are they really different? In essence, a p value gives you the probability that they are the same. And if it is very low, you can reject the idea that they are the same. Well, sort of.

A p value, as defined by the Patron Saint of Statistics for us experimental grunts, R. A. Fisher, is the probability of observing some result given that a hypothesis being tested is true. Of, if d=data, and h=a hypothesis, p(d|h) in symbolic language – | means given. Typically, this hypothesis being tested is a null hypothesis – that there is no difference between treatments, or the slope of a line is 0. However, note a few things about this tricky statement. 1) It is not the probability of accepting the hypothesis you’re trying to reject. 2) It makes no claims any particular hypothesis being true. For all practical purposes, in the framework of testing a null hypothesis, however, a low p value means there is a very low probability that

OK. But what is this 0.05 thing all about? Well, p will range from 0 to 1. As formalized by Jerzy Neyman and Egon Pearson (no, not THAT Egon), the idea of Null Hypothesis Significance Testing (NHST) is one where the researcher established a critical value of p, called α. The researcher then tests the null statistical hypothesis of interest, and if p falls at or below alpha, the results are deemed ‘statistically significant’ – i.e. you can safely reject the null. By historical accident of old ideas, copyright, a little number rounding, a lack of computational power to routinely calculate exact p values in the 30s, and some early textbooks 0.05 has become the standard for much of science.

Indeed, it is mother’s milk for any experimental scientist who has taken a stats course in the last 40+ years. It is enshrined in some journal publication policies. It is used for the quality control of a great deal of biomedical research. It is the result we hope and yearn for whenever we run an experiment.

It may also be a false god – an easy yes/no that can lead to into the comfortable trap of not thinking critically about a problem. After all, if your test wasn’t “significant”, why bother with the results? This is a dangerous line of thinking. It can seriously retard scientific progress and certainly has led to all sorts of jerrymandering of statistical tests and datasets, or even adjusting α up to 0.10 or down to 0.01, depending on the desired result. Or, worse, scientists misreading the stats, and claiming that a REALLY low p value meant a REALLY large effect (seriously!) or that a very high value means that one can accept a null hypothesis.

Scientists are, after all, only human. And are taught by other humans. And while they are trained in statistics, are not statisticians themselves. All too human errors creep in.

Aside from reviewing a tremendous amount of literature, Hurlbert and Lombardi perhaps best sum up the case as follows – suppose you were to look at the results of two different statistical tests. One one, p=0.051. In the other, p=0.049. If we are going with the α=0.05 paradigm, then one test we would not reject the null. In the other we would and label the effect as ‘significant’.

Clearly, this is a little too arbitrary. H&L lay out a far more elegant solution – one that is being rapidly incorporated in many fields and has been advocated for some time in the statistical literature. It is as follows:

1) Report a p-value for a test. 2) Do not assign it significance, but rather refer to the level of support it gives for rejecting a null – strong, weak, moderate, practically non-existent. Make sure this statement of support is grounded in the design and power of the experiment. Suspend judgement on rejecting a null if the p value is high, as p-value testing is NOT the same as giving evidence FOR a null (something so many of us forget). 3) Use this in accumulation with other lines of evidence to draw a conclusion about a research hypothesis.

This neoFisherian Significance Assessment (NFSA) seems so simple, so elegant. And it puts the scientist back into the science in a way that NHST does not.

There have been of course other proposals. Many have advocated throwing out p values and reporting confidence intervals and effect sizes. This information can be incredibly invaluable, but CIs can often be p values in disguise. Effect sizes are great, but without an estimate of variability, they can be deceiving. Indeed, the authors argue that p value reporting is the way to assess the support for rejecting a null, but that the nuance with which it is done is imperative.

They also review several other alternatives and critiques – Bayesian ideas or information theoretic approaches, although I think there is some misunderstanding there that leads the authors to see conflict with their views where there actually is none. Still, it does not distract from their main message.

It should also be noted that this is one piece of a larger agenda by the authors to force scientists (and particularly ecologists) to rethink how we approach statistics. There’s another paper out there that demonstrates why one-tailed t-tests are the devil (the appendix of which is worth leading to see how conflicted even textbooks on the subject can be), and another is in review on why corrections for multiple hypothesis testing (e.g. Tukey tests and Bonferroni corrections) are in many cases quite unnecessary.

Strong stuff (although who would expect less from the man who gave us the clarion call against pseudoreplication). But intellectually, the arguments make a lot of sense. If anything, it forces a greater concentration on the weight of evidence, rather than a black-and-white situation. It puts the scientist back in the limelight, forcing them to build a case and apply their knowledge, skills, and creativity as a researcher.

Quite liberating. We shall see if it is adopted, and where it leads. I leave you with an excerpt from the concluding remarks.

We came along after the dust had settled, and have just tried to push over the last remaining structures of the old cathedral and to show the logic of the neoFisherian reformation. Most of the stone building blocks from the old cathedral were still of value. They just needed to be reassembled with fresh mortar by a new generation of scientists and statisticians to increase the guest capacity and beautify the gardens of the neoFisherian cottage.

Hurlbert, S. H., & Lombardi, C. M. (2009). Final collapse of the Neyman-Pearson decision theoretic framework and rise of the neoFisherian Annales Zoologici Fennici, 46, 311-349

February 12, 2010

Low Down Poor Abused Data Blues

Filed under: silly — jebyrnes @ 10:11 am

I’m having one of those days. One of those “I have a meeting that will finalize the analyses for a paper, let’s just check that ONE thing that didn’t matter in early versions of this model, so, it shouldn’t matter now, and OMGWTFBBQ everything changed!” kinds of days. Well, OK, not everything – just the new piece of the story that intrigued me the most because it was so delightfully intuitive. It’s made me feel a little blue. Well, a lot blue. So, fingers flying across the keyboard to fit new models, I flipped on some Muddy Waters. At the same time Miriam G suggested the Low Down Poor Abused Data Blues. While I’m no Robert Johnson, Muddy Waters, or BB King, clearly, this must be done. I’ve got a first verse. I’m welcome to contributions:

This is me. Right now. Except replace the guitar with a laptop furiously churning through R code.

Low Down Poor Abused Data Blues

Oh my data’s got autocorrelation,
eats up my treatment effect.

Oh my data’s got autocorrelation,
eats up my treatment effect.

But the decrease in sample size, it makes my analysis such a wreck.

Older Posts »

Powered by WordPress