An improbable informatician
A black screen with a tiny blinking cursor. Silence except for the quiet whir of a room full of computers. No Microsoft Windows here. What have I got myself into?

Where are the LOLcatz??

This was the world I found myself in when, in total ignorance, I mentioned to the Undergrad Research Opportunities team that I was ‘interested in bioinformatics’ (sounds cool and technical huh?). At this point my computing prowess amounted to some vague notions about Excel, and familiarity with the word ‘bioinformatics’ from first year biology lectures.

The job involved processing brain MRI scans to test for differences related to prescription drugs.

At my side was a coffee stained Introduction to Unix.
“Create a file called list1 containing the following fruit: orange, plum, mango, grapefruit…”
How this was meant to help me get the job done was less than clear.
A long 18 months followed, but thanks to my excellent (read ‘patient’) supervisors, I eventually got my head around Unix.
The next year it was back to good old biochemistry. The stuff you can see down a microscope, knock a gene out of, or get a horrific infection from:

But as I recovered from my honours project, in my fever dreams the command line was calling, even commanding me back.
The efficiency of a loop… the power of combining lots of different data to answer an interesting question… the joy of heading to the pub while it does your work for you… I was hooked!!
…but still fairly useless at coding.
I spent about a year mashing Unix and Excel together to work through a genomics project. All the while, I was reading papers in journals like PLoS Computational Biology, and wondering how they made those awesome figures?
from Goncalves et al., 2017 DOI:10.1371/journal.pcbi.1005297
The answer, I eventually discovered, was R. The free, powerful, widely used and completely hackable salve to any scientist’s Excel-induced rage.

This coding language really comes into its own with Rstudio - where you can write code and see the graphical results instantly on your desktop.
I enrolled in a PhD in infectious disease biology, and decided to get SRS about R.
I scoured the user guides for the software I wanted to use.
I chopped up code and put it back together to try to isolate the important parts.
I was a Guinea pig for the DataSociety R course and learned a lot through their materials.
I replaced online examples with my own data, searched Google relentlessly, patched up and saved anything that worked, until my figures started to look presentable.
The more I practiced, the more I found (overly-nerdy) uses for R.
Go-karting lap times….that would work as a box plot! Snow depth this winter? Line chart. Journal publishing cost by impact factor? Scatter plot. The interactive plot functions are especially cool:
https://plot.ly/~plot_ranger/13.embed
In all, this is the tool to help you find the stories in your data (and make them look great), with maximal reproducibility, and progressively less effort.
If you’re sick of scouring through Excel sheets, Prism is getting too fiddly, and VLOOKUP is losing its charm, come to ResBaz and try R. You won’t Rgret it.
To discuss any R problems, contact me at rp.brendan@gmail.com, or through twitter @ansellbr3

















































