About Andrew

I'm a health librarian in Sydney, Australia, who also happens to be a geocacher.

A scattergories meme

I usually save the memes for a day when I’m struggling to think of what to write, but I’m going to use one for today’s post. The rule is that each answer must start with the last letter of your previous answer. Here’s my contribution:

Name – Andrew
Animal – Wolf
Girl’s name – Fiona
Colour – Aubergine
Movie – Empire Strikes Back
Something you wear – Kilt
Drink – Tequila
Food – Apricot
Item in the bathroom – Toothbrush
Place – Honduras
Reason to be late – Slept in

Well, that gives me a 100% posting rate for #blogjune so far. Don’t think it will last though.

Another year, another #blogjune

Well, June has rolled around again, which means it’s time for the annual #blogjune blogfest. There’s a list on Twitter if you want to keep up with all the posts. This will be the fifth year that I’ve participated, and I’ve managed to post every day in June for the last four years. However, I can’t see myself posting something every day this year. I’ve got a plan for a week where there will be daily posts, but otherwise I’ll be posting when I get the chance (and inclination).

As with last year, I won’t be the only member of my family taking part in #blogjune. My wife will be working on a modified version called #doodlejune, and our sons will be posting to their blog with their dinner reviews. Hopefully they’ll do a better job of posting every day than I will.

Twitter activity at the MLA conference

The Medical Library Association (MLA) has just held its annual conference in Toronto, Canada. Being based in Australia means that it’s hard for me to attend, but the Twitter stream is a very useful way of keeping up with what’s going on at the conference. I thought I would use it as a test of using the R programming language to conduct some very basic Social Network Analysis (SNA) on the Twitter stream of the conference. See my previous post for background on my interest in R and SNA.

I used the twitteR package for R to retrieve all the tweets with the hashtag #mlanet16 which had been sent between 13th and 18th May (there were 9,985). Next, I used the graphTweets package to turn these tweets into a data frame containing only the Twitter accounts that had been included in a mention (i.e. an @ message) and/or had sent a retweet. This data frame was then converted into a graphml file, which I opened in Gephi, a free data visualisation tool. If you’re interested, the code I used was:

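# Load the packages: twitteR fetches the tweets, graphTweets builds
# the edge list, and igraph builds and exports the graph
library(twitteR)
library(igraph)
library(graphTweets)
# Authenticate with the Twitter API (replace the placeholders with your own credentials)
setup_twitter_oauth("API key", "API secret", "access token", "access secret")
# Fetch up to 15,000 English-language tweets with the conference hashtag
tweets <- searchTwitter("#mlanet16", n=15000, lang="en", since="2016-05-13", until="2016-05-18")
# Convert the list of tweets into a data frame
tw_df <- twListToDF(tweets)
# Build the edges (sender -> mentioned account) and the matching nodes
edges <- getEdges(data = tw_df, tweets = "text", source = "screenName", "retweetCount", "favorited", str.length = 20)
nodes <- getNodes(edges)
# Create a directed graph and write it out as GraphML for Gephi
g <- graph.data.frame(edges, directed = TRUE, vertices = nodes)
write.graph(g, "F:/mlagraph.graphml", format="graphml")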

The raw, unfiltered data looks like this:

MLA complete

Each node represents an individual Twitter account (I’ve left them unlabelled in order not to identify anyone). This is a bit messy and hard to read, so I filtered the data to make the graph easier to interpret. The graph below shows the top nodes based on their “out-degree”, with the larger nodes having a larger out-degree. Out-degree is a measure of the influence of a node, i.e. how many outward ties it has to other nodes (here, the mentions and retweets an account sent).

MLA out-degree

Another filter I applied was in-degree, which is a measure of the number of inward connections that each node has. Nodes with a high in-degree have a high prestige, as other nodes try to connect with them. The in-degree graph looks like this:

MLA in-degree
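
For anyone who would rather stay in R, the two degree measures can also be computed with igraph. This is only a minimal sketch and not how the graphs above were produced: the account names and the toy edge list are made up for illustration, and the column names source and target are assumptions.

```r
library(igraph)

# Hypothetical edge list: each row is one mention or retweet,
# pointing from the account that sent it to the account it names.
edges <- data.frame(
  source = c("alice", "alice", "alice", "bob", "carol"),
  target = c("bob", "carol", "dave", "dave", "dave"),
  stringsAsFactors = FALSE
)

# Build a directed graph from the edge list
g <- graph_from_data_frame(edges, directed = TRUE)

# Out-degree: number of ties leaving each node
out_deg <- degree(g, mode = "out")
# In-degree: number of ties arriving at each node
in_deg <- degree(g, mode = "in")

# Rank the accounts by each measure, highest first
sort(out_deg, decreasing = TRUE)
sort(in_deg, decreasing = TRUE)
```

In this toy network alice has the highest out-degree (three outgoing ties) and dave has the highest in-degree (three incoming ties).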

There is a range of other measures that can be used to filter the data, so I’ll have a play around a bit more. I certainly wouldn’t call myself an expert in R or social network analysis after doing this, but it has been a great introduction to what R can do.

Tinkering with R and Social Network Analysis

My interest in Social Network Analysis (SNA) began when I was studying the Data, Analytics, and Learning MOOC (DALMOOC) through edX a couple of years ago (see my posts from during the course here). During the course it was mentioned that Twitter lends itself to SNA, so I did some fiddling around with analysing the Twitter streams of various library conferences. I used some of the tools that I was introduced to during the DALMOOC, such as Gephi and NodeXL, and managed to produce some graphs. However, I put this on the backburner while I focussed on preparing my poster for the EBLIP8 Conference.

Earlier this year, though, I got the urge to start learning more about the R programming language. Although I have absolutely no background in coding or programming (unless you count copying BASIC programs out of a book for my Commodore 128 when I was a kid), I’d heard about the R programming language, and wanted to find out a bit more about it. I came across the free Datacamp course on R and did the first few lessons, but haven’t worked on it for a while now. I started looking around to see if there were any R packages that could do SNA on Twitter data, and I found that there were a few that I could use. There were websites which had some example code which I was able to copy and do some tweaking on (such as this one and this one), and before long I was collecting and analysing my own data.

I still wouldn’t call myself a coder or programmer, but I’m starting to get the hang of using R. It’s pretty easy to use, especially when you’re using code that is freely available and not having to develop your own. In my next post I’ll show some examples of SNA that I prepared based on the tweets sent at the 2016 Medical Library Association conference.

Finally at 400!

Today was the day that we (the geocachers known as Sensible Shoes) finally reached 400 finds. It’s taken us 8 1/2 years to get there, but get there we did. We’ve been saying for the last couple of years that “this is the year we get to 400”, but it hasn’t happened until today.

We were at 395 this morning, and found three more before lunch to get us to 398. All the caches we found were either in or very close to playgrounds, so the boys had a good time too, riding their scooters and playing on the play equipment. We even got to climb “Bluluru”!

Seeing that today was my last day of holidays before returning to work, I thought it would be nice to reach our 400 milestone today. So after lunch we headed out and got the remaining two to get us to 400. The 400th find was near a park that the boys love going to, but unfortunately their favourite piece of play equipment has been fenced off (presumably for repairs), so we had to make do with the flying fox. After a couple of turns each we headed off to celebrate our achievements, and as we were walking to the car the rain began and only got heavier once we were in the car. Perfect timing!

Now that we’ve finally made it to 400, our next goal is to reach 450 by our nine-year cacheversary in May.

So what do the numbers say?

As usual, my final post for #blogjune is going to look at my blog stats for the past month to see if anyone’s actually read any of my posts. The data will come from a couple of sources – the stats plugin on the blog, my web host, and Google Analytics. Hopefully they’ll be pretty similar.

Firstly, the most-viewed post was So what does a health librarian need to know, followed by So what’s changed since #blogjune last year. These were posts from earlier in the month, so I guess it’s not surprising that they’re at the top. It seems that either Wednesday or Thursday was the most popular day for people to visit the blog (depending on which source you use). The various sources also differed on where these visitors were from – one had the top three countries for visitors being Australia, the US and New Zealand, while another listed them as the US, Australia and France, while the final one had Australia, Bangladesh and Germany as the top three.

Continuing the trend from the last couple of years, Firefox was the most popular browser, followed by Chrome, then Internet Explorer. The mobile browser (Android and iOS) percentage has continued to increase – about 6% this year compared to about 4% last year. Also following the trend from last year is a drop in the total number of views for the month compared to June last year.

So that’s it for another year of #blogjune. I’ll probably be back next year, hopefully with a bit more of a plan and focus for what I’m going to write about.

What I got out of #blogjune this year

For me the main theme that came out of this year’s #blogjune was the future of the Australian biblioblogosphere. There were some posts looking back to try and discover the “golden age” of library blogging (Hugh, Kate), and then others trying to figure out what the future may hold for library blogging in Australia (Kate, Stephanie). I don’t really have much to add to these; I agree that there did seem to be a period where blogging was more prevalent than it is now, and I think that there is still a place for blogging in the professional discourse of the Australian LIS industry. It looks like the League of Librarians (or some sort of collaborative blogging venture) is ready to go!

Some of the discussion early this month regarding impostor syndrome also hit home for me. I think I was (perhaps still am?) guilty of thinking that because I have a blog I need to be posting about the “big issues” in the profession. I related to Sally’s post where she says:

Maybe I’m just not cut out to write certain types of posts. It’s mostly the strong opinion, kind-of-ranty posts that I struggle with. I don’t really understand why I feel driven to publish them at all, when I find them so difficult. Do I feel like it’s expected, in order to be taken seriously?

Sometimes I do try and tackle the big issues, but rarely in great depth or as a way of starting a conversation – I’m usually responding to someone else’s brilliantly written post on the subject. I toyed briefly with the idea of setting up a separate blog which I would use for “professional” content and leave this one for the personal stuff. But then I thought of a different approach, and it’s one that I’m going to try to stick to from now on. I’m going to try to comment on blogs more often, instead of writing a short post on the topic. Every now and then I might need to write a post if I have a lot to say, but mostly I’ll be keeping the discussion on the original post.

I have found it harder to come up with posts for #blogjune this year than in previous years, but I’ll get there. A lot of the posts were a result of having to write something instead of wanting to write something. Tomorrow’s post will be my usual wrap-up of the statistics for the blog during June. It will be interesting to see how this year compares to previous years.

A night at the drive-in

We’ve just got home after going to the drive-in to see Minions. It was a bit chillier than when we saw The Lego Movie last year, but we were prepared with our parkas, beanies and blankets. Compared to the last time we went, there were certainly fewer people sitting outside like we were – most of them stayed in their cars. The boys loved the movie (as we’d expected) and I thought it was OK but not great. We had planned to head to the Australian Museum today to take advantage of their “Free for all” weekend, but Tom has been sick the last couple of days, so we decided to take it easy at home instead. I like going to the drive-in – I didn’t go when I was a kid, and it’s nice to take the boys so they can experience it. It’s cheaper than going to the cinema too – it only costs us $25 for the four of us, and we take our own food. I’m sure this won’t be the last time we go there this year.

Getting back into text correcting on Trove

I’ve mentioned previously that I’ve done some text correcting of the digitised newspapers on Trove. It’s been a year since I’ve done any though, and I got back into it with a modest 49 lines today. I’ve decided that my new focus will be on the articles which mention the original Children’s Hospital in Sydney, the Royal Alexandra Hospital for Children, which was located at Camperdown. We’re celebrating 20 years this year since the hospital moved to Westmead, so I thought it would be timely to work on these articles. I’ve done the articles from the 1960s (there were only two) and will move backwards until I get to the 1900s. I’m hoping that these articles won’t be as repetitive as the ones I was working on before (on Victoria Cross winners), and that I can learn a bit about the history of the hospital, too.

Buffy the Vampire Slayer in the medical literature

As requested by Teresa, the subject for this week’s search for pop culture icons in the medical literature is Buffy the Vampire Slayer. It appears that there is a gap in the literature on this topic, as I couldn’t find many articles at all. The ones I did find weren’t strictly medical in their focus, but were from the psychology/sociology/cognitive science fields. The one that deals with Buffy most directly is “Buffy the Vampire Slayer and Xena: Warrior Princess: reception of the texts by a sample of lesbian fans and web site users” (it looks like the article is freely available). The other couple that I thought were most relevant didn’t feature Buffy as the main topic of the paper, but did use the show as part of their methodology. The first one uses scenes from Buffy as part of an experiment on gender recognition. The second mentions Buffy as an example of a TV show from the late 1990s/early 2000s which introduced a gay or lesbian character into the cast.

I’ve enjoyed these searches over the last couple of weeks, and I may continue this theme for an occasional Friday post, but probably not on a regular basis. I’m always happy to take suggestions for shows/characters/movies to use as a subject.