Wordcloud for National Day Rally 2014 speech

< 1 min. readInspired by a wordcloud from Obama’s State of the Union address, let’s look at creating a wordcloud for Singapore’s National Day Rally 2014 speech using R. library(rvest) library(RCurl) ## Loading required package: bitops url <- "http://www.pmo.gov.sg/mediacentre/prime-minister-lee-hsien-loongs-national-day-rally-2014-speech-english" # scrapes the speech from the URL above curlSpeech<- getURL(url) speech <- curlSpeech %>% html() %>% html_nodes(".view-mode-full") %>% html_text() […]

A virtual environment for data science

< 1 min. readI wanted to conveniently use data science tools without the hassle of installing the required languages and packages, while benefiting from the strengths of the Linux command line tools. There is a pre-packaged VM called the Data Science Toolbox that fills this need. It comes with R and Python installed, along with the respective popular […]

Data Science Ontology

< 1 min. readThis site created by Sean McClure, data scientist at ThoughtWorks, shows an overview of data science concepts. I find this structured approach very useful as a gauge to discover areas of improvement. It also serves to provide more information via Wikipedia links at the terminal nodes.

Calculus Cheat Cheets

< 1 min. readOn the journey of Data Science, you will probably come across Calculus. For students and working professionals alike, the availability of cheat sheets make lives easier. I’m happy and grateful to find out that the Calculus cheat sheets I’ve referred to during my university days are still available on the net, thanks to Paul Dawkins […]

Video tutorial on dplyr

< 1 min. readFor data manipulation in R, you will probably have heard of the plyr package. Dplyr is plyr’s next iteration by Hadley Wickham. It offers greater ease of use and speed over plyr and I find it more intuitive to use. The above video by Kevin Markham is an excellent introduction to dplyr. Enjoy! (Via Revolution […]

Deep Learning introduction – Andrew Ng

< 1 min. readWhile media are already reporting how Deep Learning – machine learning through the use of neural network techniques – can contribute to increased ad profits, it would be great to know more on the background. Here’s an introductory video on Deep Learning by Andrew Ng at the RSS (Robotics: Science and System) Conference 2014.