R Weekly 2018-6 R Consortium Proposals, Data Day Texas
Highlight
-
Introducing Maëlle Salmon, rOpenSci’s new Research Software Engineer
-
R-Podcast Episode 23 -Interviews with Romain Francois and Thomas Lin Pedersen
Insights
-
The 2018 R Consortium R User Group Support Program is Underway.
-
Introducing Maëlle Salmon, rOpenSci’s new Research Software Engineer
-
Notes - Things They Forgot to Teach You In R, rstudio::conf18
R in the Real World
R in Organizations
New Packages
CRAN
errorist - Automatically Search Error and Warning Messages
searcher - Query Search Interfaces
-
dirichletprocess - building flexible Dirichlet process objects to model data in a nonparametric Bayesian framework.
-
TSrepr - time series representations computing
GitHub only
-
pmatch - Haskell- and ML-like pattern matching for R
-
geniusR - Easily access song lyrics from Genius
-
tabr - Create tidyverse-friendly tables of frequencies, inspired by Stata
Package Releases
-
profmem 0.5.0- Simple Memory Profiling for R, e.g.
profmem({ example("lm") })
. Now with support for nested profiling as well as suspending and resuming active profiling sessions. -
RVowpalWabbit 0.0.12 - R with Vowpal Wabbit fast out-of-core learning system
-
digest 0.6.15 - creates hash digests of arbitrary R objects
-
sparklyr 0.7 - R Interface to Apache Spark
Videos and Podcasts
-
NSSD NO.53: Shaken, Not Stirred:Hilary and Roger discuss the connection between intelligence analysts and data scientists, and Hilary helps Roger prepare for an upcoming talk.
-
R-Podcast Episode 23 -Interviews with Romain Francois and Thomas Lin Pedersen - Straight from
rstudio::conf
2018 Eric speaks with Romain Francois and Thomas Lin Pedersen. You’ll hear Romain’s thoughts on the growth of Rcpp and the project that helped him become closer to the R community, as well as Thomas’ journey to enhancingggplot2
and the new packages he’s developed covering network analyses and dynamic APIs from R.
Resources
Tutorials
-
Scraping Wikipedia Tables from Lists for Visualising 2000 Years of Changing Conflict Dynamics
-
Using ggplot2 to create basic visualizations, including a pie chart
-
Analysis of Strava and Garmin running data for my 2017 Philadelphia Marathon training
-
Visualising intersecting sets of twitter followers with rtweet and UpSetR
-
Usage of TSrepr package for time series preprocessing and dimensionality reduction.
-
A tutorial post on building a drag and drop data input interface using shiny and R
-
Have you ever asked yourself, “how should I approach the classic pre-post analysis?”
-
Information Gain From Using Ordinal Instead of Binary Outcomes
-
Data Wrangling Part 1: Basic to Advanced Ways to Select Columns
-
Mining Census Data for Historical Context – Fairfield County, Connecticut in 1920
-
Average spend, activities and length of visit in the NZ International Visitor Survey
-
Data Wrangling Part 2: Transforming your columns into the right shape
-
Using RSelenium and Docker To Webscrape In R - Using The WHO Snake Database
Gist & Cookbook
R Project Updates
Updates from R Core:
-
configure
will usepkg-config
to find the flags to link tojpeg
if available (as it should be for the recently-releasedjpeg-9c
andlibjpeg-turbo
). (This amends the code added in R 3.3.0 as the module name injpeg-9c
is not what that tested for.) -
The
duplicated()
method for data frames is now based onlist
s (instead of string coercion). Consequentlyunique()
is better distinguishing data frame rows, fixing PR#17369 and PR#17381. -
Calling
names()
on an S4 object derived from"environment"
behaves (by default) like callingnames()
on an ordinary environment. -
The environment variable
R_MAX_VZISE
can now be used to specify the maximal vector heap size. On macOS, unless specified by this environment variable, the maximal vector heap size is set to the maximum on 16GB and the available physical memory. This is to avoid having theR
process killed when macOS over-commits memory.
Upcoming Events
-
satRday Cape Town 2018 March 17
satRday Cape Town -
R/Finance 2018 June 1 and 2
Applied Finance with R. -
CascadiaRConf June 2, 2018 Portland, OR, US
-
7eme Rencontres R 5 & 6 July 2018
Rennes - Agrocampus -
useR! 2018 July 10, 2018
The annual useR! conference is the main meeting of the international R user and developer community.
More past events at R conferences & meetups.
Jobs
Call for Participation
Quotes of the Week
We are a new chapter of @RLadiesGlobal and we're interested to hear what the remote #rstats community would like to see in a remote chapter!
— RLadies Remote (@RLadiesRemote) 30 de gener de 2018
Please fill out this form if you are interested in being involved with our chapter!#RLadies https://t.co/rIaPY48aCy
Nerd tweet: this chart/project is why moving to #Rstats from tools like Excel and one-time web scrapers is a game-changer.
— John Burn-Murdoch (@jburnmurdoch) January 28, 2018
Was able to simply hit "run" on a script I wrote *18 months ago*, and then sit back and watch the actual tennis while data streams in & charts update 🤓🎾🤖 https://t.co/WSG4V8bXhL
Data scientists: what is the most underrated / undervalued skill for a new data scientist?
— Caitlin Hudon👩🏼💻 (@beeonaposy) 29 de gener de 2018