Course Outline
Day 1
Introduction and preliminaries
- Enhancing R's usability and available graphical user interfaces (GUIs)
- RStudio
- Complementary software and documentation
- R in the context of statistics
- Interactive use of R
- Introductory overview session
- Obtaining help for functions and features
- R commands, case sensitivity, and conventions
- Recalling and correcting previous commands
- Executing commands from files or redirecting output
- Managing data persistence and removing objects
Simple manipulations; numbers and vectors
- Vectors and assignment
- Vector arithmetic
- Generating regular sequences
- Logical vectors
- Handling missing values
- Character vectors
- Index vectors; selecting and modifying subsets of a data set
- Other object types
Objects, their modes and attributes
- Intrinsic attributes: mode and length
- Modifying the length of an object
- Retrieving and setting attributes
- Object classes
Ordered and unordered factors
- Specific examples
- The tapply() function and ragged arrays
- Ordered factors
Arrays and matrices
- Arrays
- Array indexing and subsections
- Index matrices
- The array() function
- Mixed vector and array arithmetic: The recycling rule
- Outer product of two arrays
- Generalized transpose of an array
- Matrix facilities
- Matrix multiplication
- Linear equations and inversion
- Eigenvalues and eigenvectors
- Singular value decomposition and determinants
- Least squares fitting and QR decomposition
- Constructing partitioned matrices using cbind() and rbind()
- Concatenation with arrays
- Generating frequency tables from factors
Day 2
Lists and data frames
- Lists
- Constructing and modifying lists
- Concatenating lists
- Data frames
- Creating data frames
- Using attach() and detach()
- Working with data frames
- Attaching arbitrary lists
- Managing the search path
Data manipulation
- Selecting, subsetting observations and variables
- Filtering and grouping
- Recoding and transformations
- Aggregation and combining data sets
- Character manipulation using the stringr package
Reading data
- Text files
- CSV files
- XLS and XLSX files
- SPSS, SAS, Stata, and other data formats
- Exporting data to txt, csv, and other formats
- Accessing data from databases using SQL
Probability distributions
- R as a repository of statistical tables
- Examining data distribution
- One- and two-sample tests
Grouping, loops and conditional execution
- Grouped expressions
- Control statements
- Conditional execution: if statements
- Repetitive execution: for loops, repeat, and while
Day 3
Writing your own functions
- Simple examples
- Defining new binary operators
- Named arguments and defaults
- The '...' argument
- Assignments within functions
- Advanced examples
- Efficiency factors in block designs
- Dropping all names in a printed array
- Recursive numerical integration
- Scope
- Customizing the environment
- Classes, generic functions, and object orientation
Statistical analysis in R
- Linear regression models
- Generic functions for extracting model information
- Updating fitted models
- Generalized linear models
- Families
- The glm() function
- Classification
- Logistic Regression
- Linear Discriminant Analysis
- Unsupervised learning
- Principal Components Analysis
- Clustering Methods (k-means, hierarchical clustering, k-medoids)
- Survival analysis
- Survival objects in R
- Kaplan-Meier estimate
- Confidence bands
- Cox PH models with constant covariates
- Cox PH models with time-dependent covariates
Graphical procedures
- High-level plotting commands
- The plot() function
- Displaying multivariate data
- Display graphics
- Arguments to high-level plotting functions
- Basic visualization graphs
- Visualizing multivariate relations with lattice and ggplot packages
- Using graphics parameters
- Graphics parameters list
Automated and interactive reporting
- Integrating R output with text
Creating HTML and PDF documents
Testimonials (6)
At the end of the class, we had a great overview of the language, we were provided tools to continue learning and were provided suggestions on how to continue learning. We covered AI/ML information.
Victor Prado - Global Knowledge Network Training Ltd
Course - R
The R-programming overview training is quite intensive but Tomasz is always helpful, energetic and up to date. On top of it, he is passionate about R. I would highly recommend his R sessions to anyone interested in R.
Luiza Panoschi - Global Knowledge Network Training Ltd
Course - R
Practice exercises were relevant and very helpful to reinforce the knowledge.
Andy Kwan - Environment and Climate Change Canada
Course - R
Follow-along exercises after slide presentation kept engagement.
Robin White - Environment and Climate Change Canada
Course - R
Michael was very knowledgeable and clear in his instruction of the training. Course was well structured to teach the desired subject as well as the right amount of room was left to adjust to fit our needs better. Over all, I am very happy with the course.
Brock Batey - Environment and Climate Change Canada
Course - R
I really enjoyed the knowledge of the trainer.