# R 101

• Course Number: RP0101EN
• Classes Start: Any time, self-paced
• Estimated Effort: 3 hours
• Audience
• Course Level
• Language

R is a powerful language for data analysis, data visualization, machine learning, and statistics. Originally developed for statistical programming, it is now one of the most popular languages in data science. In this course, you'll learn the basics of R and finish with the confidence to start writing your own R scripts.

But this isn't your typical textbook introduction to R. You won't just learn R fundamentals; you'll use R to solve problems related to movie data, and working with a concrete example makes the learning painless. You will learn the fundamentals of R syntax, including assigning variables and doing simple operations with one of R's most important data structures: vectors. From vectors, you'll move on to lists, matrices, arrays, and data frames. Then you'll jump into conditional statements, functions, classes, and debugging. Once you've covered the basics, you'll learn about reading and writing data in R, whether it's in a table format (CSV, Excel) or a text file (.txt). Finally, you'll end with some important functions for character strings and dates in R.
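To give a flavor of the kind of code the course builds up to, here is a minimal sketch of assigning variables and working with vectors. The movie titles and runtimes are invented for illustration and are not taken from the course's actual dataset; the variable names are likewise hypothetical.

```r
# Hypothetical movie data, invented for illustration only.
movie_titles <- c("Movie A", "Movie B", "Movie C")
runtime_min <- c(142, 95, 121)

# Arithmetic on a vector is applied element by element.
runtime_hours <- runtime_min / 60

# Logical indexing keeps the titles whose runtime exceeds two hours.
long_movies <- movie_titles[runtime_min > 120]

# A data frame stores equal-length vectors as named columns.
movies <- data.frame(title = movie_titles, runtime_min = runtime_min)

print(mean(runtime_min))  # average runtime in minutes
print(long_movies)        # titles longer than two hours
print(nrow(movies))       # number of rows in the data frame
```

The same pattern of building vectors, transforming them, and collecting them into a data frame carries over to the data structures covered in Module 2.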

## COURSE SYLLABUS

### Module 1 - R basics

• Math, Variables, and Strings
• Vectors and Factors
• Vector operations

### Module 2 - Data structures in R

• Arrays & Matrices
• Lists
• Data frames

### Module 3 - R programming fundamentals

• Conditions and loops
• Functions in R
• Objects and Classes
• Debugging

### Module 4 - Working with data in R

• Reading CSV and Excel Files
• Writing and saving data objects to file in R

### Module 5 - Strings and Dates in R

• String operations in R
• Regular Expressions
• Dates in R

## GENERAL INFORMATION

• This course is free.
• It is self-paced.
• It can be taken at any time.
• It can be audited as many times as you wish.
• This course makes heavy use of hands-on labs. Everything required to do these labs is provided with the course. The only things you need are a modern web browser and access to the internet.

• None

• None

## COURSE STAFF

### Saeed Aghabozorgi

Saeed Aghabozorgi, PhD, is a Data Scientist at IBM with a track record of developing enterprise-level applications that substantially increase clients’ ability to turn data into actionable knowledge. He is a researcher in the data mining field and an expert in developing advanced analytic methods, such as machine learning and statistical modelling, on large datasets.

### Polong Lin

Polong Lin is a Data Scientist at IBM in Canada. Under the Emerging Technologies division, Polong is responsible for educating the next generation of data scientists through Cognitive Class. Polong is a regular speaker at conferences and meetups, and holds an M.Sc. in Cognitive Psychology.

### Cognitive Class Course Development Team

Thanks to the Cognitive Class course development team, Cognitive Class interns, and all individuals who contributed to the development of this course:
João Henrique Rezende,
Helly Patel,
Mandeep Kaur,
Hiten Patel,
Marta Aghili,
Anita Vincent,
Iqbal Singh,
Rishabh Jain,