Improve your experience. We are very sorry but this website does not support Internet Explorer. We recommend using a different browser that is supported such as Google Chrome or Mozilla Firefox.

R Programming for Data Science (v1.0)

This course will teach you the fundamentals of programming in R to get you started. It will also teach you how to use R to perform common data science tasks and achieve data-driven results for the business.

Description

Overview

In our data-driven world, organizations need the right tools to extract valuable insights from that data. The R programming language is one of the tools at the forefront of data science. Its robust set of packages and statistical functions makes it a powerful choice for analyzing data, manipulating data, performing statistical tests on data, and creating predictive models from data. Likewise, R is notable for its strong data visualization tools, enabling you to create high-quality graphs and plots that are incredibly customizable.

Course Objectives

In this course, you will use R to perform common data science tasks. You will:

  • Set up an R development environment and execute simple code
  • Perform operations on atomic data types in R, including characters, numbers, and logicals
  • Perform operations on data structures in R, including vectors, lists, and data frames
  • Write conditional statements and loops
  • Structure code for reuse with functions and packages
  • Manage data by loading and saving datasets, manipulating data frames, and more
  • Analyze data through exploratory analysis, statistical analysis, and more
  • Create and format data visualizations using base R and ggplot2
  • Create simple statistical models from data

Who Should Attend

This course is designed for students who want to learn the R programming language, particularly students who want to leverage R for data analysis and data science tasks in their organization. The course is also designed for students with an interest in applying statistics to real-world problems. A typical student in this course should have several years of experience with computing technology, along with a proficiency in at least one other programming language.

Course Outline

Module 1: Setting Up R and Executing Simple Code

  • Set Up the R Development Environment
  • Write R Statements

Module 2: Processing Atomic Data Types

  • Process Characters
  • Process Numbers
  • Process Logicals

Module 3: Processing Data Structures

  • Process Vectors
  • Process Factors
  • Process Data Frames
  • Subset Data Structures

Module 4: Writing Conditional Statements and Loops

  • Write Conditional Statements
  • Write Loops

Module 5: Structuring Code for Reuse

  • Define and Call Functions
  • Apply Loop Functions
  • Manage R Packages

Module 6: Managing Data in R

  • Load Data
  • Save Data
  • Manipulate Data Frames Using Base R
  • Manipulate Data Frames Using dplyr
  • Handle Dates and Times

Module 7: Analyzing Data in R

  • Examine Data
  • Explore the Underlying Distribution of Data
  • Identify Missing Values

Module 8: Visualizing Data in R

  • Plot Data Using Base R Functions
  • Plot Data Using ggplot2
  • Format Plots in ggplot2
  • Create Combination Plots

Module 9: Modeling Data in R

  • Create Statistical Models in R
  • Create Machine Learning Models in R

Prerequisites

To ensure your success in this course, you should be comfortable with basic computer programming concepts, including but not limited to: syntax, data types, conditional statements, loops, and functions. You can obtain this level of skills and knowledge by taking the Introduction to Programming with Python® course. You should also have at least a high-level understanding of fundamental data science concepts, including but not limited to: data engineering, data analysis, data storage, data visualization, and statistics.

Similar courses

Introduction to DAX for Power BI

Using Data Analysis Expressions to solve common business problems in Power BI

More Information
Microsoft Power BI: Data Analysis Practitioner

Analyze business data, visualize insights, and share those insights across the enterprise

More Information
Tableau Part 2

In this course, you will perform advanced data visualization and data blending with Tableau.

More Information
55339: Programming in C#

In this course, students will review the basics of C# program structure, language syntax, and implementation details, and then consolidate their knowledge throughout the week as they build an application that incorporates several features of .NET. The course aims to follow the spirit of the Microsoft Official Curriculum course 20483, while bringing it completely up-to-date with the latest features of C#, .NET 6.0 and Visual Studio 2022.

More Information
55337: Introduction to Programming

In this course you will, explain core programming fundamentals such as computer storage and processing, create and use variables and constants in programs, discuss how to create and use functions in a program, use decisions structures in a computer program, create and use repetition (loops) in a computer program, explain pseudocode and its role in programming, implement object-oriented programming concepts, and identify application errors and explain how to debug an application and handle errors.

More Information
Introduction to R for Data Analysis

R is a functional programming environment for business analysts and data scientists. It's a language that many non-programmers can easily work with, naturally extending a skill set that is common to high-end Excel users. It's the perfect tool for when the analyst has a statistical, numerical, or probabilities-based problem based on real data and they've pushed Excel past its limits.

More Information
AngularJS Training: AngularJS Programming

In this course, you will create single page web applications using the MVC pattern of AngularJS, understand the programming model provided by the AngularJS framework, define Angular controllers and directives, and control Angular data bindings.

More Information
Comprehensive Angular 12 Programming

In this course, you will develop single page Angular applications using Typescript, set up a complete Angular development environment, create components, directives, services, pipes, forms and custom validators, handle advanced network data retrieval tasks using observables, consume data from REST web services using the Angular HTTP Client, handle push-data connections using the WebSockets protocol, work with Angular Pipes to format data, and use advanced Angular Component Router features.

More Information
Introduction to Programming with Python®

This course is designed for people who want to learn the Python programming language in preparation for using Python to develop software for a wide range of applications, such as data science, machine learning, artificial intelligence, and web development.

More Information
Data Wrangling with Python

This course teaches concepts by deep-dive on-hand exercises. Throughout the course, you will learn data wrangling with hands-on exercises and activities. You’ll find checklists, best practices, and critical points mentioned throughout the lessons, making things more interesting.

More Information
Building Data Analytics Solutions using Amazon Redshift

In this course, you will build a data analytics solution using Amazon Redshift, a cloud data warehouse service.

More Information
Data Warehousing on AWS

In this course, you will learn new concepts, strategies, and best practices for designing a cloud-based data warehousing solution using Amazon Redshift, the petabyte-scale data warehouse in AWS.

More Information
Building Data Lakes on AWS

In this course, you will apply data lake methodologies in planning and designing a data lake, articulate the components and services required for building an AWS data lake, secure a data lake with appropriate permission, ingest, store, and transform data in a data lake and query, analyze, and visualize data within a data lake.

More Information
Building Batch Data Analytics Solutions on AWS

In this course, you will learn to build batch data analytics solutions using Amazon EMR, an enterprise-grade Apache Spark and Apache Hadoop managed service.

More Information
55315: Introduction to SQL Databases

In this course you will describe key database concepts in the context of SQL Server, characterize database languages used in SQL Server, describe data modeling techniques, discuss normalization and denormalization techniques, distinguish relationship types and effects in database design, describe the effects of database design on performance, and define commonly used database objects.

More Information
55366: Querying Data with Transact-SQL

In this course, you will create single table SELECT queries, create multiple table SELECT queries, insert, update, and delete data, query data using built-in functions, create queries that aggregate data, create subqueries, create queries that use table expressions, use UNION, INTERSECT, and EXCEPT on multiple sets of data, implement window functions in queries, use PIVOT and GROUPING SETS in queries, use stored procedures in queries, add error handling to queries, and use transactions in queries.

More Information
55321: SQL Server Integration Services

In this course you will, create sophisticated SSIS packages for extracting, transforming, and loading data, use containers to efficiently control repetitive tasks and transactions, configure packages to dynamically adapt to environment changes, use Data Quality Services to cleanse data, successfully troubleshoot packages, create and manage the SSIS Catalog, deploy, configure, and schedule packages, secure the SSIS Catalog.

More Information
Advanced Programming Techniques with Python (v1.11)

In this course, you will expand your Python proficiencies, select an object-oriented programming approach for Python applications, create object-oriented Python applications, create a desktop application, create data-driven applications, create and secure web service-connected applications, program Python for data science, implement unit testing and exception handling, and package an application for distribution.

More Information
55316: Administering a SQL Database

In this course you will authenticate and authorize users, assign server and database roles, authorize users to access resources, use encryption and auditing features to protect data, describe recovery models and backup strategies, backup and restore SQL Server databases, automate database management, configure security for the SQL Server agent, manage alerts and notifications, managing SQL Server using PowerShell, trace access to SQL Server, monitor a SQL Server infrastructure, and import and export data.

More Information
Building Modern Data Analytics Solutions on AWS

In this course, you will learn how to leverage AWS data Services to store, process, analyze, stream, and query data to make decisions with speed and agility at scale, how to modernize data solutions end to end, and obtain skills to put your data to work to make better, more informed decisions, respond faster to the unexpected, and uncover new opportunities.

More Information
DP-300T00: Administering Microsoft Azure SQL Solutions

This course provides students with the knowledge and skills to administer a SQL Server database infrastructure for cloud, on-premises and hybrid relational databases and who work with the Microsoft PaaS relational database offerings. Additionally, it will be of use to individuals who develop applications that deliver content from SQL-based relational databases.

More Information
Programming and Data Wrangling with VBA and Excel

In this course, you will develop and deploy VBA modules to solve business problems.

More Information
PL-300T00: Microsoft Power BI Data Analyst

If you are someone with existing SQL or SQL Server knowledge (or someone highly versed in different data repositories), this is the Power BI course for you. This course covers the various methods and best practices that are in line with business and technical requirements for modeling, visualizing, and analyzing data with Power BI.

More Information
Hands-on Practical Python for Data Wrangling & Transformation

This introductory and beyond level course is for technical users newer to Python who want to learn advanced data handling and transformation skills, using the latest tools and techniques. The course is approximately 50% hands-on to 50% lecture ratio, combining expert lecture, real-world demonstrations and group discussions with machine-based practical labs and exercises. Student machines are required.

More Information

Press enter to see more results