Introduction to R for Data Analysis

R is a functional programming environment for business analysts and data scientists. It's a language that many non-programmers can easily work with, naturally extending a skill set that is common to high-end Excel users. It's the perfect tool for when the analyst has a statistical, numerical, or probabilities-based problem based on real data and they've pushed Excel past its limits.

Description

Overview

This course covers the fundamentals of the R programming language with the focus on using R for statistical data analysis. It includes hands on labs allowing the students to reinforce the programming concepts by creating practical code example of analyzing data using R.

Who Should Attend

No prior knowledge of statistical data analysis or of the R language is assumed. This course is perfect for those interested using the R language to perform the work of a data scientist, answering important business questions by applying a statistical analysis approach to data.

Course Outline

Module 1: Getting Started with R

  • What is R and Where to Get It
  • Development Tools for R
  • Entering R Expressions

Module 2: R Language Fundamentals

  • Variables
  • Operators
  • Logical Flow
  • Vectors
  • Functions
  • Scripts

Module 3: Extending R

  • Base R Packages
  • Installing R Packages
  • Loading and Unloading Modules
  • Removing Packages

Module 4: Working with Data

  • Date Objects
  • Rectangular Data Objects
  • Data Frames
  • Getting Data Into R
  • Working with External Data

Module 5: Data Analytics

  • Summarizing Data
  • Descriptive Statistics
  • Measuring Variance
  • The Standard Deviation
  • Predictive Analysis

Module 6: Visualizing Your Data

  • About Graphical Analysis
  • Line, Pie and Bar Charts
  • Box-Whisker, Scatter, Pairs, and Dot Plots

Similar courses

Using Data Analysis Expressions to solve common business problems in Power BI

More Information

Analyze business data, visualize insights, and share those insights across the enterprise

More Information

In this course, you will perform advanced data visualization and data blending with Tableau.

More Information

This course is designed for people who want to learn the Python programming language in preparation for using Python to develop software for a wide range of applications, such as data science, machine learning, artificial intelligence, and web development.

More Information

This course teaches concepts by deep-dive on-hand exercises. Throughout the course, you will learn data wrangling with hands-on exercises and activities. You’ll find checklists, best practices, and critical points mentioned throughout the lessons, making things more interesting.

More Information

In this course, you will build a data analytics solution using Amazon Redshift, a cloud data warehouse service.

More Information

In this course, you will learn new concepts, strategies, and best practices for designing a cloud-based data warehousing solution using Amazon Redshift, the petabyte-scale data warehouse in AWS.

More Information

In this course, you will learn how to build an operational data lake that supports analysis of both structured and unstructured data. You will learn the components and functionality of the services involved in creating a data lake. You will use AWS Lake Formation to build a data lake, AWS Glue to build a data catalog, and Amazon Athena to analyze data. The course lectures and labs further your learning with the exploration of several common data lake architectures.

More Information

In this course, you will learn to build batch data analytics solutions using Amazon EMR, an enterprise-grade Apache Spark and Apache Hadoop managed service.

More Information

In this course you will describe key database concepts in the context of SQL Server, characterize database languages used in SQL Server, describe data modeling techniques, discuss normalization and denormalization techniques, distinguish relationship types and effects in database design, describe the effects of database design on performance, and define commonly used database objects.

More Information

In this course, you will create single table SELECT queries, create multiple table SELECT queries, insert, update, and delete data, query data using built-in functions, create queries that aggregate data, create subqueries, create queries that use table expressions, use UNION, INTERSECT, and EXCEPT on multiple sets of data, implement window functions in queries, use PIVOT and GROUPING SETS in queries, use stored procedures in queries, add error handling to queries, and use transactions in queries.

More Information

In this course you will, create sophisticated SSIS packages for extracting, transforming, and loading data, use containers to efficiently control repetitive tasks and transactions, configure packages to dynamically adapt to environment changes, use Data Quality Services to cleanse data, successfully troubleshoot packages, create and manage the SSIS Catalog, deploy, configure, and schedule packages, secure the SSIS Catalog.

More Information

This course will teach you the fundamentals of programming in R to get you started. It will also teach you how to use R to perform common data science tasks and achieve data-driven results for the business.

More Information

In this course you will authenticate and authorize users, assign server and database roles, authorize users to access resources, use encryption and auditing features to protect data, describe recovery models and backup strategies, backup and restore SQL Server databases, automate database management, configure security for the SQL Server agent, manage alerts and notifications, managing SQL Server using PowerShell, trace access to SQL Server, monitor a SQL Server infrastructure, and import and export data.

More Information

In this course, you will learn how to leverage AWS data Services to store, process, analyze, stream, and query data to make decisions with speed and agility at scale, how to modernize data solutions end to end, and obtain skills to put your data to work to make better, more informed decisions, respond faster to the unexpected, and uncover new opportunities.

More Information

The creation of data-backed visualizations is a key way data scientists, or any professional, can explore, analyze, and report insights and trends from data. Tableau® software is designed for this purpose. Tableau was built to connect to a wide range of data sources and allows users to quickly create visualizations of connected data to gain insights, show trends, and create reports.

More Information

This course provides students with the knowledge and skills to administer a SQL Server database infrastructure for cloud, on-premises and hybrid relational databases and who work with the Microsoft PaaS relational database offerings. Additionally, it will be of use to individuals who develop applications that deliver content from SQL-based relational databases.

More Information

In this course, you will develop and deploy VBA modules to solve business problems.

More Information

If you are someone with existing SQL or SQL Server knowledge (or someone highly versed in different data repositories), this is the Power BI course for you. This course covers the various methods and best practices that are in line with business and technical requirements for modeling, visualizing, and analyzing data with Power BI.

More Information

This introductory and beyond level course is for technical users newer to Python who want to learn advanced data handling and transformation skills, using the latest tools and techniques. The course is approximately 50% hands-on to 50% lecture ratio, combining expert lecture, real-world demonstrations and group discussions with machine-based practical labs and exercises. Student machines are required.

More Information

CompTIA Data+ is an early-career data analytics certification for professionals tasked with developing and promoting data-driven business decision-making that gives learners the confidence to bring data analysis to life.

More Information

In this course, you will compose SQL queries to retrieve desired information from a database.

More Information

In this course, you will work with advanced queries to manipulate and index tables. You will also create transactions so that you can choose to save or cancel the data entry process.

More Information

This 2-day entry-level course examines the services and features of Microsoft SQL 2022. (This is NOT a SQL querying course, SQL Querying syntax will not be discussed). The content focuses on database tables, adding and changing data, creating and using stored procedures, entity relationships, and indexes.

More Information

Doing data analysis work is about more than learning a software program (Excel, Power BI, Tableau, etc.) - you need to understand the concepts and theory too. This one day course gets you up to speed (and can be useful either before or after your software classes).

More Information

In this course, you will use various Python tools to load, analyze, manipulate, and visualize business data.

More Information

In this course, students will create complex reports & data sources using the tools in Crystal Reports 2020. Students will not only create more complex reports including sub-reports and cross-tabs, but will also increase their speed and efficiency.

More Information

In this course, students will create a basic report by connecting to a database and modifying the report's presentation.

More Information

Our Exam Cram sessions are intensive, focused review sessions designed to help your team master key concepts and pass their CompTIA certification exams with confidence. Led by expert instructors, these sessions provide in-depth, targeted hands-on practice to ensure your team is fully prepared for exam day. Cloud+ covers mining and manipulating data, applying basic statistical methods, and analyzing complex datasets. This exam cram session is included with the Data+ course.

More Information

This course is designed for professionals in a variety of job roles who are currently using desktop or web-based data management tools such as Microsoft® Excel® or SQL Server® reporting services to perform numerical or general data analysis. This course is also designed for professionals who want to pursue the Microsoft Power BI Data Analyst (Exam PL-300) certification.

More Information

In this course, you will develop your understanding about agile business analysis and the role of the business analyst on an agile team. You will learn how business analysis on an agile project is ‘the same’ and ‘different’ than business analysis performed on waterfall projects. You will understand how the business analysis role changes on an agile team.

More Information

During this 5-day course, students will learn Transact-SQL as implemented in SQL Server 2008, 2012 and 2014. The course starts by establishing a foundation understanding of database concepts and terminology. Students are then prepared to use various Microsoft tools to submit queries and view the result.

More Information

In this course, students will continue their learning on the foundations of report writing with Microsoft® SQL Server® Report Builder and SSRS.

More Information

In this course, you will learn to build streaming data analytics solutions using AWS services, including Amazon Kinesis and Amazon Managed Streaming for Apache Kafka (Amazon MSK).

More Information

The course focuses on common data engineering tasks such as orchestrating data transfer and transformation pipelines, working with data files in a data lake, creating and loading relational data warehouses, capturing and aggregating streams of real-time data, and tracking data assets and lineage.

More Information

This course introduces a process for effectively planning and designing a functional, efficient database. Knowing how to plan a relational database is important to the success of the databases you create. Without planning, you cannot possibly know what the database needs to do, or even what information to include in the database. Planning a database is essential and prevents the extra work of fixing data maintenance problems later on.

More Information