Datails - StepUp Analytics

#### Data Science Workshop With R

Workshop Start Date: 19-10-2019

Workshop End Date: 24-11-2019

Venue: Banaras Hindu University, Dept. Of Statistics

Registration Ends: 30-09-2019

Fee: Rs.500

Duration: 30 hours Weekend Program

• Pre-requisite:

This session is for Banaras Hindu University students

Software: R programming(R Studio). The team will assist in all the installations.

Total Duration: 30 hours

Additional Q&A session based on demands

Mode: Live Online session

Projects: One real-time project based on R, It Will be mentored by faculty.

Certification: Post-workshop completion and project submission, Candidates will be awarded a certificate

R Programming

Data structure

Vector: numeric, character, logical

Factor, Matrix, Data frame, List

Hands-on practice

Operation

Mathematical, Statistical, Logical

Relational, Conditional, ifelse

String Operation, Date operation, etc…

Hands-on practice

Exploratory Data Analysis

External data import & export

Data Summary, Transformation

A subset, Rename, Reshape, Sort, Merge, Append

Tabulation, Aggregation,

Handling missing values, etc…

Hands-on practice

Data Munging with ‘dplyr’

All kind of data manipulation with ‘dplyr’

Hands-on practice

Loop & User Defined Function

Control flow: if-else-if

for loop, while loop, next, break

User-defined function to build an algorithm

Hands-on practice

Merging and joining

Outliers and Missing value treatment

R Graphics & Statistics

Data Visualisation

Generating graphs/plots in R

Graphical parameters

Line, Bar, Pie, Histogram, Density plots

Saving/Exporting R plots

dev.off()

Hands-on practice

Colour and theme manipulation

Hands-on practice

Statistical Science (Descriptive)

Central tendency: Mean, Median

Dispersion: variance, std.deviation

Quartiles, IQR, covariance, correlation

Different Distribution Analysis

Box-plot, Outlier treatment

Hands-on practice

Machine Learning & Modelling

Supervised Learning Models

Hands-on case study for the below algorithm

Multivariate Linear Regression

Univariate analysis & Variable Selection

Model Assumptions & Diagnostic Checks

Logistic Regression for Classification

Logit function, odds ratio,

Model Estimation, Confusion Matrix

Accuracy, Sensitivity, Specificity

Unsupervised Learning Models

Cluster Analysis