Kmeans Clustering

When the input data is unlabeled and we have to find hidden patterns or clusters in the data set unsupervised learning comes in the picture. In clustering what we do…

Time Series Forecasting Using R

Time Series Forecasting Using R : A Starter Pack Some basic theoretical ideas needed before we proceed :- Time Series Data - A time series is a set of observations…

Logistic Regression in R

Why Logistic Regression? The linear Regression model assumes that the response variable Y is quantitative. But in many situations, the response variable is instead qualitative. For example eye colour is…

Data Set Repository

This is the place to discover cool data and work together to solve Data problems faster and seamlessly analyse open data. Data Sets Repository UC Irvine Machine Learning Repository - contains data sets…

How to Use Probability Distribution to Understand Your Data Critically

Is there any basis why probability distribution has to be talked about? What are its uses in understanding data? Can it show a sense of relevance according to one's needs?…

Dispersion

Dispersion   Dispersion means the variability, spread in the data. Average gives a single representative of the data however reliability of average is more if dispersion is less. Consider the…

Testing of Hypothesis and its application using R

Hypothesis Testing The primary objective of any statistical analysis is to gather information about some characteristics of the population. But usually only a part of the population (i.e. sample) can be…

Data Analytics: How to Use Graphs to Present Your Data Smartly

When we say data, these  involve numbers or texts or symbols that represent some pieces of information. More often than not, we can see numbers. Because numbers are involved, it…