Posts

**Producing a map in 5 lines of code**

Over the past year, I have also been exploring the geospatial capabilities of various R packages. Today, I want to share the most basic of geospatial capabilities, which is producing a map. Using R, you can do this in just 5 lines of code. Let's reproduce...

**Testing Senzing's Entity Resolution Workbench**

I have the great honor of knowing ex-IBM Fellow Jeff Jonas, the co-Founder, CEO and Chief Scientist of Senzing. Apart from being exceptionally talented, Jeff is also an amazing human being who is always willing to help others. I have personally been the ...

**Visualization of the 1854 London Cholera Outbreak**

This post attempts to visualize the 1854 London Cholera Outbreak based on data collected by Dr. John Snow and provided in the HistData R package. Dr. Snow was able to identify that cholera was a waterborne disease by visualizing his data in 1854 and was ab...

**Regression in R**

My latest publicly available R notebook created in IBM's Data Science Experience is here! This notebook covers: fitting and interpreting linear models; evaluating model assumptions; and selecting among competin...

**Coefficient of Alienation**

If you thought the coefficient of alienation referred to the hostility I receive from my family as I update my blog on a Saturday afternoon, I would not fault you too much. However, this blog is a blog about predictive analytics which is based on Statistic...
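
As a quick illustration of the statistical meaning: the coefficient of alienation is conventionally defined as sqrt(1 - r^2), where r is the Pearson correlation, so it measures how much of the relationship between two variables is left unexplained. The sketch below is in Python rather than the R used on this blog, and the function names and sample data are made up for the example; it is not taken from the post itself.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def coefficient_of_alienation(x, y):
    """sqrt(1 - r^2): the share of the relationship r does not account for."""
    r = pearson_r(x, y)
    # Clamp at zero to guard against tiny negative values from rounding.
    return math.sqrt(max(0.0, 1.0 - r * r))


# Illustrative data only (hypothetical values).
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]
print(round(pearson_r(x, y), 4))
print(round(coefficient_of_alienation(x, y), 4))
```

A value near 0 means the two variables are almost perfectly linearly related; a value near 1 means the correlation explains very little.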

**Homoscedasticity and heteroscedasticity**

Homoscedasticity and heteroscedasticity - two of the scariest sounding terms in all of Statistics! So what do they mean? When one calculates the variance or standard deviation of a dataset of random variables, one assumes that the variance is constant acro...
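
To make the two terms concrete, here is a minimal simulation sketch, written in Python rather than R and not taken from the post. It generates one set of residuals with constant spread and one whose spread grows with x, then compares the residual variance in the upper and lower halves of x. The split-sample ratio is a crude check loosely in the spirit of the Goldfeld-Quandt test, not a formal test; the seed, sample size, and function name are assumptions chosen for the example.

```python
import random
from statistics import pvariance


def spread_ratio(xs, residuals):
    """Residual variance in the upper half of x divided by the lower half."""
    pairs = sorted(zip(xs, residuals))
    half = len(pairs) // 2
    low = [r for _, r in pairs[:half]]
    high = [r for _, r in pairs[half:]]
    return pvariance(high) / pvariance(low)


rng = random.Random(42)  # fixed seed so the run is repeatable
xs = [rng.uniform(1, 10) for _ in range(2000)]

# Homoscedastic: the noise has the same standard deviation everywhere.
homo = [rng.gauss(0, 1) for _ in xs]
# Heteroscedastic: the noise standard deviation grows with x.
hetero = [rng.gauss(0, x) for x in xs]

homo_ratio = spread_ratio(xs, homo)
hetero_ratio = spread_ratio(xs, hetero)
print(f"homoscedastic ratio:   {homo_ratio:.2f}")
print(f"heteroscedastic ratio: {hetero_ratio:.2f}")
```

A ratio close to 1 is what constant variance looks like; a ratio well above 1 signals that the spread changes with x.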

**Standard Deviation versus Absolute Mean Deviation**

Among the first things any student of statistics learns are two popular descriptive statistics: the mean and the standard deviation. Has the approach to calculating the standard deviation ever got you wondering about the need to square the distances from...
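
The difference between the two measures is easy to see side by side. The following is a small Python sketch (not the R code from the post) that computes the population standard deviation, which squares each distance from the mean, and the mean absolute deviation, which takes absolute distances instead. Dividing by n rather than n - 1, the function names, and the sample data are all assumptions made for the illustration.

```python
import math


def standard_deviation(values):
    """Population standard deviation: root of the mean squared distance."""
    mean = sum(values) / len(values)
    return math.sqrt(sum((v - mean) ** 2 for v in values) / len(values))


def mean_absolute_deviation(values):
    """Mean of the absolute distances from the mean."""
    mean = sum(values) / len(values)
    return sum(abs(v - mean) for v in values) / len(values)


# Illustrative data only (hypothetical values); the mean is 5.
data = [2, 4, 4, 4, 5, 5, 7, 9]
print(standard_deviation(data))       # 2.0
print(mean_absolute_deviation(data))  # 1.5
```

Because squaring gives large distances more weight, the standard deviation is never smaller than the mean absolute deviation for the same data.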

**Basic Statistics in R**

My latest publicly available R notebook created in IBM's Data Science Experience is here! This notebook covers: descriptive statistics; frequency and contingency tables; correlations and covariances; t-tests; and N...
