The Role of Statistics in Data Science

Sharing some thoughts on these two related fields.

Photo credit: @lira_n4 via Twenty20

The relevance of statistics in the field of data science.

Trends and patterns are used to tell a story — this is how we communicate any findings to our audience. Some statistical tools and methods are applied in data science to be able to generate descriptions and predictions about the data.

Discuss the importance of data preparation before data analysis.

This is where data preparation plays a crucial role. It’s one of the phases that take up— if not the phase that takes up —the most time in the data value chain. Data preparation helps protect the integrity of the data.

A Linux user currently exploring the wonders of data science. Curious as a cat and sleeps like one. She’s also into skygazing, anime, and gamification.