Sharing some thoughts on these two related fields.
The relevance of statistics in the field of data science.
Statistics is an established field that describes and analyzes data. Data science may be a relatively new field but it is quite similar to statistics that it also looks at patterns and trends of properties that we know about a sample or a population.
Trends and patterns are used to tell a story — this is how we communicate any findings to our audience. Some statistical tools and methods are applied in data science to be able to generate descriptions and predictions about the data.
Discuss the importance of data preparation before data analysis.
Data needs to be explored to be able to find out how we can analyze it to give the insights we need. The whole experience of data analysis is determined by the quality of the data that we have.
This is where data preparation plays a crucial role. It’s one of the phases that take up— if not the phase that takes up —the most time in the data value chain. Data preparation helps protect the integrity of the data.