Types of Data


2025-05-09 at 10.05.54.jpg

<aside> 💡

Data type determines which approaches are suitable when summarizing your data.

</aside>

Taxonomy of data types is important to help determine the type of visual display, data analysis, or statistical model. Moreover, data type for a variable determines how R or Python etc will handle computations for that variable.

Data doesn’t always start a rectangular form (like a DataFrame): unstructured data (e.g., text) must be processed and manipulated so that it can be represented as a set of features in the rectangular data. Data in relational databases must be extracted and put into a single table for most data analysis and modeling tasks.

<aside> 💡

The basic data structure in data science is a rectangular matrix in which rows are records and columns are variables (features).

</aside>

Pie Charts


When to use?

Useful if you want to compare basic proportions, i.e. which group has higher frequency than others.

When NOT to use?

Pie Charts are less useful if all slices are of similar size, i.e. if there’s little variation between groups.

Video games sold across various categories

Video games sold across various categories

<aside> 💡

Frequency = Statistical way of saying how many items there are in a particular category or interval?

</aside>

Bar Charts