Statistics: The science of collecting, organizing, analyzing, interpreting, and presenting data.
Data Analysis: The process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
Descriptive Statistics: Numerical values that summarize and describe the features of a dataset.
Mean: The average of a dataset.
Median: The middle value of an ordered dataset.
Mode: The most frequently occurring value in a dataset.
Standard Deviation: A measure of data dispersion around the mean.
Variance: The average of the squared differences from the mean.
Range: The difference between the highest and lowest values.
Quartiles (Q1, Q3): Values dividing a dataset into four equal parts.
Interquartile Range (IQR): The range of the middle 50% of the data (Q3−Q1).
Frequency: The count of occurrences for a particular value or category.
Relative Frequency: The proportion of occurrences for a particular value or category.
Correlation Coefficient (r): A measure of the linear relationship between two variables.
Coefficient of Determination (R²): The proportion of variance in the dependent variable explained by the independent variable(s).
Linear Regression: A statistical method to model the relationship between a dependent variable and one or more independent variables by fitting a linear equation.
Scatter Plot: A graph showing the relationship between two numerical variables.
Histogram: A graphical representation showing the distribution of numerical data.
Bar Chart: A graph used to display and compare categorical data.
Pie Chart: A circular statistical graphic divided into slices to illustrate numerical proportion.