Data Analysis Techniques - MCQs Practice Set

Q.1 Which of the following is a measure of central tendency?

Variance
Standard deviation
Mean
Range
Explanation - The mean is a measure of central tendency representing the average of a set of values.
Correct answer is: Mean
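
As a quick illustration, the sketch below computes the mean alongside two other measures of central tendency (median and mode) with Python's standard `statistics` module. The sample values are made up for illustration only.

```python
import statistics

# Hypothetical sample values, chosen only for illustration.
values = [2, 4, 4, 5, 10]

mean_value = statistics.mean(values)      # sum of values / number of values
median_value = statistics.median(values)  # middle value of the sorted data
mode_value = statistics.mode(values)      # most frequent value

print("mean:", mean_value)
print("median:", median_value)
print("mode:", mode_value)
```

Variance, standard deviation, and range describe spread rather than a typical value, which is why they are not measures of central tendency.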

Q.2 What does a high standard deviation indicate?

Data points are close to the mean
Data points are spread out from the mean
Mean is equal to median
There are no outliers
Explanation - Standard deviation measures the amount of variation or dispersion in a set of values. A high value means data points are widely spread.
Correct answer is: Data points are spread out from the mean
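
To make the idea concrete, here is a small sketch comparing two made-up datasets that share the same mean but differ in spread. It uses `statistics.pstdev` (population standard deviation); the data are assumptions for illustration.

```python
import statistics

# Two hypothetical datasets with the same mean (10) but different spread.
tight = [9, 10, 10, 10, 11]
wide = [0, 5, 10, 15, 20]

tight_sd = statistics.pstdev(tight)  # population standard deviation
wide_sd = statistics.pstdev(wide)

print("means:", statistics.mean(tight), statistics.mean(wide))
print("standard deviations:", round(tight_sd, 3), round(wide_sd, 3))
```

The dataset whose points lie farther from the mean has the larger standard deviation.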

Q.3 Which chart is best for displaying the frequency of categories?

Histogram
Bar chart
Scatter plot
Line graph
Explanation - Bar charts are used to display categorical data with rectangular bars representing frequency or count.
Correct answer is: Bar chart
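
A bar chart is built from per-category counts. The sketch below tallies made-up survey responses with `collections.Counter` and prints a simple text bar for each category; the response data are hypothetical.

```python
from collections import Counter

# Hypothetical categorical responses.
responses = ["red", "blue", "red", "green", "blue", "red"]

counts = Counter(responses)  # frequency of each category

# One text "bar" per category, longest first.
for category, count in counts.most_common():
    print(f"{category:<6} {'#' * count} ({count})")
```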

Q.4 The correlation coefficient ranges from:

0 to 1
-1 to 1
-100 to 100
0 to 100
Explanation - The correlation coefficient (r) ranges from -1 (perfect negative correlation) to +1 (perfect positive correlation).
Correct answer is: -1 to 1
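
The range can be checked with a minimal sketch of Pearson's r written from its definition. The helper name `pearson_r` and the data are assumptions for illustration.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation: co-variation divided by the product of spreads."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    co_dev = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    dev_x = sum((x - mean_x) ** 2 for x in xs)
    dev_y = sum((y - mean_y) ** 2 for y in ys)
    return co_dev / math.sqrt(dev_x * dev_y)

x = [1, 2, 3, 4, 5]
r_up = pearson_r(x, [2, 4, 6, 8, 10])    # perfectly increasing line
r_down = pearson_r(x, [10, 8, 6, 4, 2])  # perfectly decreasing line
r_noisy = pearson_r(x, [2, 1, 4, 3, 5])  # increasing, but not perfectly

print(r_up, r_down, r_noisy)
```

The perfect lines give the extreme values +1 and -1, and the noisier data fall in between.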

Q.5 Which technique is used to compare means of more than two groups?

t-test
ANOVA
Chi-square test
Correlation analysis
Explanation - ANOVA (Analysis of Variance) is used to compare the means of three or more groups to determine if there are significant differences.
Correct answer is: ANOVA
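
As a rough sketch of what one-way ANOVA computes, the code below derives the F statistic (between-group variance divided by within-group variance) by hand. The function name and the three groups are hypothetical; a statistics package would also report the p-value.

```python
def one_way_anova_f(groups):
    """F statistic for one-way ANOVA: mean square between / mean square within."""
    all_values = [v for g in groups for v in g]
    grand_mean = sum(all_values) / len(all_values)
    group_means = [sum(g) / len(g) for g in groups]

    ss_between = sum(len(g) * (m - grand_mean) ** 2
                     for g, m in zip(groups, group_means))
    ss_within = sum((v - m) ** 2
                    for g, m in zip(groups, group_means) for v in g)

    df_between = len(groups) - 1
    df_within = len(all_values) - len(groups)
    return (ss_between / df_between) / (ss_within / df_within)

# Three hypothetical groups of three observations each.
groups = [[1, 2, 3], [2, 3, 4], [5, 6, 7]]
f_stat = one_way_anova_f(groups)
print("F =", f_stat)
```

A large F means the group means differ by more than the variation inside each group would suggest.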

Q.6 Which of the following is a non-parametric test?

t-test
ANOVA
Mann-Whitney U test
Pearson correlation
Explanation - Non-parametric tests do not assume a normal distribution. The Mann-Whitney U test compares two independent groups on an ordinal variable or on an interval variable that is not normally distributed.
Correct answer is: Mann-Whitney U test
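
The U statistic can be sketched directly from its pair-counting definition: for every pair drawn from the two groups, count which group has the larger value. The function name and samples below are assumptions for illustration; no p-value is computed here.

```python
def mann_whitney_u(sample_a, sample_b):
    """Return (U_a, U_b): pairwise 'wins' for each sample, ties count as half."""
    u_a = 0.0
    for a in sample_a:
        for b in sample_b:
            if a > b:
                u_a += 1.0
            elif a == b:
                u_a += 0.5
    u_b = len(sample_a) * len(sample_b) - u_a
    return u_a, u_b

# Hypothetical ordinal scores for two independent groups.
group_a = [3, 5, 8]
group_b = [1, 2, 4, 9]

u_a, u_b = mann_whitney_u(group_a, group_b)
print("U_a =", u_a, "U_b =", u_b)
```

Because only the ordering of values matters, the statistic does not depend on the data being normally distributed.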

Q.7 In a scatter plot, the independent variable is usually plotted on the:

X-axis
Y-axis
Z-axis
Both axes
Explanation - The independent variable is plotted on the X-axis, while the dependent variable is plotted on the Y-axis.
Correct answer is: X-axis

Q.8 Which measure indicates the strength and direction of a linear relationship between two variables?

Variance
Covariance
Correlation coefficient
Standard deviation
Explanation - The correlation coefficient quantifies the strength and direction of a linear relationship between two variables.
Correct answer is: Correlation coefficient

Q.9 A histogram is primarily used for:

Comparing means
Showing frequency distribution
Measuring correlation
Testing hypotheses
Explanation - Histograms display the frequency distribution of continuous or discrete data using adjacent bars.
Correct answer is: Showing frequency distribution
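
A histogram is essentially a count of values per interval (bin). The sketch below bins made-up height measurements into equal-width intervals; the helper `histogram_counts` and its parameters are hypothetical.

```python
def histogram_counts(values, low, width, bins):
    """Count how many values fall into each equal-width bin starting at `low`."""
    counts = [0] * bins
    for v in values:
        index = int((v - low) // width)
        if 0 <= index < bins:  # ignore values outside the binned range
            counts[index] += 1
    return counts

# Hypothetical heights in centimetres.
heights = [151, 158, 160, 162, 165, 167, 171, 174, 178, 183]
counts = histogram_counts(heights, low=150, width=10, bins=4)

for i, count in enumerate(counts):
    start = 150 + 10 * i
    print(f"{start}-{start + 9}: {'#' * count}")
```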

Q.10 Which test is used to examine the relationship between two categorical variables?

Chi-square test
t-test
ANOVA
Regression analysis
Explanation - Chi-square tests assess whether there is a significant association between two categorical variables.
Correct answer is: Chi-square test
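
A minimal sketch of the chi-square test of independence follows: expected counts come from the row and column totals, and the statistic sums the squared gaps between observed and expected counts. The function name and the 2x2 table are made up for illustration.

```python
def chi_square_independence(table):
    """Return (statistic, degrees of freedom) for a contingency table."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)

    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / total
            stat += (observed - expected) ** 2 / expected

    dof = (len(table) - 1) * (len(table[0]) - 1)
    return stat, dof

# Hypothetical counts: rows = group, columns = preference.
table = [[30, 10],
         [10, 30]]
stat, dof = chi_square_independence(table)
print("chi-square =", stat, "dof =", dof)
```

With 1 degree of freedom the 0.05 critical value is about 3.84, so a statistic this large would point to an association between the two variables.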

Q.11 Which of the following is a measure of dispersion?

Median
Mode
Range
Skewness
Explanation - Range measures dispersion by subtracting the smallest value from the largest value in a dataset.
Correct answer is: Range

Q.12 If a dataset has a positive skew, which of the following is true?

Mean < Median
Mean > Median
Mean = Median
Data is symmetric
Explanation - In a positively skewed distribution, the mean is pulled in the direction of the long tail, making it larger than the median.
Correct answer is: Mean > Median
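
A small made-up example shows the effect: one very large value stretches the right tail and drags the mean upward, while the median barely moves.

```python
import statistics

# Hypothetical incomes (in thousands) with one large value in the right tail.
incomes = [20, 22, 25, 27, 30, 150]

mean_income = statistics.mean(incomes)
median_income = statistics.median(incomes)

print("mean:", round(mean_income, 2))
print("median:", median_income)
```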

Q.13 Which analysis technique predicts the value of one variable based on another?

Correlation
Regression
Factor analysis
Chi-square test
Explanation - Regression analysis is used to predict the value of a dependent variable based on one or more independent variables.
Correct answer is: Regression
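
To illustrate prediction, here is a sketch of simple linear regression fitted by ordinary least squares. The helper `fit_line` and the hours/scores data are assumptions for illustration.

```python
def fit_line(xs, ys):
    """Least-squares slope and intercept for y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    slope = (sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
             / sum((x - mean_x) ** 2 for x in xs))
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Hypothetical data: hours studied (independent) and exam score (dependent).
hours = [1, 2, 3, 4, 5]
scores = [52, 54, 59, 61, 64]

slope, intercept = fit_line(hours, scores)
predicted_for_6 = intercept + slope * 6  # predict the score for 6 hours

print("slope:", round(slope, 2), "intercept:", round(intercept, 2))
print("predicted score for 6 hours:", round(predicted_for_6, 2))
```

Correlation only measures how strongly the two variables move together; regression produces the equation used to predict one from the other.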

Q.14 What is the purpose of a box plot?

To display mean and standard deviation
To summarize data distribution and identify outliers
To show correlation
To compare two categorical variables
Explanation - Box plots display the median, the quartiles, and possible outliers, giving a compact summary of the data distribution.
Correct answer is: To summarize data distribution and identify outliers
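
The numbers behind a box plot can be sketched with `statistics.quantiles`. The data are made up, the "inclusive" quartile method is one of several conventions, and the 1.5 x IQR fence is a common rule of thumb rather than the only way to flag outliers.

```python
import statistics

# Hypothetical measurements with one unusually large value.
data = [1, 2, 3, 4, 5, 6, 7, 8, 30]

# Quartiles; other quartile conventions can give slightly different values.
q1, q2, q3 = statistics.quantiles(data, n=4, method="inclusive")
iqr = q3 - q1

# Common rule of thumb: points beyond 1.5 * IQR from the box are outliers.
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr
outliers = [v for v in data if v < lower_fence or v > upper_fence]

print("Q1, median, Q3:", q1, q2, q3)
print("outliers:", outliers)
```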

Q.15 Which test would you use to compare the means of two independent groups?

Independent t-test
Paired t-test
ANOVA
Chi-square test
Explanation - The independent t-test compares the means of two unrelated groups to determine whether the difference between them is statistically significant.
Correct answer is: Independent t-test
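
As a sketch, the t statistic for two independent groups can be computed by hand. This version uses the pooled-variance (Student) form, which assumes roughly equal variances in the two groups; the function name and samples are hypothetical, and no p-value is computed.

```python
import math
import statistics

def pooled_t_statistic(sample_a, sample_b):
    """Student's t for two independent samples, assuming equal variances."""
    n_a, n_b = len(sample_a), len(sample_b)
    var_a = statistics.variance(sample_a)  # sample variance (n - 1 denominator)
    var_b = statistics.variance(sample_b)
    pooled_var = ((n_a - 1) * var_a + (n_b - 1) * var_b) / (n_a + n_b - 2)
    std_error = math.sqrt(pooled_var * (1 / n_a + 1 / n_b))
    t_stat = (statistics.mean(sample_a) - statistics.mean(sample_b)) / std_error
    return t_stat, n_a + n_b - 2  # statistic and degrees of freedom

# Hypothetical scores from two unrelated groups.
group_a = [5, 6, 7, 8, 9]
group_b = [1, 2, 3, 4, 5]

t_stat, dof = pooled_t_statistic(group_a, group_b)
print("t =", round(t_stat, 3), "dof =", dof)
```

A paired t-test would instead be used when the same subjects are measured twice.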

Q.16 In regression, the coefficient of determination (R²) indicates:

The slope of the regression line
The proportion of variance explained by the model
The correlation coefficient
The standard error
Explanation - R² indicates how much of the variance in the dependent variable is explained by the independent variable(s) in the regression model.
Correct answer is: The proportion of variance explained by the model
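
R² can be sketched as one minus the ratio of unexplained variation to total variation. The example below fits a least-squares line to made-up data and then computes R² from the residuals; all names and values are illustrative.

```python
# Hypothetical data: hours studied and exam score.
hours = [1, 2, 3, 4, 5]
scores = [52, 54, 59, 61, 64]

n = len(hours)
mean_x = sum(hours) / n
mean_y = sum(scores) / n

# Least-squares line.
slope = (sum((x - mean_x) * (y - mean_y) for x, y in zip(hours, scores))
         / sum((x - mean_x) ** 2 for x in hours))
intercept = mean_y - slope * mean_x
predicted = [intercept + slope * x for x in hours]

ss_res = sum((y - p) ** 2 for y, p in zip(scores, predicted))  # unexplained
ss_tot = sum((y - mean_y) ** 2 for y in scores)                # total
r_squared = 1 - ss_res / ss_tot

print("R squared:", round(r_squared, 4))
```

In simple linear regression R² equals the square of the correlation coefficient, but it is interpreted as a proportion of variance, not as the correlation itself.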

Q.17 A p-value less than 0.05 typically indicates:

The null hypothesis is rejected
The null hypothesis is accepted
There is no effect
The sample size is too small
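Explanation - A p-value below 0.05 means that, if the null hypothesis were true, results at least as extreme as those observed would be expected less than 5% of the time. At the conventional 0.05 significance level the result is called statistically significant, and the null hypothesis is rejected.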
Correct answer is: The null hypothesis is rejected

Q.18 Which analysis is used to reduce data into a smaller set of underlying variables?

Regression analysis
Factor analysis
t-test
Chi-square test
Explanation - Factor analysis identifies a smaller number of underlying factors that explain the patterns of correlations among the observed variables.
Correct answer is: Factor analysis

Q.19 Which of the following is not an assumption of parametric tests?

Normality of distribution
Homogeneity of variance
Interval or ratio scale
Data must be categorical
Explanation - Parametric tests require interval or ratio data, not categorical data. Categorical data is typically analyzed with non-parametric tests such as the chi-square test.
Correct answer is: Data must be categorical

Q.20 A scatter plot with points closely clustered along a line indicates:

Weak correlation
Strong correlation
No correlation
Negative skewness
Explanation - Points closely clustered along a line in a scatter plot suggest a strong linear relationship between the variables.
Correct answer is: Strong correlation

Q.21 Which technique is used to test the difference between observed and expected frequencies?

t-test
Chi-square test
Correlation
ANOVA
Explanation - The chi-square goodness-of-fit test compares observed frequencies with expected frequencies to determine whether the differences are statistically significant.
Correct answer is: Chi-square test
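
A goodness-of-fit statistic can be sketched in a few lines. The example assumes a six-sided die rolled 60 times with made-up counts, so a fair die would be expected to show each face 10 times.

```python
def chi_square_goodness_of_fit(observed, expected):
    """Sum of (observed - expected)^2 / expected over all categories."""
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))

# Hypothetical counts for faces 1..6 over 60 rolls.
observed = [8, 12, 10, 10, 9, 11]
expected = [10] * 6  # fair die: 60 rolls / 6 faces

stat = chi_square_goodness_of_fit(observed, expected)
dof = len(observed) - 1

print("chi-square =", stat, "dof =", dof)
```

A small statistic like this one means the observed counts are close to what the expected frequencies predict.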

Q.22 Which of the following is a visual representation of the distribution of a continuous variable?

Pie chart
Histogram
Bar chart
Box plot
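Explanation - Histograms display the frequency distribution of a continuous variable using adjacent bars, one per interval (bin). A box plot also summarizes a continuous variable, but only through its quartiles and outliers, so the histogram is the best answer here.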
Correct answer is: Histogram

Q.23 In multiple regression, multicollinearity refers to:

High correlation among independent variables
Low correlation among dependent variables
High variance in residuals
Non-normality of errors
Explanation - Multicollinearity occurs when independent variables are highly correlated, which can distort the estimates of regression coefficients.
Correct answer is: High correlation among independent variables
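
One common way to quantify multicollinearity is the variance inflation factor (VIF). The sketch below covers only the two-predictor case, where VIF reduces to 1 / (1 - r²) with r the correlation between the predictors. The data and helper name are hypothetical.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation between two equal-length lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    co_dev = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    dev_x = sum((x - mean_x) ** 2 for x in xs)
    dev_y = sum((y - mean_y) ** 2 for y in ys)
    return co_dev / math.sqrt(dev_x * dev_y)

# Two hypothetical predictors that carry almost the same information.
x1 = [1, 2, 3, 4, 5]
x2 = [1, 2, 3, 4, 6]

r = pearson_r(x1, x2)
vif = 1 / (1 - r ** 2)  # two-predictor case only

print("correlation between predictors:", round(r, 4))
print("VIF:", round(vif, 2))
```

The closer the correlation between predictors is to 1, the larger the VIF and the less stable the individual coefficient estimates become.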

Q.24 Which of the following is used to compare three or more groups when the assumptions of ANOVA are violated?

Kruskal-Wallis test
Chi-square test
Pearson correlation
Independent t-test
Explanation - The Kruskal-Wallis test is a rank-based, non-parametric alternative to one-way ANOVA. It compares three or more independent groups (often described as comparing their medians) when the normality assumption is violated.
Correct answer is: Kruskal-Wallis test
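
The H statistic can be sketched from its rank-based formula: pool all values, rank them, and compare the rank sums of the groups. This simplified version assumes there are no tied values, and the function name and data are hypothetical.

```python
def kruskal_wallis_h(groups):
    """Kruskal-Wallis H statistic. Assumes no tied values (no tie correction)."""
    pooled = sorted(v for g in groups for v in g)
    # Rank 1 for the smallest value; valid only when all values are distinct.
    rank_of = {value: position + 1 for position, value in enumerate(pooled)}
    total = len(pooled)
    rank_term = sum(sum(rank_of[v] for v in g) ** 2 / len(g) for g in groups)
    return 12 / (total * (total + 1)) * rank_term - 3 * (total + 1)

# Three hypothetical independent groups.
groups = [[12, 15, 11], [18, 20, 17], [25, 30, 27]]
h_stat = kruskal_wallis_h(groups)

print("H =", round(h_stat, 3))
```

Because only ranks enter the formula, extreme values and non-normal shapes have far less influence than they would in ANOVA.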

Q.25 Which analysis technique is used to examine the effect of multiple independent variables on one dependent variable?

Simple regression
Multiple regression
Correlation
t-test
Explanation - Multiple regression examines the impact of two or more independent variables on a single dependent variable, providing coefficients for each predictor.
Correct answer is: Multiple regression
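
As a rough sketch, multiple regression coefficients can be obtained by solving the normal equations. The helpers below are hypothetical, and the data were constructed so that y = 1 + 2*x1 + 3*x2 exactly, which makes the recovered coefficients easy to check.

```python
def solve_linear_system(matrix, rhs):
    """Solve matrix * x = rhs by Gauss-Jordan elimination with partial pivoting."""
    n = len(rhs)
    aug = [row[:] + [value] for row, value in zip(matrix, rhs)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for row in range(n):
            if row != col:
                factor = aug[row][col] / aug[col][col]
                aug[row] = [a - factor * b for a, b in zip(aug[row], aug[col])]
    return [aug[i][n] / aug[i][i] for i in range(n)]

def fit_multiple_regression(rows, ys):
    """Return [intercept, b1, b2, ...] from the normal equations (X'X) b = X'y."""
    design = [[1.0] + list(row) for row in rows]  # prepend intercept column
    k = len(design[0])
    xtx = [[sum(r[i] * r[j] for r in design) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * y for r, y in zip(design, ys)) for i in range(k)]
    return solve_linear_system(xtx, xty)

# Hypothetical data: each row is (x1, x2); y was built as 1 + 2*x1 + 3*x2.
rows = [(0, 0), (1, 0), (0, 1), (1, 1), (2, 1)]
ys = [1, 3, 4, 6, 8]

coefficients = fit_multiple_regression(rows, ys)
print([round(c, 6) for c in coefficients])
```

Each coefficient describes the change in the dependent variable for a one-unit change in its predictor while the other predictor is held constant.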