Data & Information # MCQs Practice set

Q.1 Which of the following best describes 'data'?

Raw facts and figures
Meaningful information
Instructions for a computer
Organized knowledge
Explanation - Data refers to raw, unprocessed facts such as numbers, text, or images, which can later be processed into meaningful information.
Correct answer is: Raw facts and figures

Q.2 Information is created when data is:

Stored
Processed and organized
Deleted
Ignored
Explanation - Information results when raw data is processed, structured, and given context to make it meaningful.
Correct answer is: Processed and organized

Q.3 Which of these is an example of qualitative data?

Height in centimeters
Temperature in degrees
Eye color
Weight in kilograms
Explanation - Qualitative data describes attributes or qualities, such as colors or names, unlike numerical (quantitative) data.
Correct answer is: Eye color

Q.4 Which file format is most suitable for storing structured data?

JPEG
CSV
MP3
GIF
Explanation - CSV (Comma-Separated Values) format is commonly used for storing structured tabular data.
Correct answer is: CSV

Q.5 What is 'metadata'?

Data about data
Random information
Raw numbers
Duplicate content
Explanation - Metadata provides information about other data, such as file size, author, or creation date.
Correct answer is: Data about data

Q.6 Which of the following is an example of primary data?

A survey you conducted
A news article
A research paper summary
A Wikipedia entry
Explanation - Primary data is collected firsthand by the researcher, while others are secondary sources.
Correct answer is: A survey you conducted

Q.7 Which data structure is best for storing hierarchical data?

Array
Linked List
Tree
Stack
Explanation - Tree structures represent data in a hierarchy, such as organizational charts or file systems.
Correct answer is: Tree

Q.8 In databases, a 'tuple' represents:

A row in a table
A column in a table
The entire table
A set of tables
Explanation - In relational databases, a tuple corresponds to a row, while attributes correspond to columns.
Correct answer is: A row in a table

Q.9 What does 'big data' typically refer to?

Small datasets
Extremely large and complex datasets
Structured spreadsheets
Short-term memory
Explanation - Big data involves massive, complex datasets that traditional tools may not handle effectively.
Correct answer is: Extremely large and complex datasets

Q.10 Which of these is an example of unstructured data?

Relational database
Spreadsheet table
Social media posts
CSV file
Explanation - Unstructured data does not follow a clear format or model, like images, videos, and social media text.
Correct answer is: Social media posts

Q.11 Which operation combines two datasets based on a related column?

Union
Join
Sort
Filter
Explanation - A join operation merges datasets using a common attribute (like a foreign key).
Correct answer is: Join

Q.12 Which measurement scale allows ranking but not precise differences?

Nominal
Ordinal
Interval
Ratio
Explanation - Ordinal data indicates order (like ranks), but differences between ranks may not be equal.
Correct answer is: Ordinal

Q.13 Which of the following best represents binary data?

Yes/No responses
A written paragraph
An image caption
Temperature in Celsius
Explanation - Binary data has only two possible values, often represented as 0 and 1, or Yes and No.
Correct answer is: Yes/No responses

Q.14 A data warehouse is mainly used for:

Daily transaction processing
Long-term storage and analysis
Short-term cache memory
Running mobile apps
Explanation - Data warehouses are designed for storing large volumes of historical data and supporting decision-making.
Correct answer is: Long-term storage and analysis

Q.15 Which of the following is an example of discrete data?

Number of students
Height of a tree
Speed of a car
Temperature outside
Explanation - Discrete data can take only specific integer values, such as counts of objects.
Correct answer is: Number of students

Q.16 Which encoding system is widely used to represent characters in computers?

JPEG
ASCII
MP4
ZIP
Explanation - ASCII (American Standard Code for Information Interchange) is a character encoding standard widely used in computers.
Correct answer is: ASCII

Q.17 Data cleaning in data science refers to:

Deleting all files
Correcting or removing inaccurate records
Sorting numbers in order
Encrypting data for safety
Explanation - Data cleaning ensures datasets are accurate, consistent, and free of errors for analysis.
Correct answer is: Correcting or removing inaccurate records

Q.18 Which of these is NOT a data type?

Integer
String
Boolean
Processor
Explanation - Processor is hardware, while integer, string, and boolean are common data types.
Correct answer is: Processor

Q.19 Which step comes immediately after data collection?

Data storage
Data cleaning
Data visualization
Data sharing
Explanation - After collecting data, it is stored in databases or files before further processing.
Correct answer is: Data storage

Q.20 In statistics, what is a 'dataset'?

A single number
A collection of related data
A mathematical formula
A chart
Explanation - A dataset refers to a collection of related data points organized for analysis.
Correct answer is: A collection of related data

Q.21 Which of these represents time-series data?

Population by age group
Daily stock prices
Eye color categories
Exam grades
Explanation - Time-series data records values at different points in time, such as daily prices or monthly sales.
Correct answer is: Daily stock prices

Q.22 Which of these is NOT an example of structured data?

Database records
Spreadsheets
Audio recordings
CSV files
Explanation - Structured data follows organized models like rows and columns, whereas audio files are unstructured.
Correct answer is: Audio recordings

Q.23 Which of the following best defines 'data redundancy'?

Data stored securely
Data backed up online
Unnecessary duplication of data
Data analyzed for trends
Explanation - Data redundancy occurs when the same piece of data is stored multiple times unnecessarily.
Correct answer is: Unnecessary duplication of data

Q.24 Which field deals with extracting useful patterns from large datasets?

Cybersecurity
Data mining
Networking
Cloud computing
Explanation - Data mining is the process of discovering patterns, correlations, and trends from large datasets.
Correct answer is: Data mining

Q.25 Which representation is most commonly used for binary data in computers?

0 and 1
A and B
True and False only
Positive and Negative signs
Explanation - Binary data is represented in computers using the digits 0 and 1, which correspond to off/on states in hardware.
Correct answer is: 0 and 1