Basic Concepts:
What is a decision tree in machine learning?
A. A method for clustering data points
B. A supervised learning algorithm used for classification and regression
C. An unsupervised learning technique for dimensionality reduction
D. A technique for visualizing high-dimensional data
Answer: B
In a decision tree, what are internal nodes responsible for?
A. Representing the root node of the tree
B. Making predictions based on input features
C. Connecting nodes to form branches
D. Testing specific conditions on input features
Answer: D
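As a rough illustration of the answer above, the sketch below stores a tiny tree as nested dictionaries: internal nodes test a condition on one feature, and only leaf nodes carry a prediction. The feature names (`humidity`, `wind`), thresholds, and labels are hypothetical and chosen only for the example.

```python
# A hand-built tree: internal nodes hold a test, leaves hold a label.
# All feature names, thresholds, and labels here are made up for illustration.
tree = {
    "feature": "humidity", "threshold": 70,
    "left": {"label": "play"},
    "right": {
        "feature": "wind", "threshold": 20,
        "left": {"label": "play"},
        "right": {"label": "stay"},
    },
}

def predict(node, sample):
    """Follow the tests at internal nodes until a leaf is reached."""
    while "label" not in node:
        # Internal node: compare one input feature against its threshold.
        branch = "left" if sample[node["feature"]] <= node["threshold"] else "right"
        node = node[branch]
    return node["label"]

print(predict(tree, {"humidity": 65, "wind": 30}))
print(predict(tree, {"humidity": 80, "wind": 30}))
```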
How are categorical variables commonly handled when a decision tree implementation requires numeric inputs?
A. By converting them into binary (one-hot) variables
B. By using them directly without any transformation
C. By clustering them into distinct groups
D. By applying logistic regression to them
Answer: A
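One common way to convert a categorical variable into binary variables is one-hot encoding. The helper below is a minimal sketch with a hypothetical name (`one_hot`); real projects would more likely rely on a library encoder.

```python
def one_hot(values):
    """Return the sorted category list and one binary column per category."""
    categories = sorted(set(values))
    encoded = [[1 if v == c else 0 for c in categories] for v in values]
    return categories, encoded

colors = ["red", "green", "red", "blue"]  # illustrative data
categories, encoded = one_hot(colors)
print(categories)
for row in encoded:
    print(row)
```

Each row contains exactly one 1, in the column of the category that the original value belonged to.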
What is pruning in the context of decision trees?
A. Adding more branches to increase model complexity
B. Removing branches to prevent overfitting
C. Standardizing the splits across different nodes
D. Converting continuous variables into categorical variables
Answer: B
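One widely used form of pruning is cost-complexity pruning, which scores a tree by its error rate plus a penalty proportional to its number of leaves. The sketch below only compares two candidate trees under that score; the function name and all counts are hypothetical numbers chosen for illustration.

```python
def cost_complexity(misclassified, n_samples, n_leaves, alpha):
    """Error rate plus a penalty of alpha per leaf (smaller is better)."""
    return misclassified / n_samples + alpha * n_leaves

# Hypothetical trees evaluated on 100 samples.
full_tree = {"misclassified": 2, "n_leaves": 12}
pruned_tree = {"misclassified": 5, "n_leaves": 4}

for alpha in (0.0, 0.01):
    full = cost_complexity(full_tree["misclassified"], 100, full_tree["n_leaves"], alpha)
    pruned = cost_complexity(pruned_tree["misclassified"], 100, pruned_tree["n_leaves"], alpha)
    winner = "pruned" if pruned < full else "full"
    print(f"alpha={alpha}: full={full:.2f}, pruned={pruned:.2f}, prefer {winner}")
```

With no penalty the larger tree wins on raw error; once each leaf carries a cost, the smaller tree is preferred even though it makes a few more mistakes.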
What is entropy used for in decision tree algorithms?
A. To measure the impurity or randomness of a dataset
B. To calculate the variance of the target variable
C. To normalize the distribution of residuals
D. To penalize complex models
Answer: A
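To make the impurity idea concrete, here is a minimal sketch of Shannon entropy computed from a list of class labels. The function name and the sample labels are assumptions made for the example.

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy, in bits, of a sequence of class labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    # Sum p * log2(1/p) over the observed classes.
    return sum((c / n) * math.log2(n / c) for c in Counter(labels).values())

pure = ["yes", "yes", "yes", "yes"]   # one class only: no impurity
mixed = ["yes", "yes", "no", "no"]    # evenly split: maximum impurity for two classes
print(entropy(pure))
print(entropy(mixed))
```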
Splitting Criteria:
Which criterion is commonly used to measure impurity in classification trees?
A. Mean Squared Error (MSE)
B. Information Gain (IG)
C. Gini Index
D. Variance
Answer: C
How does the Gini Index differ from Information Gain as a splitting criterion?
A. Gini Index prefers splits that maximize the information gain.
B. Gini Index is more sensitive to outliers compared to Information Gain.
C. Gini Index is based on the variance of the target variable.
D. Gini Index measures the probability that a randomly chosen sample would be misclassified if it were labeled at random according to the node's class distribution.
Answer: D
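The Gini Index of a node can be computed as one minus the sum of squared class proportions. The sketch below uses a hypothetical helper name and made-up labels.

```python
from collections import Counter

def gini(labels):
    """Gini impurity: 1 - sum of squared class proportions."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

print(gini(["a", "a", "a", "a"]))  # pure node
print(gini(["a", "a", "b", "b"]))  # evenly mixed node
print(gini(["a", "a", "a", "b"]))  # mostly one class
```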
When constructing a decision tree, what is the role of the splitting criterion?
A. To evaluate the statistical significance of each feature
B. To choose the best feature and threshold for splitting at each node
C. To standardize the coefficients of the independent variables
D. To preprocess data for analysis
Answer: B
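The role of the splitting criterion can be sketched as an exhaustive search: for every feature and candidate threshold, score the resulting split and keep the best one. This is a simplified illustration that uses weighted Gini impurity and midpoints between distinct values as candidates; the function names and the toy data are assumptions, and real implementations are more elaborate.

```python
from collections import Counter

def gini(labels):
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

def best_split(rows, labels):
    """Return (feature_index, threshold, weighted_gini) of the lowest-impurity split."""
    best = None
    n = len(rows)
    for f in range(len(rows[0])):
        values = sorted({r[f] for r in rows})
        # Candidate thresholds: midpoints between consecutive distinct values.
        for lo, hi in zip(values, values[1:]):
            t = (lo + hi) / 2
            left = [y for r, y in zip(rows, labels) if r[f] <= t]
            right = [y for r, y in zip(rows, labels) if r[f] > t]
            score = (len(left) * gini(left) + len(right) * gini(right)) / n
            if best is None or score < best[2]:
                best = (f, t, score)
    return best

rows = [(2.0, 7.0), (3.0, 1.0), (6.0, 8.0), (7.0, 2.0)]  # toy data, two features
labels = ["a", "a", "b", "b"]
print(best_split(rows, labels))
```

On this toy data the first feature separates the two classes perfectly, so the search selects it with a weighted impurity of zero.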
What does the term “pruning” refer to in decision trees?
A. The process of removing outliers from the dataset
B. The method of handling missing values in variables
C. The technique for reducing the size of the tree to avoid overfitting
D. The step of transforming categorical variables into numerical ones
Answer: C
Which splitting criterion is preferred when the target variable is continuous, as in regression trees?
A. Gini Index
B. Information Gain
C. Mean Squared Error (MSE)
D. Root Mean Squared Error (RMSE)
Answer: C
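For a continuous target, a split can be scored by the weighted mean squared error of the two child nodes around their own means. The following is a minimal sketch with hypothetical function names and toy values.

```python
def mse(values):
    """Mean squared error of values around their own mean."""
    if not values:
        return 0.0
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values) / len(values)

def split_mse(xs, ys, threshold):
    """Weighted MSE of the two children produced by splitting on x <= threshold."""
    left = [y for x, y in zip(xs, ys) if x <= threshold]
    right = [y for x, y in zip(xs, ys) if x > threshold]
    return (len(left) * mse(left) + len(right) * mse(right)) / len(ys)

xs = [1.0, 2.0, 3.0, 4.0]        # toy feature values
ys = [10.0, 10.0, 20.0, 20.0]    # toy continuous targets
print(mse(ys))                   # impurity before splitting
print(split_mse(xs, ys, 2.5))    # split that separates the two target levels
print(split_mse(xs, ys, 1.5))    # a worse split
```

The split with the lowest weighted MSE is the one a regression tree would choose under this criterion.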
Model Evaluation and Applications:
How does a decision tree handle missing values during training?
A. It imputes missing values using the mean of the variable.
B. It removes instances with missing values from the dataset.
C. It assigns missing values to a separate category.
D. It splits the node based on available data.
Answer: D
What does the term “feature importance” refer to in decision trees?
A. The complexity of the decision tree model
B. The significance of each feature in making accurate predictions
C. The distribution of residuals in the dataset
D. The number of nodes and branches in the tree
Answer: B
How does a decision tree model prevent overfitting?
A. By using regularization techniques like Ridge or Lasso
B. By increasing the number of nodes and branches
C. By pruning the tree to reduce its size
D. By converting continuous variables into categorical variables
Answer: C
In what scenarios would you prefer using a decision tree over other machine learning algorithms?
A. When dealing with high-dimensional data
B. When the relationships between variables are linear
C. When transparency and interpretability are important
D. When there are multicollinearity issues among predictors
Answer: C
How does the depth of a decision tree affect its performance?
A. Deeper trees fit the training data more closely but may overfit and generalize poorly.
B. Shallower trees are more accurate in predicting outcomes.
C. Deeper trees are less sensitive to changes in the dataset.
D. Shallower trees are more computationally expensive to train.
Answer: A
Practical Considerations and Interpretations:
What does a classification tree output for a given instance in a dataset?
A. The predicted class label
B. The value of the target variable
C. The distribution of residuals
D. The p-value of the regression coefficients
Answer: A
How does the CART algorithm differ from other decision tree algorithms?
A. CART uses the Gini Index as its default criterion for splitting.
B. CART is specifically designed for regression tasks only.
C. CART cannot handle categorical variables.
D. CART is more computationally intensive compared to other algorithms.
Answer: A
What is the primary advantage of combining decision trees in ensemble methods like Random Forests?
A. The ensemble reduces the variance of predictions and improves accuracy.
B. They increase the bias of the model and prevent overfitting.
C. They standardize the coefficients of independent variables.
D. They simplify complex relationships between variables.
Answer: A
How does the computational cost of training a decision tree scale with the number of training samples in the worst case, when the tree is highly unbalanced?
A. Linearly
B. Quadratically
C. Logarithmically
D. Exponentially
Answer: B
What is the primary disadvantage of using decision trees in machine learning?
A. They are prone to overfitting, especially with noisy data.
B. They cannot handle both categorical and numerical variables.
C. They require extensive preprocessing of data.
D. They are computationally expensive for large datasets.
Answer: A