This set of MCQ(multiple choice questions) will help you with the answers of Data Analytics with Python NPTEL Week 11 Assignment Solutions.
Course layout
Week 1: Basics of Python Spyder
Week 2 : Introduction to probability
Week 3 : Sampling and sampling distributions
Week 4 : Hypothesis testing
Week 5 : Two sample testing and introduction to ANOVA
Week 6 : Two way ANOVA and linear regression
Week 7 : Linear regression and multiple regression
Week 8 : Concepts of MLE and Logistic regression
Week 9 : ROC and Regression Analysis Model Building
Week 10 : c2 Test and introduction to cluster analysis
Week 11 : Clustering analysis
Week 12 : Classification and Regression Trees (CART)
NOTE: You can check your answer immediately by clicking show answer button. Moreover, this set of “Data Analytics with Python NPTEL Week 11 Assignment Solution” contains 10 questions.
Now, start attempting the quiz.
Data Analytics with Python NPTEL Week 11 Assignment Solutions
Q1. ________ is used for calculating distance measures in clustering using python
a) distance_matrix
b) spatial_matrix
c) scipy_matrix
d) distance.matrix
Answer: a) distance_matrix
Q2. The formula for dissimilarity computation between two objects for categorical variables is –
Here p is a categorical variable and m denotes the number of matches.
a) D(i,j) = p-m / p
b) D(i,j) = p-m / m
c) D(i,j) = m-p / p
d) D(i,j) = m-p / m
Answer: a) D(i,j) = p-m / p
Q3. Select the correct option for a data set with 7 objects and an interval-scaled variable ‘f’ we have the following measurements:
f = (1, 2, 3, 4, 5, 8, 50) containing one outlying value.
a) Std deviation (std_f) and mean absolute deviation (s_f) are equally affected
b) Mean absolute deviation (s_f) is more affected by the outlier
c) Std deviation (std_f) is more affected by the outlier
d) None of these
Answer: c) Std deviation (std_f) is more affected by the outlier
Q4. Which of the following is true for K-means clustering?
a) It comes under the partitioning method
b) The number of clusters is predefined for this method
c) Cluster similarity is measure in regard to the mean value of the objects in a cluster
d) All of the above
Answer: d) All of the above
Q5. Which of the following can act as possible termination conditions in K-Means?
1. For a fixed number of iterations.
2. Assignment of observations to clusters does not change between iterations. Except for cases with a bad local minimum.
3. Centroids do not change between successive iterations.
4. Terminate when Residual Sum of Squares (RSS) falls below a threshold.
a) 1,3 and 4
b) 1,2,3 and 4
c) 2 and 3
d) None of these
Answer: b) 1,2,3 and 4
Q6. In the figure below, if you draw a horizontal line on y-axis for y=2. What will be the number of clusters formed?
a) 1
b) 2
c) 3
d) 4
Answer: b) 2
Q7. Which of the following clustering requires merging approach?
a) Partitional
b) Naive Bayes
c) Hierarchical
d) None of these
Answer: c) Hierarchical
Q8. State True or False: Hierarchical clustering should primarily be used for exploration
a) True
b) False
Answer: a) True
Q9. State True or False: For finding dissimilarity between two clusters in hierarchical clustering, average-link is the only metric used
a) True
b) False
Answer: b) False
Q10. Hierarchical clustering can either be an agglomerative or divisive algorithm
a) True
b) False
Answer: a) True
<< Prev- Data Analytics with Python Week 9 Assignment Solutions
>> Next-Data Analytics with Python Week 11 Assignment Solutions
DISCLAIMER: Use these answers only for the reference purpose. Quizermania doesn't claim these answers to be 100% correct. So, make sure you submit your assignments on the basis of your knowledge.
For discussion about any question, join the below comment section. And get the solution of your query. Also, try to share your thoughts about the topics covered in this particular quiz.