Data Analytics with Python | NPTEL | Week 11 Assignment Solutions

This set of MCQ(multiple choice questions) will help you with the answers of Data Analytics with Python NPTEL Week 11 Assignment Solutions.

Course layout

Week 1: Basics of Python Spyder
Week 2 : Introduction to probability
Week 3 : Sampling and sampling distributions
Week 4 : Hypothesis testing
Week 5 : Two sample testing and introduction to ANOVA
Week 6 : Two way ANOVA and linear regression
Week 7 : Linear regression and multiple regression
Week 8 : Concepts of MLE and Logistic regression
Week 9 : ROC and Regression Analysis Model Building
Week 10 : c² Test and introduction to cluster analysis
Week 11 : Clustering analysis
Week 12 : Classification and Regression Trees (CART)

NOTE: You can check your answer immediately by clicking show answer button. Moreover, this set of “Data Analytics with Python NPTEL Week 11 Assignment Solution” contains 10 questions.

Now, start attempting the quiz.

Data Analytics with Python NPTEL Week 11 Assignment Solutions

Q1. ________ is used for calculating distance measures in clustering using python

a) distance_matrix
b) spatial_matrix
c) scipy_matrix
d) distance.matrix

Answer: a) distance_matrix

Q2. The formula for dissimilarity computation between two objects for categorical variables is –
Here p is a categorical variable and m denotes the number of matches.

a) D(i,j) = p-m / p
b) D(i,j) = p-m / m
c) D(i,j) = m-p / p
d) D(i,j) = m-p / m

Answer: a) D(i,j) = p-m / p

Q3. Select the correct option for a data set with 7 objects and an interval-scaled variable ‘f’ we have the following measurements:
f = (1, 2, 3, 4, 5, 8, 50) containing one outlying value.

a) Std deviation (std_f) and mean absolute deviation (s_f) are equally affected
b) Mean absolute deviation (s_f) is more affected by the outlier
c) Std deviation (std_f) is more affected by the outlier
d) None of these

Answer: c) Std deviation (std_f) is more affected by the outlier

Q4. Which of the following is true for K-means clustering?

a) It comes under the partitioning method
b) The number of clusters is predefined for this method
c) Cluster similarity is measure in regard to the mean value of the objects in a cluster
d) All of the above

Answer: d) All of the above

Q5. Which of the following can act as possible termination conditions in K-Means?
1. For a fixed number of iterations.
2. Assignment of observations to clusters does not change between iterations. Except for cases with a bad local minimum.
3. Centroids do not change between successive iterations.
4. Terminate when Residual Sum of Squares (RSS) falls below a threshold.

a) 1,3 and 4
b) 1,2,3 and 4
c) 2 and 3
d) None of these

Answer: b) 1,2,3 and 4

Q6. In the figure below, if you draw a horizontal line on y-axis for y=2. What will be the number of clusters formed?

a) 1
b) 2
c) 3
d) 4

Answer: b) 2

Q7. Which of the following clustering requires merging approach?

a) Partitional
b) Naive Bayes
c) Hierarchical
d) None of these

Answer: c) Hierarchical

Q8. State True or False: Hierarchical clustering should primarily be used for exploration

a) True
b) False

Answer: a) True

Q9. State True or False: For finding dissimilarity between two clusters in hierarchical clustering, average-link is the only metric used

a) True
b) False

Answer: b) False

Q10. Hierarchical clustering can either be an agglomerative or divisive algorithm

a) True
b) False

Answer: a) True

<< Prev- Data Analytics with Python Week 9 Assignment Solutions

>> Next-Data Analytics with Python Week 11 Assignment Solutions

DISCLAIMER: Use these answers only for the reference purpose. Quizermania doesn't claim these answers to be 100% correct. So, make sure you submit your assignments on the basis of your knowledge.

For discussion about any question, join the below comment section. And get the solution of your query. Also, try to share your thoughts about the topics covered in this particular quiz.

Course layout

Data Analytics with Python NPTEL Week 11 Assignment Solutions

Related Posts

Leave a Comment Cancel Reply