Computer Vision Quiz-1

Q1. Which of the following is not a good evaluation metric for Multi-label classification?

  1. Mean Average Precision at K
  2. Hamming Score
  3. Accuracy
  4. Top k categorical accuracy

Answer: 3
Explanation: Accuracy is not a good evaluation metric for Multi-label classification. As we know in multi-label each example can be assigned to multiple classes so let’s say if the predicted output was [0, 0, 0, 0, 1, 1,0] and the correct output was [1, 1, 0, 0, 0, 0, 0], my accuracy would still be 3/6 but it should be 0 as it is not able to predict any of the classes correctly.

Q2. Which of the following are the hyperparameters for a Pooling layer?

  1. filter size
  2. stride
  3. which type of Pooling to use (max or average)
  4. All of the above

Answer: 4
Explanation: All of the above mentioned are the hyperparameters for a Pooling layer.

Q3. Images are an example of ________ data?

  1. Structured
  2. Unstructured

Answer: 2
Explanation: Structured data refers to the type of data where each feature has a well defined meaning and opposite is true for unstructured data. So, images are an example of unstructured data.

Q4. For image classification, MaxPooling tends to works better than average pooling?

  1. Yes
  2. No

Answer: 1
Explanation: Because in image classification our main aim is to identify whether a feature is present or not so MaxPooling tends to works better than average pooling.

Q5. What is Pointwise Convolution?

  1. 1×1 convolution
  2. Strided Convolution
  3. convolution followed by MaxPool
  4. convolution followed by Dropout

Answer: 1
Explanation: According to the MobileNet paper, “After depthwise convolution, The pointwise convolution then applies a 1×1 convolution to combine the outputs of the depthwise convolution.”. Refer to Section 3.1 of this research paper to understand more.

Q6. What is a Region Proposal network?

  1. a fully convolutional network that simultaneously predicts object bounds and objectness scores at each position
  2. a fully connected network that simultaneously predicts object bounds and objectness scores at each position
  3. a fully convolutional network that predicts only the objectness scores at each position
  4. a fully connected network that predicts only the object bounds at each position

Answer: 1
Explanation: According to the Faster R-CNN paper, Region Proposal network (RPN) is a fully convolutional network that takes an image(of any size) as input and outputs a set of rectangular object proposals, each with an objectness score. Refer to Section 3.1 of this research paper to understand more.

Q7. In MobileNetv2, the Depthwise Separable Convolutions are replaced by _________ ?

  1. Normal Convolution
  2. Strided Convolution
  3. Bottleneck Residual Block (Inverted Residuals and Linear Bottleneck)
  4. Residual Blocks

Answer: 3
Explanation: In MobileNetv2, the Depthwise Separable Convolutions are replaced by Bottleneck Residual Block (Inverted Residuals and Linear Bottleneck). Refer to Table 1 of this research paper to understand more.

Q8. Can we use Convolutional Neural Networks for image classification?

  1. Yes
  2. No

Answer: 1
Explanation: Generally, Convolutional Neural Networks are preferred for any image related tasks such as image classification, object detection etc.

Leave a Reply