logo Hurry, Grab up to 30% discount on the entire course
Order Now logo

Ask This Question To Be Solved By Our ExpertsGet A+ Grade Solution Guaranteed

expert
Connor EvanssEconomics
(5/5)

947 Answers

Hire Me
expert
Nathan OlivieriMathematics
(5/5)

872 Answers

Hire Me
expert
Robin BlaiseData mining
(5/5)

755 Answers

Hire Me
expert
StatAnalytica ExpertTechnical writing
(5/5)

591 Answers

Hire Me
Python Programming

Implement the EM algorithm for fitting a Gaussian mixture model for the MNIST dataset. We reduce the dataset to be only two cases

INSTRUCTIONS TO CANDIDATES
ANSWER ALL QUESTIONS

Implementing EM algorithm for MNIST dataset.

Implement the EM algorithm for fitting a Gaussian mixture model for the MNIST dataset. We reduce the dataset to be only two cases, of digits "2" and "6" only. Thus, you will fit GMM with C = 2. Use the data file data.dat. True label of the data is provided in label.dat

The matrix images is of size 784-by-1990, i.e., there are totally 1990 images, and each column of the matrix corresponds to one image of size 28-by-28 pixels (the image is vectorized; the original image can be recovered, e.g., using MATLAB code, reshape(images(:,1),28, 28).

 

(a)Select from data one raw image of "2" and "6" and visualize them, respectively.

 

(b) Use random Gaussian vector with zero mean as initial means, and identity matrix as initial

covariance matrix for the clusters. Please plot the log-likelihood function versus the number of

iterations to show your algorithm is converging.

 

(c) Report the finally fitting GMM model when EM terminates: the weights for each component, the

mean vectors (please reformat the vectors into 28-by-28 images and show these images in your

submission). Ideally, you should be able to see these means corresponds to "average" images. No

need to report the covariance matrices.

 

(d) (Optional). Use the pic to infer the labels of the images, and compare with the true labels. Report

the miss classification rate for digits "2" and "6" respectively. Perform K-means clustering with

K = 2. Find out the miss classification rate for digits "2" and "6" respectively, and compare with

GMM. Which one achieves the better performance?

 

Related Questions

. The fundamental operations of create, read, update, and delete (CRUD) in either Python or Java

CS 340 Milestone One Guidelines and Rubric  Overview: For this assignment, you will implement the fundamental operations of create, read, update,

. Develop a program to emulate a purchase transaction at a retail store. This  program will have two classes, a LineItem class and a Transaction class

Retail Transaction Programming Project  Project Requirements:  Develop a program to emulate a purchase transaction at a retail store. This

. The following program contains five errors. Identify the errors and fix them

7COM1028   Secure Systems Programming   Referral Coursework: Secure

. Accepts the following from a user: Item Name Item Quantity Item Price Allows the user to create a file to store the sales receipt contents

Create a GUI program that:Accepts the following from a user:Item NameItem QuantityItem PriceAllows the user to create a file to store the sales receip

. The final project will encompass developing a web service using a software stack and implementing an industry-standard interface. Regardless of whether you choose to pursue application development goals as a pure developer or as a software engineer

CS 340 Final Project Guidelines and Rubric  Overview The final project will encompass developing a web service using a software stack and impleme