Kmeans algorithm using MapReduce to cluster the provided data set.
$10-30 USD
Cancelado
Publicado hace más de 5 años
$10-30 USD
Pagado a la entrega
A 2-d data with 100,000 instances
is provided.
Using K means Map Reduce in Hadoop with Python, Write the code in python to do below steps
Run the K means algorithm in a single iteration with a different number of k: k =60, k= 80 and k=100.
Report the execution time with different k in a single iteration: k =60, k= 80 and k=100.
Run the algorithm with 30 iterations with a k= 100. Report the results (the final centers and the assignments for each center).
Visualize the result using a 2-d plot.
I have extensive knowledge of implementing the k-means and k-mean++ algorithm in R. I have done it with many data sets multiple times so I am familiar with the process.
• 5 + years of IT Experience in developing Object Orientation, Client /Server and web base application/Internet application using J2EE Internet Technologies.
• 2 + years of IT Experience in Big data.