Using machine learning optimal number of cluster should be formed on the dataset
₹1500-12500 INR
Terminado
Publicado hace alrededor de 5 años
₹1500-12500 INR
Pagado a la entrega
On protein sequences we will use clustering algorithm . The data used is the 33 (α/β)8-barrel proteins belonging to the glycoside hydrolases family 2 from the CAZy database. The 33 proteins are divided into five subfamilies, namely, Ga (for β-galactosidase), GI (for β-glucuronidase), Cs (for exo-β-D-glucosaminidase), Ma, and Un (for β-mannosidase), where each protein is represented as a sequence of symbols from the alphabet set {A, C, D, E, F, G, H, I, K, L, M, N, P, Q, R, S, T, V, W, X, Y}. The lengths of the sequences vary from 598 to 1270. These sequences are multimodular with various types of catalytic modules, known as “(α/β)8-barrel”. By this experiment task is to identify the correct number of clusters (K = 5), in terms of such structural characteristics hidden in the sequences.
To validate the quality of a series of clustering results, each generated by the clustering algorithm on the same sequences set S with various numbers of sequences clusters Cluster Validation Index is used
Hello Sir/Madam,
I cannot see your username unless you send me a message. Are you shreyaupatil? Today i couldn't send you message because you deleted or cancelled your project. Please contact me to discuss the details.
If you are not shreyaupatil sorry for inconvenience.
I am a highly experienced and skilled Computer engineer. I have 9 years of experience in the navy as a network and information security officer. Also i have experience as a freelancer.
I have Master Degree and PhD. candidate on Computer Engineering. Data Mining, Machine Learning and Computer Vision are my specialties. For these reasons you can trust my theoretical knowledge and practical experience and i believe that i am the best freelancer for you.
Please feel free to ask and discuss the details of your project on the live chat.
Regards.
₹7.777 INR en 10 días
4,8 (1 comentario)
2,0
2,0
4 freelancers están ofertando un promedio de ₹13.889 INR por este trabajo
Dear sir.
Your project attracted my attention at first glance, because I've extensive experience in Machine Learning & Matlab Programming.
I'm really confident about your project, and very eager to join your project.
If we have a chance to cooperate, I'll do my best to provide wonderful result.
Looking forward to your response.
Best Regards.
Hi I am a very experienced biostatistician, data scientist and bioinformatician. I have completed several PhD level thesis projects involving advanced statistical analysis of data. I have worked with data from several companies and have done projects involving high level quantitative analysis and data interpretation skills to study the trends, time behaviour and compare the variables in the data. I can do advanced level analysis in SPSS, R, PYTHON, WEKA, TABLEAU and EXCEL tools like machine learning, hypothesis testing, forecasting, T-test, ANOVA etc. I am experienced in proteomics, genomics and metabolomics.
Looking forward to discussion,
Best Regards,
Suyash