Data Analysis

Cerrado Publicado hace 5 años Pagado a la entrega
Cerrado

Function for Calculating Z-scores for data in a Numpy Array

Concept:

As a preliminary step in data analysis, certain types of data are transformed to ensure “good behavior” and “compatibility with other data.” One such transformation is the Z-score. Two series that has been transformed using the Z-score are more easily compared: [login to view URL]

[login to view URL]

In this project, you calculate Z-scores. You will be using your knowledge of Numpy. As well, you learn about the nuances of constructing a function. Also, if you check the links above, you will learn a useful concept in statistics.

Requirement:

The basic requirement is that you produce a function that takes a numpy array as input and outputs an array of the same shape in which the data is transformed through Z-scores. This means that if the array is (5,2), then for each of the two columns, 5 values are used to calculate the mean and sigma and for each of the values in the column, you calculate: (value – mean)/sigma.

A google search “z-score numpy array” will give you plenty of ideas about implementation and you could achieve an efficient implementation with just one line of code! So let me throw in an enhanced requirement: the user also inputs a scalar value (the second input) to indicate the desired column for transformation. Thus, only the specified column is transformed and the other(s) are left alone. For example, if the user inputs a (10,4) array and inputs a column-indicating scalar with a value of 3, this means that only the third column requires transformation (the three remaining columns are left untouched). If the user does not input this scalar, the default is that all columns are transformed. (Hint: a low-level familiarity with Linear Algebra can quickly guide you to an elegant answer here; you can use 1D arrays of ones for sigma and zeros for means and modify them appropriately to complete your calculation. To elaborate, if you use mean = 0 and sigma = 1, the transformation does not change value. Alternatively, use loops and brute force! Whatever works!)

Finally, I want you to implement checks on whether the inputs make sense. For example, is the first input a numpy array (print error message if it is something else) and is the second input appropriate (print error message if scalar does not correspond to a column number)?

Submission:

You are required to code in an ipython notebook and create a pdf. Show your function in one cell – call the function myz. In another cell, create inputs using the following code:

x1 = [login to view URL]([[4,3,12],[1,5,20],[1,2,3],[10,20,40],[7,2,44]])

x2 = 3

x3 = 6

Run the function four times – myz(x1), myz(x1,x2), myz(x1,x3), and myz(x2,x3) – and generate results. In the next cell, please bullet-list things you learned by doing this project. Print to pdf and submit pdf.

Skills: Data Science, Python, Statistical Analysis, Statistics

See more: z score definition, how to calculate z score in excel, z score probability, z score to percentile, z scores table, z value calculator normal distribution, raw score calculator, how to find z score without standard deviation, low level bluetooth programming, low level network problem, net low level programming, low level network interface, preparing low level design document project net, desktop sharing low level, low level programming, low level programming cnet, low level network card, low level access network, money programming low level, low level ways network

Ciencia de datos Python

Nº del proyecto: #17799369

Sobre el proyecto

12 propuestas Proyecto remoto Activo hace 5 años

12 freelancers están ofertando un promedio de ₹309 / hora por este trabajo

anupkelkar02

I am a Python data science expert with experience in Classification and partitioning, Numpy, Pandas, Neural Networks, Association rules, Machine Learning Model, Tensorflow. Also I am a Oracle Certified profession Más

₹500 INR / hora
(42 comentarios)
6.0
invincible1428

Hello, Greetings of the day.!! Your project attracted my attention at first glance, because I've really rich experience in Machine Learning & Python Programming. I have worked on several similar projects before! Más

₹277 INR / hora
(36 comentarios)
5.2
suyashdhoot

Hi I am a very experienced statistician, data scientist and academic writer. I have completed several PhD level thesis projects involving advanced statistical analysis of data. I have worked with data from several comp Más

₹1000 INR / hora
(23 comentarios)
5.5
NoelReubenMuturi

Hello, I am Noel, a proficient statistician. I am proficient in statistical analysis and statistical software such as Excel, SPSS, and R Statistics. I am qualified to take up the project and would love to speak with yo Más

₹277 INR / hora
(0 comentarios)
1.1
aamirk306

I have worked on several similar projects before! I'm really confident about your project, and very eager to join your project.

₹277 INR / hora
(0 comentarios)
0.0
Amitkr333

i have sound knowledge on z score and other hypothesis testing methods . worked on multiple data set to find t score /z score depending on population size and parameters

₹277 INR / hora
(0 comentarios)
0.0
siaditya

It is very similar to the work I have done before

₹333 INR / hora
(0 comentarios)
0.0
iamarebel15

1)competitor on Kaggle and Hackaearth 2)good analytical and statistical knowledge 3)Anomaly Detection project which I had done required a lot of knowledge in Confidence Interval,Z-score,Gaussian estimation etc

₹111 INR / hora
(0 comentarios)
0.0
rma491

Hey! Your project looks very interesting. I have experience with python and coding in general and I am very looking forward to learn new things. I am doing this work not for the money but to learn more skills for m Más

₹277 INR / hora
(0 comentarios)
0.0
jayeshthukarul

Hands on experience on numpy. Good with linear algebra. Sound knowledge of statistics.

₹100 INR / hora
(0 comentarios)
0.0
lantern111

I have good hands on knowledge in python numpy array and stat functions.

₹111 INR / hora
(0 comentarios)
0.0
ThamizhD

in currently doing post graduate programme in machine learning.... I'm more interested in this... had worked on small projects with numpy , statistics etc...

₹166 INR / hora
(0 comentarios)
0.0