Define expected value

The expected value of a random variable is the sum of all the values of the random variable multiplied by their probabilities.

Jenny = 1, Lora = 2, Sanya = 3

P(1) = 0.1

P(2) = 0.5

P(3) = 0.3

expected_one_to_call = 1*0.1 + 2*0.5 + 3*0.3 = 2

Our expected value is 2 (Lora).
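The same calculation can be sketched in Python. This is a minimal illustration that reuses the values and probabilities from the card as given (note that, as listed, the probabilities sum to 0.9); the list names `values` and `probabilities` are assumptions made for the example.

```python
# Values and probabilities copied from the card above.
# 1 = Jenny, 2 = Lora, 3 = Sanya
values = [1, 2, 3]
probabilities = [0.1, 0.5, 0.3]

# Expected value: sum of each value multiplied by its probability.
expected_one_to_call = sum(v * p for v, p in zip(values, probabilities))

# Round to hide floating-point noise in the last digits.
print(round(expected_one_to_call, 2))  # 2.0
```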

The solution for graphical representation of discrete variables is a histogram. But that doesn't work for continuous variables. What are the solutions?

One way to visualize the distribution of continuous variables is to divide the set of possible values into intervals and count the number of values in each interval.

When plotting a histogram in pandas, you can set the number of intervals (bins) and set their boundaries explicitly:

data.hist(bins=[value1, value2, ..., valueN])

To overcome the difficulties of creating a histogram for a continuous variable, we can use a slightly different technique that represents frequency not as the height of a column but as its area (the width of the interval times the height of the column). This area is the frequency of the continuous variable in that interval, and the height of the column is the frequency density. A histogram that uses frequency density is called a density histogram. To estimate how many values fall between two given values, find the total area of the density histogram between them. The number you get will be an estimate of the number of values in that interval.
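As a rough sketch of the idea, frequency density can be computed by dividing each bin's count by its width, so that width times height gives back the frequency. The sample data, the random seed, and the variable names below are hypothetical and only serve the illustration.

```python
import numpy as np

# Hypothetical sample of a continuous variable (not from the original notes).
rng = np.random.default_rng(0)
data = rng.uniform(16, 100, size=1000)

# Uneven interval boundaries.
bins = [16, 18, 20, 58, 100]

# Count the values in each interval and measure each interval's width.
counts, edges = np.histogram(data, bins=bins)
widths = np.diff(edges)

# Frequency density: column height chosen so that area = frequency.
frequency_density = counts / widths

# Estimate how many values lie between 18 and 58 by summing the areas
# of the columns that cover that range (the 2nd and 3rd bins).
estimated = (frequency_density[1:3] * widths[1:3]).sum()
print(estimated)
```

Because the range here lines up exactly with bin edges, the summed area equals the actual count in those bins; for a range that cuts through a bin, the area is only an estimate.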

Data with negative skew (skewed to the left) has ...

Data with negative skew (skewed to the left) has a mean that is less than the median. Because the mean is pulled below the median by the long left tail, more values typically lie above the mean than below it.

How do you compute the standard deviation in Python? What library do you use?

import numpy as np

standard_deviation = np.std(dataset)

Data with positive skew (skewed to the right) has...

Data with positive skew (skewed to the right) has a mean that is greater than the median. Because the mean is pulled above the median by the long right tail, more values typically lie below the mean than above it.

Find the dispersion (variance) of data

import numpy as np

np.var(data)
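A small worked example may help connect variance and standard deviation. The array below is hypothetical; `np.var` and `np.std` both use the population formula (`ddof=0`) by default.

```python
import numpy as np

# Hypothetical dataset with mean 5.
data = np.array([2, 4, 4, 4, 5, 5, 7, 9])

# Variance: mean of squared deviations from the mean.
variance = np.var(data)
print(variance)  # 4.0

# Standard deviation is the square root of the variance.
standard_deviation = np.std(data)
print(standard_deviation)  # 2.0
```

For a sample estimate rather than a population value, passing `ddof=1` to either function divides by n - 1 instead of n.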

What is frequency density?

Frequency density is the height of a histogram column whose area reflects the relative frequency of a continuous variable.

Define a discrete variable.

A discrete variable is any variable that is not continuous on any range (for example, a variable that takes only the integer values from 0 to 100).

What is a density histogram?

A density histogram is a histogram that uses frequency density.

What is μ?

μ (the Greek letter mu) is the symbol often used to denote the mean.

Build a histogram of data with the interval boundaries set to 16, 18, 20, 58, 100

data.hist(bins=[16, 18, 20, 58, 100])

If mean > median, is it a positive or negative skew?

If the mean is greater than the median, then the data has positive skew. Similarly, if the mean is less than the median, then the data has negative skew.
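The mean-versus-median rule can be sketched as a quick check. The helper `skew_direction` and both sample arrays are hypothetical, and the comparison is a rule of thumb rather than a formal skewness measure.

```python
import numpy as np

def skew_direction(values):
    """Classify skew by comparing the mean with the median (rule of thumb)."""
    mean = np.mean(values)
    median = np.median(values)
    if mean > median:
        return "positive"
    if mean < median:
        return "negative"
    return "none"

# Hypothetical data with a long right tail: mean 4.75, median 3.
right_tailed = np.array([1, 2, 2, 3, 3, 3, 4, 20])

# Hypothetical data with a long left tail: mean 16.25, median 18.
left_tailed = np.array([1, 17, 18, 18, 18, 19, 19, 20])

print(skew_direction(right_tailed))  # positive
print(skew_direction(left_tailed))   # negative
```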

