Using the frequency distribution table below, what is the proportion of individuals that scored a 4?

Frequency Distributions 
G&W Ch. 2

Parameters and Statistics

When describing data, it is necessary to distinguish whether the data come from a population or a sample.

Typically, every population parameter has a corresponding sample statistic.

- Parameter—a value that describes a population

- Statistic—a value that describes a sample

Descriptive Statistics

- techniques used to summarize, organize, and simplify data

- can't look at it all - get a quick, good impression

Inferential Statistics

-  techniques used to study samples and then make generalizations about the populations from which they were selected

Variables

discrete - separate categories.  No values can exist between two neighboring categories (e.g., dice)

continuous - infinite fineness.  There are an infinite number of possible values that fall between any two observed values (e.g., time)

- each score corresponds to an interval of the scale

- the boundaries that separate these intervals are called real limits

Frequency Distribution

Goal: simplify the organization and presentation of data

Definition: lists or displays graphically the number of individuals located in each category on the scale of measurement

-  takes a disorganized set of scores and places them in order from highest to lowest, grouping together all individuals who have the same score

-  see the set of scores “at a glance”

-  frequency distribution can be structured either as a table or graph but both display (1) all categories that made up the measurement scale and (2) the number of individuals in each category

Frequency Distribution Tables

data: 3,1,1,2,5,4,4,5,3,5,3,2,3,4,3,3,4,3,2,3

-  X—categories of the measurement scale (usually listed highest to lowest). To calculate the sum of scores, must use both X and f columns

-  f—frequency, number of individuals in that category

- To obtain the total number of individuals in the data set, add up the frequencies

-  p—proportion, the proportion of the total number of responses that fall into this category  (p = f/N)

Cumulative frequencies (cf) show the number of individuals located at or below each score

- Cumulative percentages (c%) show the percentage of individuals accumulated as move up the scale

              X     f      p       %         cf    c%

              5     3     .15    15%     20   100%

              4     4     .20    20%     17   85%

              3     8     .40    40%     13   65%

              2     3     .15    15%     5     25%

              1     2     .10    10%     2     10%

Grouped frequency distribution tables—group the scores into intervals and list these intervals in the frequency distribution table.  The wider the interval, the more information that is lost.  Should have about 5-10 intervals depending on range of data.  Width of each interval should be an easy number (5 or 10) and all intervals should be the same width.

rank or percentile rank—the percentage of individuals in the distribution with scores at or below the particular value.

Frequency distribution graphs/charts

- x-axis (abscissa)—lists the measurement scale categories

- y-axis (ordinate)—lists the frequencies

histogram—a bar is drawn above each X value, so that the height of the bar corresponds to the frequency of the score.  If data is from interval or ratio scale, the bars are draw on so that adjacent bars touch each other.  The touching bars produce a continuous figure, which emphasizes the continuity of the variable. (SPSS? yes)

bar graph—like a histogram, a bar is drawn above each X value, so that the height of the bar corresponds to the frequency of the score. If data is from nominal or ordinal scale, graph is constructed with space between the bars. (SPSS? yes)

frequency distribution polygon—used with interval or ratio scales instead of a histogram.  A single dot is drawn above each score so that the height of the dot corresponds to the frequency. (SPSS? no)

The Shape of a Frequency Distribution

Measures of Shape of a Distribution

Measures to what extent a distribution of values deviates from symmetry around the mean. A value of zero represents a symmetrical or balanced distribution. Skewness values between +/- 1.0 are considered excellent for most purposes, but values between +/- 2.0 are also acceptable in many cases, especially for basic research.

Positive skew--Tail is in the positive direction.  There are fewer larger scores than we would expect with a normal distribution.

Negative skew--Tail is in the negative direction.  There are fewer smaller scores than we would expect with a normal distribution.

Measures the "peakedness" or "flatness" of a distribution. A value of zero indicates the shape is close to a normal distribution.  A positive value indicates a distribution is more peaked than normal.  A negative value indicates a distribution is flatter than normal.  As with skewness, a value of zero represents a distribution that is shaped very similarly to a normal distribution. Kurtosis values between +/- 1.0 are considered excellent for most purposes, but values between +/- 2.0 are also acceptable in many cases, especially for basic research.

A frequency is the number of times a data value occurs. For example, if ten students score 90 in statistics, then score 90 has a frequency of 10. A frequency is a count of the occurrences of values within a data-set. Cumulative frequency is used to determine the number of observations below a particular value in a data set. The cumulative frequency is calculated by adding each frequency from a frequency distribution table to the sum of its predecessors. The last value will always be equal to the total for all data. A relative frequency is a frequency divided by a count of all values. Relative frequencies can be written as fractions, percents, or decimals. Cumulative relative frequency is the accumulation of the previous relative frequencies. The last value will always be equal to 1.Simple. First-type data elements (separated by spaces or commas, etc.), then type f: and further write the frequency of each data item. Each element must have a defined frequency that count of numbers before and after symbol f: must be equal. For example:

1.1 2.5 3.99
f: 5 10 15

Grouped data are formed by aggregating individual data into groups so that a frequency distribution of these groups serves as a convenient means of summarizing or analyzing the data.
groupfrequency
10-205
20-3010
30-4015
This grouped data you can enter:
10-20 20-30 30-40
f: 5 10 15Similar to a frequency table, but instead f: type cf: in the second line. For example:

10 20 30 40 50 60 70 80
cf: 5 13 20 32 60 80 90 100

The cumulative frequency is calculated by adding each frequency from a frequency distribution table to the sum of its predecessors. The last value will always be equal to the total for all observations since all frequencies will already have been added to the previous total.

  • Harmonic HM example
    Find the harmonic mean of 4 and 8.
  • Median and modus
    Radka made 50 throws with a dice. The table saw fit individual dice's wall frequency: Wall Number: 1 2 3 4 5 6 frequency: 8 7 5 11 6 13 Calculate the modus and median of the wall numbers that Radka fell.
  • 75th percentile (quartille Q3)
    Find 75th percentile for 30,42,42,46,46,46,50,50,54
  • The data
    The data set represents the number of cars in a town given a speeding ticket each day for ten days. 2 4 5 5 7 7 8 8 8 12 What is the IQR?
  • Increase the mean
    To which number should the number 4 be changed between the numbers 4,5,7, 1,0,9,7,8, -3,5 to increase these numbers' arithmetic mean by 1.25?
  • The size 2
    The size of pants sold during one business day in a department store are 32, 38, 34, 42, 36, 34, 40, 44, 32, and 34. Find the average size of the pants sold.
  • 45 percentile
    Given the following data 11 15 24 33 10 35 23 25 40 What is P45?
  • Average
    If the average(arithmetic mean) of three numbers x,y,z is 50. What is the average of there numbers (3x +10), (3y +10), (3z+10) ?
  • Dataset:
    Dataset: 35 22 18 54 22 46 28 31 43 22 14 17 25 19 33 14. 1 Group the data into a grouped distribution using 6 classes of equal width. 2. Determine the mean, median, and mode using the raw data. 3. Draw an Ogive curve corresponding to the data and use it
  • Centimeters 66264
    The average height of the basketball team's primary five players is 196 cm. The underscore player is 216 cm tall. What is the average height of the remaining four players in centimeters?
  • An insurance
    An insurance company receives on average two claims per week from a particular factory assuming that the number of claims can be modeled by a Poisson distribution, find the probability that it receives three claims in a given week, more than four claims i

more math problems »

Última postagem

Tag